Gene Francci3_2116 details


Gene Information

Locus tag: Francci3_2116
Symbol:
ID: 3905643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2482443
End bp: 2483465
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 73%
IMG OID: 637879451
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_481217
Protein GI: 86740817
COG category: [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism
COG ID: [COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components
TIGRFAM ID:
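
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(2482443, 2483465)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1023 340
```

Both results agree with the Gene Length and Protein Length fields of the record.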


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.112325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.169867
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGAG GCGACATGCA TGTCCTCGAG CAGGCGCCGG CGACGGAGGA TCGCGGCCCC 
GCCCCCGAAC CCGCGCGCCG CCGGACCGTC CTGCGACACC TGCTGCGACA CCTGTTCTCC
GGTGCCGCGG TACTGCTGGG CGCCGCCACG CTCACCTTCA CCGCGCTGCA GCTGGCCCCG
GGCGATCCGG TGGCGGTCCT GCTAGGGCCG GGAACGTCCG CCTCGCCCCA GGTACGGGCG
GAAATCCAGG CAGAGTACGG TCTCGGCGAG CCGGCTCCGC TGAGGTACGT CCACTATCTC
GGCCATCTCG CCCGCGGCGA CCTGGGTACC TCGTATCAAC TCCAGCAGCC GGTCAGCGAG
GTGATCGTGG ATCAGCTGCG ACCCACGGCC GAACTCGCGG CCGCGGCGCT GGTGCTCGCG
GTCCTGACCG GGGTGGCCAT CGCCGTCGCG ACCGCGGGCC GCCGACCGGG ACTGCGCGCC
GCGGCGATGG CCTGGGAGTC ACTGGCCCTG TCCGTGCCCT CGTTCTGGGT GGGCATCGTG
CTGGTGAGCG TGTTCTCGTT CCAGTTGCGG ATCTTCCCCG GAGCGGGGGC CCAGGGCGCG
GCCAGCCTTG TGTTGCCGTC GGTGACCCTC GCCATGCCGG CCGCCGGCGC GCTTTCCCGG
ACGCTGCGGG AGGGACTGGA AGCCGCCCTC GCGCAGCCGT TCGCCGCCGC CGCTCGGGCC
CGCGGCCTGA GCCCCACCGG CGTCACAATG CGCCACGCCC TGCGCCACGC CGCCGCAAGC
GCGCTCAACC TCGGCGGCTG GCTCGCCGGG ACGCTGCTCG CCGGCACCGT CCTCGTCGAG
ACGGTCTTCG CCCGCCCGGG GCTCGGCGCG CTGACCGTGC ACGCCGTCAT CGACCGGGAC
ATGCCCGTGG TGATGGGGGT CGTCCTCACA TCGGCCCTGG TCTCGGCCGT CGTCTTCACC
GTCGTGGATC TGCTGCAACG CGTCCTCGAC CCGCGGCTTC GGATCGAGGT GATGGACCGG
TGA
 
Protein sequence
MSGGDMHVLE QAPATEDRGP APEPARRRTV LRHLLRHLFS GAAVLLGAAT LTFTALQLAP 
GDPVAVLLGP GTSASPQVRA EIQAEYGLGE PAPLRYVHYL GHLARGDLGT SYQLQQPVSE
VIVDQLRPTA ELAAAALVLA VLTGVAIAVA TAGRRPGLRA AAMAWESLAL SVPSFWVGIV
LVSVFSFQLR IFPGAGAQGA ASLVLPSVTL AMPAAGALSR TLREGLEAAL AQPFAAAARA
RGLSPTGVTM RHALRHAAAS ALNLGGWLAG TLLAGTVLVE TVFARPGLGA LTVHAVIDRD
MPVVMGVVLT SALVSAVVFT VVDLLQRVLD PRLRIEVMDR
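
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal Python sketch of that cleanup together with a GC-content calculation. The helper names are hypothetical, and only the first printed line of the gene sequence is used as a sample, so no value is claimed here for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence, used only as a short sample;
# paste the full block to compute the value for the whole gene.
sample = clean_sequence(
    "ATGTCTGGAG GCGACATGCA TGTCCTCGAG CAGGCGCCGG CGACGGAGGA TCGCGGCCCC"
)
print(len(sample), sample.startswith("ATG"), round(gc_percent(sample), 1))
```

Running the same two functions on the full gene sequence block gives a figure that can be compared with the GC content field of the record.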