Gene Franean1_4413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4413 
Symbol 
ID5672765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5269209 
End bp5270615 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content63% 
IMG OID641243281 
Productsodium:dicarboxylate symporter 
Protein accessionYP_001508698 
Protein GI158316190 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGT TATCGGCCCC GTCGCGGACG GGATCCGCAC GCCGACGCTG GTACCGGCAG 
CTTTACGTCT GGGTGCTGGC GGGGATCGTC GCCGGAATCT TAGTGGGCCA CTTCTCCCCA
GGAGTCGGTG TCGACCTTCA GCCACTCGGC ACTACTTTCG TCGACGCCAT CAAAATGATC
ATCGCTCCCA TCGTGTTCTG CACAGTGGTC GGCGGCATCG CGCAGGTCGA CAACCTGCGC
AAGGTAGGCC GGGTGGGGCT GAAGGCGTTC ACCTACTTTG AGCTCGTCAC CACCGCCGCG
CTGGTGCTCG GGCTCATCGT GATGAACGTG CTGCGTCCCG GTGACGGCGT CAACGCCGAC
CCGGACACGC TGTCGGTCAA TGAGACCGTC GGCGGGTACA TCGCACAGGG CGAGTCAAAG
GGCTGGGGCG ATCACCTGGC AGACGTTGTA CCGGATAGCG TGGTCGGCGC GTTCGCCGAG
GGCAAGGTGC TCCAGGTACT GTTCTTCGCC GTACTATTCG GCATCGCGCT GAACCTCACC
GGCAAGCAAG GTGCCGCAAT CGCCGGCGGG ATAGAGCGGG TCGGTCGCAC CATGTTCCAG
GTACTCCGGT TCGTCATGTA CGCGGCACCA GTCGGCGCGT TCGGCGCCAT GGCCTTCACC
ATAGGCAAGT ACGGAATCGA CACCCTAACC AGCCTCGGAA AGCTTGTCGC GGTCTTCTAC
GGCACGTCGT TGTTCTTCGT CGTCGTTGTG CTCGGCGCGA TTGCTGCAAC CATCGGCGTA
AACATCTTCA AGCTGCTGCG ATACATCCGA GAAGAACTCC TGATCGTTCT TGGGACATCG
TCCTCCGAAT CCGTCCTTCC GCAAATCATG ACAAAACTGG AGAGACTCGG GGCGCCACGC
CAGGTCGTGG GTCTCACAGT TCCTACCGGA TACTCTTTCA ACCTGGACGG GACCTGCATC
TACCTGACGC TCGCCAGCCT GTACCTCGCC CAGGCGGTCG GCGTCGACCT CTCCCTCGGC
GAACAGCTCA CCATCATCGG TGTGCTGCTA CTGACCTCCA AAGGCGCTGC CGGGGTCACC
GGCTCCGGCT TCATCGTGCT CGCGGCGACG CTGTCGACCG TCGGGACAAT TCCGGTCGCC
GCCATCATGC TGATCTTCGG GGTCGACAAG TTTATGTCGG AGTGCCGGGC GCTCACCAAC
GTCTGTGGCA ACACCCTCGC CACCCTCGTC GTCGCGAACT GGGAAGGCGT CCTCGACAAG
GAGCAGATGC GAAAGGCGCT GAACGCGGGT CCGGACTACA CACCGGACGT CACCGACAGG
CTCGACGTGC CGGAGACGAT CGAGACCGGG GAGCTACTTG AGGCACCGGC GCAGGCCCAA
CCCACTCTGA TCGGCGGTAA CCGCTAG
 
Protein sequence
MTTLSAPSRT GSARRRWYRQ LYVWVLAGIV AGILVGHFSP GVGVDLQPLG TTFVDAIKMI 
IAPIVFCTVV GGIAQVDNLR KVGRVGLKAF TYFELVTTAA LVLGLIVMNV LRPGDGVNAD
PDTLSVNETV GGYIAQGESK GWGDHLADVV PDSVVGAFAE GKVLQVLFFA VLFGIALNLT
GKQGAAIAGG IERVGRTMFQ VLRFVMYAAP VGAFGAMAFT IGKYGIDTLT SLGKLVAVFY
GTSLFFVVVV LGAIAATIGV NIFKLLRYIR EELLIVLGTS SSESVLPQIM TKLERLGAPR
QVVGLTVPTG YSFNLDGTCI YLTLASLYLA QAVGVDLSLG EQLTIIGVLL LTSKGAAGVT
GSGFIVLAAT LSTVGTIPVA AIMLIFGVDK FMSECRALTN VCGNTLATLV VANWEGVLDK
EQMRKALNAG PDYTPDVTDR LDVPETIETG ELLEAPAQAQ PTLIGGNR