Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4413 |
Symbol | |
ID | 5672765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5269209 |
End bp | 5270615 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641243281 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_001508698 |
Protein GI | 158316190 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGT TATCGGCCCC GTCGCGGACG GGATCCGCAC GCCGACGCTG GTACCGGCAG CTTTACGTCT GGGTGCTGGC GGGGATCGTC GCCGGAATCT TAGTGGGCCA CTTCTCCCCA GGAGTCGGTG TCGACCTTCA GCCACTCGGC ACTACTTTCG TCGACGCCAT CAAAATGATC ATCGCTCCCA TCGTGTTCTG CACAGTGGTC GGCGGCATCG CGCAGGTCGA CAACCTGCGC AAGGTAGGCC GGGTGGGGCT GAAGGCGTTC ACCTACTTTG AGCTCGTCAC CACCGCCGCG CTGGTGCTCG GGCTCATCGT GATGAACGTG CTGCGTCCCG GTGACGGCGT CAACGCCGAC CCGGACACGC TGTCGGTCAA TGAGACCGTC GGCGGGTACA TCGCACAGGG CGAGTCAAAG GGCTGGGGCG ATCACCTGGC AGACGTTGTA CCGGATAGCG TGGTCGGCGC GTTCGCCGAG GGCAAGGTGC TCCAGGTACT GTTCTTCGCC GTACTATTCG GCATCGCGCT GAACCTCACC GGCAAGCAAG GTGCCGCAAT CGCCGGCGGG ATAGAGCGGG TCGGTCGCAC CATGTTCCAG GTACTCCGGT TCGTCATGTA CGCGGCACCA GTCGGCGCGT TCGGCGCCAT GGCCTTCACC ATAGGCAAGT ACGGAATCGA CACCCTAACC AGCCTCGGAA AGCTTGTCGC GGTCTTCTAC GGCACGTCGT TGTTCTTCGT CGTCGTTGTG CTCGGCGCGA TTGCTGCAAC CATCGGCGTA AACATCTTCA AGCTGCTGCG ATACATCCGA GAAGAACTCC TGATCGTTCT TGGGACATCG TCCTCCGAAT CCGTCCTTCC GCAAATCATG ACAAAACTGG AGAGACTCGG GGCGCCACGC CAGGTCGTGG GTCTCACAGT TCCTACCGGA TACTCTTTCA ACCTGGACGG GACCTGCATC TACCTGACGC TCGCCAGCCT GTACCTCGCC CAGGCGGTCG GCGTCGACCT CTCCCTCGGC GAACAGCTCA CCATCATCGG TGTGCTGCTA CTGACCTCCA AAGGCGCTGC CGGGGTCACC GGCTCCGGCT TCATCGTGCT CGCGGCGACG CTGTCGACCG TCGGGACAAT TCCGGTCGCC GCCATCATGC TGATCTTCGG GGTCGACAAG TTTATGTCGG AGTGCCGGGC GCTCACCAAC GTCTGTGGCA ACACCCTCGC CACCCTCGTC GTCGCGAACT GGGAAGGCGT CCTCGACAAG GAGCAGATGC GAAAGGCGCT GAACGCGGGT CCGGACTACA CACCGGACGT CACCGACAGG CTCGACGTGC CGGAGACGAT CGAGACCGGG GAGCTACTTG AGGCACCGGC GCAGGCCCAA CCCACTCTGA TCGGCGGTAA CCGCTAG
|
Protein sequence | MTTLSAPSRT GSARRRWYRQ LYVWVLAGIV AGILVGHFSP GVGVDLQPLG TTFVDAIKMI IAPIVFCTVV GGIAQVDNLR KVGRVGLKAF TYFELVTTAA LVLGLIVMNV LRPGDGVNAD PDTLSVNETV GGYIAQGESK GWGDHLADVV PDSVVGAFAE GKVLQVLFFA VLFGIALNLT GKQGAAIAGG IERVGRTMFQ VLRFVMYAAP VGAFGAMAFT IGKYGIDTLT SLGKLVAVFY GTSLFFVVVV LGAIAATIGV NIFKLLRYIR EELLIVLGTS SSESVLPQIM TKLERLGAPR QVVGLTVPTG YSFNLDGTCI YLTLASLYLA QAVGVDLSLG EQLTIIGVLL LTSKGAAGVT GSGFIVLAAT LSTVGTIPVA AIMLIFGVDK FMSECRALTN VCGNTLATLV VANWEGVLDK EQMRKALNAG PDYTPDVTDR LDVPETIETG ELLEAPAQAQ PTLIGGNR
|
| |