Gene Information |
Locus tag | Mext_0462 |
Symbol | |
ID | 5831167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 502108 |
End bp | 503439 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641366245 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_001637954 |
Protein GI | 163849911 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGATT GGCTAAGCCG GCTCTACGTT CAAGTTTTCA TTGCTATCAT TATCGGTGGT CTGGTCGGGT ATTTTCTGCC TCATATCGGC GTCACGCTTC AGCCGCTGGC CGACGGCTTT ATCAAGCTAA TCAAGATGCT GCTGGCGCCG GTCATCTTCG GCACGATCGT CGTCGGCATC GCCAAGATGG GCAACATGAA GGAGGTCGGA CGCATTGGCG TGCGGGCGCT GATCTACTTC GAGGTGGTCT CAACGCTCGC CCTGATCATC GGCCTCGTGG TCGTCAACGT CATGCAGCCT GGCGCCGGCA TGAACATCGA CGCCACGCAC ATGGATAGCA GCGCCATCGC CGGCTATGCC AAGAGGGCCG AGGCGCAGCC TGGCATAATA GGCTTCCTGA TGGACATCAT CCCCTCGACG ATGGTGGATG CCTTCGCCAA GGGCGCCATG CTGCAGATCA TCCTGATCTC GCTGCTGTTC GGCCTCGCTT TTGTCCAGGC TGGCGAGCGC GGCAAGCCAG TGGTCGTCGT CATCGACAGC CTGCTCGACG CGCTGTTCCG CATCGTCGGC ATGGTGATGC GACTCGCCCC GATCGGCGCT GGCGCCGGCG TTGCCTTTAC CATCGGCAAG TACGGCTTCG GCACGGTCTG GTCGCTCGCC TACCTGATGC TCGGCGTCTA CGCGACCTCG ATCATGTTCG TCGGCCTCGT GCTCGGCGCG GTCTGCACTT GGACCGGGTT CTCCCTCATC AAGGTTCTGA GCTACTTCAA GGACGAGATC CTGATCACCT TCGGGACCTG TTCCACCGAG GCTGTCATGC CGCGCATGAT GGCCAAGCTG GAGCGGCTGG GCTGCGAGAA GTCTGTTGTC GGCCTCGTCC TGCCGACGGG CTACACCTTC AACGCGGACG GCACCTGCAT CTACCTCACC ATGGCGGCAA TCTTCGTCGC TCAGGCCACC AACACGCCCC TTGGTTTCGG CGACCAGCTC GTCGTGCTCG GCGTGCTGCT GCTGACCTCG AAGGGCTCAG CCGGTGTGGC CGGGGCCGGC TTCGTTACCC TCGCCGCCAC TCTGTCGAGC ATAAACACCG TCCCAGTCGC CGGCCTCGTG CTGCTGCTCG GCGTCGACCG CTTCATGAAT GAGGCGCGGG CCGTGACGAA CCTGATCGGC AACGCCGTCG CCACCATCGC CGTCGCGAAG TGGGAGGGCG CCTTCGACCA TGCCAAGGCC GAGGACGCCT ACCGCAACCG CGCTGCCGTC GCCGCCGAAG AGGTCATCCC TGAAACGACA GCTGGTCACG CCCTGCCCCG TCCCGTGGCC GTTGCCCAGT GA
|
Protein sequence | MKDWLSRLYV QVFIAIIIGG LVGYFLPHIG VTLQPLADGF IKLIKMLLAP VIFGTIVVGI AKMGNMKEVG RIGVRALIYF EVVSTLALII GLVVVNVMQP GAGMNIDATH MDSSAIAGYA KRAEAQPGII GFLMDIIPST MVDAFAKGAM LQIILISLLF GLAFVQAGER GKPVVVVIDS LLDALFRIVG MVMRLAPIGA GAGVAFTIGK YGFGTVWSLA YLMLGVYATS IMFVGLVLGA VCTWTGFSLI KVLSYFKDEI LITFGTCSTE AVMPRMMAKL ERLGCEKSVV GLVLPTGYTF NADGTCIYLT MAAIFVAQAT NTPLGFGDQL VVLGVLLLTS KGSAGVAGAG FVTLAATLSS INTVPVAGLV LLLGVDRFMN EARAVTNLIG NAVATIAVAK WEGAFDHAKA EDAYRNRAAV AAEEVIPETT AGHALPRPVA VAQ
|
| |
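The Start bp, End bp, Gene Length, and Protein Length fields in this record are related by simple arithmetic, and the following minimal sketch shows one way to cross-check them. It assumes that the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions agree with the values listed here but are not stated by the record itself. The function names (`span_length`, `expected_protein_length`, `gc_fraction`) are hypothetical helpers written for this illustration, not part of any database tool.

```python
# Values copied from the record above.
START_BP = 502108
END_BP = 503439
GENE_LENGTH_BP = 1332
PROTEIN_LENGTH_AA = 443


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Both checks pass for the values in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates and lengths are consistent")
```

To compare against the GC content field, `gc_fraction` can be called on the full "Gene sequence" value pasted in as a string; the spaces that separate the 10-base groups are stripped before counting.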