Gene Information |
Locus tag | Mext_3059 |
Symbol | |
ID | 5835386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 3400440 |
End bp | 3401783 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641368859 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_001640519 |
Protein GI | 163852476 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCG TGCCGTCCCC TTTGACCGCG CCCCACGCGC CGCCCGCACC GAAGCCCCTC TATCGCACGC TGTACTTCCA GGTGCTGGTT GCCGTCGCCA TCGGCATTGC CCTCGGGCAT TTCTGCCCGA AGCTCGGCGC CGACATGAAG CCGCTCGGCG ACGCCTTCAT CAAGCTCGTC AAGATGATCA TCGCGCCGGT GATCTTCCTC ACCGTCGTCT CCGGCATCGC CGGCATGACC AATCTCGAGA AGGTCGGCCG CGTCGGCGGC AAGGCGCTCA TCTACTTCAT CACCTTCTCG ACGCTGGCCC TGATCGTCGG CCTCGTCGTC GCCAACGTGC TCCAACCCGG CCACGGCCTG CATATCGACC CGAACTCGCT CGATCCGAAG GCGGTCGCGA CCTATGCCGG CAAGGCCAAG GAGCAGAACA TCGCCGACTT CCTGATGAAC ATCATCCCGA CGACGGCCGT CGGTGCGTTC GCGGGCGGTG AGATCCTCCA GGTGCTGTTC TTCTCGGTGC TGTTCGGCTT CGGCCTCGCC TTCCTCGGCG AGCGCGGCAA GCCGGTGCTC GACATCATCA AGGTGATGTC GGAGGCGATC TTCGGCGTCG TCAACATCAT CATGAAGGTC GCCCCCATCG GTGCCTTCGG CGCGATGGCC TTCACCATCG GCAAGTACGG AATCTCCTCG CTCGCCAACC TCGCCTACCT CGTCGGCGCC TTCTACCTGA CCTCGGCGAT CTTCGTGCTC GGCGTGCTCG GCGCGGTCGC CCGCTACAAC GGCTTCTCCA TCCTCAAGCT CATCCGCTAC ATCAAGGAAG AGCTGATGCT GGTGCTCGGC ACCTCCTCCT CGGAGTCGGC CCTGCCCTCG CTCATCGACA AGATGGAGAA GGCTGGCTGC TCGCGCCCCG TCGTCGGCCT CGTGGTCCCG ACCGGTTACT CGTTCAACCT CGACGGCACC AACATCTACA TGACGATGGC GGCGCTCTTC ATCGCCCAGG CCACCGACAC CCCGATCACC TATGGCGAGC AGATCCTGCT GCTGCTGGTG GCCATGCTCT CCTCGAAGGG CGCTGCGGGC GTGACCGGCT CGGGCTTCAT CACCTTGGCC GCGACGCTCG CCGTCGTCCC CTCCGTGCCG GTCGTCGGCA TGGCGCTGAT CCTCGGCATC GACCGCTTCA TGTCGGAGTG CCGCGCCCTC ACCAACTTCA TCGGCAACGC GGTGGCCTGC ATCGTCGTCG CCCGCTGGGA AGGTGAGGTC GACGAGGCCA AGCTTCACGC CGCGCTGGGT GGCAAGCCGG TCGCCGCGGC GACCCCTGCC CCGGTTCTCC AGCCGGCTGA GTGA
|
Protein sequence | MATVPSPLTA PHAPPAPKPL YRTLYFQVLV AVAIGIALGH FCPKLGADMK PLGDAFIKLV KMIIAPVIFL TVVSGIAGMT NLEKVGRVGG KALIYFITFS TLALIVGLVV ANVLQPGHGL HIDPNSLDPK AVATYAGKAK EQNIADFLMN IIPTTAVGAF AGGEILQVLF FSVLFGFGLA FLGERGKPVL DIIKVMSEAI FGVVNIIMKV APIGAFGAMA FTIGKYGISS LANLAYLVGA FYLTSAIFVL GVLGAVARYN GFSILKLIRY IKEELMLVLG TSSSESALPS LIDKMEKAGC SRPVVGLVVP TGYSFNLDGT NIYMTMAALF IAQATDTPIT YGEQILLLLV AMLSSKGAAG VTGSGFITLA ATLAVVPSVP VVGMALILGI DRFMSECRAL TNFIGNAVAC IVVARWEGEV DEAKLHAALG GKPVAAATPA PVLQPAE
|
| |
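The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon (the listed gene sequence ends in TGA); both assumptions match the listed values but are not stated by the record itself. The function names are hypothetical helpers introduced for illustration, and `gc_fraction` simply counts G and C among the A/C/G/T characters of whatever sequence string it is given, ignoring the spaces used to group bases in blocks of ten.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C among the A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record: Start bp, End bp, Gene Length, Protein Length.
gene_length = span_length(3400440, 3401783)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the listed GC content of 66%.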