Gene Mext_3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3059 
Symbol 
ID5835386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3400440 
End bp3401783 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content66% 
IMG OID641368859 
Productsodium:dicarboxylate symporter 
Protein accessionYP_001640519 
Protein GI163852476 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCG TGCCGTCCCC TTTGACCGCG CCCCACGCGC CGCCCGCACC GAAGCCCCTC 
TATCGCACGC TGTACTTCCA GGTGCTGGTT GCCGTCGCCA TCGGCATTGC CCTCGGGCAT
TTCTGCCCGA AGCTCGGCGC CGACATGAAG CCGCTCGGCG ACGCCTTCAT CAAGCTCGTC
AAGATGATCA TCGCGCCGGT GATCTTCCTC ACCGTCGTCT CCGGCATCGC CGGCATGACC
AATCTCGAGA AGGTCGGCCG CGTCGGCGGC AAGGCGCTCA TCTACTTCAT CACCTTCTCG
ACGCTGGCCC TGATCGTCGG CCTCGTCGTC GCCAACGTGC TCCAACCCGG CCACGGCCTG
CATATCGACC CGAACTCGCT CGATCCGAAG GCGGTCGCGA CCTATGCCGG CAAGGCCAAG
GAGCAGAACA TCGCCGACTT CCTGATGAAC ATCATCCCGA CGACGGCCGT CGGTGCGTTC
GCGGGCGGTG AGATCCTCCA GGTGCTGTTC TTCTCGGTGC TGTTCGGCTT CGGCCTCGCC
TTCCTCGGCG AGCGCGGCAA GCCGGTGCTC GACATCATCA AGGTGATGTC GGAGGCGATC
TTCGGCGTCG TCAACATCAT CATGAAGGTC GCCCCCATCG GTGCCTTCGG CGCGATGGCC
TTCACCATCG GCAAGTACGG AATCTCCTCG CTCGCCAACC TCGCCTACCT CGTCGGCGCC
TTCTACCTGA CCTCGGCGAT CTTCGTGCTC GGCGTGCTCG GCGCGGTCGC CCGCTACAAC
GGCTTCTCCA TCCTCAAGCT CATCCGCTAC ATCAAGGAAG AGCTGATGCT GGTGCTCGGC
ACCTCCTCCT CGGAGTCGGC CCTGCCCTCG CTCATCGACA AGATGGAGAA GGCTGGCTGC
TCGCGCCCCG TCGTCGGCCT CGTGGTCCCG ACCGGTTACT CGTTCAACCT CGACGGCACC
AACATCTACA TGACGATGGC GGCGCTCTTC ATCGCCCAGG CCACCGACAC CCCGATCACC
TATGGCGAGC AGATCCTGCT GCTGCTGGTG GCCATGCTCT CCTCGAAGGG CGCTGCGGGC
GTGACCGGCT CGGGCTTCAT CACCTTGGCC GCGACGCTCG CCGTCGTCCC CTCCGTGCCG
GTCGTCGGCA TGGCGCTGAT CCTCGGCATC GACCGCTTCA TGTCGGAGTG CCGCGCCCTC
ACCAACTTCA TCGGCAACGC GGTGGCCTGC ATCGTCGTCG CCCGCTGGGA AGGTGAGGTC
GACGAGGCCA AGCTTCACGC CGCGCTGGGT GGCAAGCCGG TCGCCGCGGC GACCCCTGCC
CCGGTTCTCC AGCCGGCTGA GTGA
 
Protein sequence
MATVPSPLTA PHAPPAPKPL YRTLYFQVLV AVAIGIALGH FCPKLGADMK PLGDAFIKLV 
KMIIAPVIFL TVVSGIAGMT NLEKVGRVGG KALIYFITFS TLALIVGLVV ANVLQPGHGL
HIDPNSLDPK AVATYAGKAK EQNIADFLMN IIPTTAVGAF AGGEILQVLF FSVLFGFGLA
FLGERGKPVL DIIKVMSEAI FGVVNIIMKV APIGAFGAMA FTIGKYGISS LANLAYLVGA
FYLTSAIFVL GVLGAVARYN GFSILKLIRY IKEELMLVLG TSSSESALPS LIDKMEKAGC
SRPVVGLVVP TGYSFNLDGT NIYMTMAALF IAQATDTPIT YGEQILLLLV AMLSSKGAAG
VTGSGFITLA ATLAVVPSVP VVGMALILGI DRFMSECRAL TNFIGNAVAC IVVARWEGEV
DEAKLHAALG GKPVAAATPA PVLQPAE