Gene Mext_3956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3956 
Symbol 
ID5835641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4393838 
End bp4395154 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content68% 
IMG OID641369747 
ProductC4-dicarboxylate transporter DctA 
Protein accessionYP_001641398 
Protein GI163853355 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.205542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATG CCGCCACCCT GTCGACCGAG CGGGCGCCGC TCCACAAGTC CCTGTTCGTG 
CAGGTGATCG CCGGGCTTCT GGCCGGCATC CTCGTCGGCG CTCTGGCGCC CGGCTTCGCC
GCAGAGCTGA AGATCCTGAG CGACGCCTTC CTCCGGCTGA TCGCGATGAT CGTGGCGCCG
ATCGTGTTCT GCGTCGTCGT GCACGGCATT GCCGGGGCGG GCGACCTCGG GAAGGTCGGG
CGGGTCGGCG TGAAGGCGCT GATCTATTTC GAGGTGATGA CGAGCCTGGC CCTCCTGCTC
GGGCTTGGGC TCGCCTACCT CGTCGGGCCG GGCCACGGCA TGAACATCGA TGTCGCAAGC
CTCGATGCCG GCGCCCTCGG CGGCTACGCC GATTCCGCGC AGAAGCTGCA GGGCGGCGGC
ATCGCGCATT TCCTCCTCGC CATCATCCCT AAGACCGCCT TCGACGCGTT CGCCCGCAAC
GACGTGCTGC AGGTCCTGTT CTTCGCCGTT CTGTTCGGGG TGAGCCTCGC CCTGGTCGGC
GGCGAGAAGG CCAGGGCGGT CTCGGGCCTG ATCGACGCTC TCTCCACGGT GCTGTTCAAG
GCCATGGGGC TGATCGTGCG GGTGGCCCCG CTCGGGGTGT TCGGCGCCGT CGCCTACACG
GTCGGACGCT ACGGCATCGG CTCGCTGGCG CAGCTCCTCT CGCTCGTGGC CCTGTTCTAC
CTCGCGGTGG CTTTGTTCGT GTTCGTGATC CTCGGCGCGG TGATGCGGCT CGCGGGCCTC
AGCCTCGTAA AGCTCCTGAT CTATCTGCGC GAGGAGCTGA CCATCGTGCT CGGCACCTCC
TCCTCCGACG CGGTGCTGCC GCAGATCATG CGCAAGCTCG TGCATCTCGG GGTGAAGGAT
TCCACCGTCG GCCTCGTGGT GCCCACGGGC TATTCCTTCA ACCTCGATGC CTTCTCGATC
TATCTGACGC TTGCCGTCGT CTTCATCGCG CAGGCGACCA ACACGCCGCT CTCCTTCAGC
GACCTGATGC TGGTGCTCGG CGTCTCGCTG GTCACCTCGA AGGGCGCCCA CGGCGTGCCC
GGCTCGGCCA TCGTGATCCT GGCCGCGACC CTGAACGCCG TCCCCTCGAT TCCCGCGATC
GGCCTCGTGC TGGTGCTCTC GGTCGATTGG TTCGTCGGCA TCGCCCGGTC GCTGGGCAAC
CTGATCGGCA ATTGCGTCGC CACCGTTGTC GTCGCCGCCT GGGAGGGCGA CCTCGACCGG
GAGCGTGCCG TGCGGGTGCT CGACGGCCGG GAGAGCCTGG AGCCCACCGC CGGTTAG
 
Protein sequence
MSHAATLSTE RAPLHKSLFV QVIAGLLAGI LVGALAPGFA AELKILSDAF LRLIAMIVAP 
IVFCVVVHGI AGAGDLGKVG RVGVKALIYF EVMTSLALLL GLGLAYLVGP GHGMNIDVAS
LDAGALGGYA DSAQKLQGGG IAHFLLAIIP KTAFDAFARN DVLQVLFFAV LFGVSLALVG
GEKARAVSGL IDALSTVLFK AMGLIVRVAP LGVFGAVAYT VGRYGIGSLA QLLSLVALFY
LAVALFVFVI LGAVMRLAGL SLVKLLIYLR EELTIVLGTS SSDAVLPQIM RKLVHLGVKD
STVGLVVPTG YSFNLDAFSI YLTLAVVFIA QATNTPLSFS DLMLVLGVSL VTSKGAHGVP
GSAIVILAAT LNAVPSIPAI GLVLVLSVDW FVGIARSLGN LIGNCVATVV VAAWEGDLDR
ERAVRVLDGR ESLEPTAG