Gene Mext_4600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4600 
Symbol 
ID5834210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5140306 
End bp5142009 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content71% 
IMG OID641370394 
Productthiamine pyrophosphate protein 
Protein accessionYP_001642039 
Protein GI163853996 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0879686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAGA GCCCAGACGC CGCCCTCCCC GCCCGAAGCC GCGGGCGGAA CGCCGCCCAG 
GCGCTCGTGG ACCAACTCGC GGCCAACGGC GTCACCCACG TCTTCGCAGT GCCGGGCGAG
AGCTACCTGC CGGTGCTCGA TGCGCTCTAC GAATCGGGCA TCGCCCTCAC CGTCTGCCGC
CAGGAGGGCG GCGCGGCGAT GATGGCGGAA GCGCACGGCA AGGCGACGGG GCGGCCCGGC
ATCTGCTTTG TCACCCGCGG TCCCGGCGCC ACCAACGCCT CGGCCGGCAT TCACATCGCC
CAGCAGGATT CAACGCCGAT GATCCTGTTC GTCGGCCAGA TCGAGCGGGG TCTGCGCGAC
CGCGAGGCGT GGCAGGAGGT CGATTACCGC GCCGCCTTCG GGCCGATCGC GAAATGGGCC
ACCGAGATCG AGACCGGCGC GCGGATGCCG GAATACGTCT CGCGGGCCTT CCACACCGCC
ACCGGCGGCC GGCCGGGCCC GGTGGTGGTG GCCCTGCCGA AAGACATGCT GAAGGACGCC
GCGGAAGGGC CGCTGGCCCC GCCCTTCCAA GCCGTCGAGG CCGCGCCCGG CGCGGAGGAT
CTCGCGTCCC TCGCCGCCCT GCTGGCGGAG GCCAAAAGCC CCTTCCTGGT GCTCGGCGGC
AGCCGCTGGA CCGAGCAGGC CTATGCCGAT ATCCGCCGCT TCACTGAGGC CTTCGATCTG
CCGGTGGCCA CGAGCTACCG CCGCCTGCCG CTGTTCGATC CGCTGCATCC GAACTACGCA
GGCGATCTCG GGCTTGCCGC CAACCCGAAG CTGGTGGCGC GGGCCAAGGC CGCCGACCTG
ATGATCGTGC TCGGCGGCCG CCTCGGCGAG GTCGCGAGCC AGACCTATTC GCTCCTCGAC
ATCCCGGCCC CCCGCACCCG CCTCGTCCAC ATCCATCCCG GAGCGGAGGA ACTCGGCCGG
GTCTACGTTC CGCATCTCGG CATCACCGCC GCGCCGGCCC GGATGGCGGC GGCTCTGGCG
CGGCTCGATG CGCCGGCCTC CGTCCCGTGG GCGGCCGAGA CCCGCGCGGC CCATGACGCA
TATCTGGCGT GGTCGCAGAC CCCGACGCCG CAGCCCGGCC CGGTCAATCT CGGGCAGGTG
ATGGTGCATT TGCGCGAGGC GCTGCCGGAG GACGCGATCC TGTGCAACGG CGCGGGCAAC
TACGCCGCCT GGATCCACCG CTTCTACCGC TTCCGCCGCC TTGCCACCCA CATGGCGCCG
ACCTCCGGCT CGATGGGCTA CGGCGTGCCG GCGGCTGTGG CGATGAAGCG GATCTTTCCC
GACCGCACGG TGATCTCGAT CAACGGCGAC GGCGACTTCC TGATGAACGG CCAGGAATTC
GCGACCGCCG TGCAGTACGG CCTGAACATC GTCTGCATCG TCGCCGACAA TGCGAGCTAC
GGCACGATCC GCATGCATCA GGAACGCGAT TTTCCGGGCC GTGTCCTCGC CACCGACCTC
GTGAACCCGG ACTTTGCCGC CTATGCTCGC GCCTTCGGCG GCGTCGGCTT CACCGTGGAG
CGGACCGAGG ATTTTCCGGC GGCGTTGGAG GAGGCCCTGG CGGCGCGGCG CCCGGCGATC
ATCCACGTGA AGTTCTCGGT CGATGCGATC ACGCCGGGCC TGAGCCTCAC GGCGATCCGC
GAGAAGGCGC TGGCGGGCGA TTGA
 
Protein sequence
MSQSPDAALP ARSRGRNAAQ ALVDQLAANG VTHVFAVPGE SYLPVLDALY ESGIALTVCR 
QEGGAAMMAE AHGKATGRPG ICFVTRGPGA TNASAGIHIA QQDSTPMILF VGQIERGLRD
REAWQEVDYR AAFGPIAKWA TEIETGARMP EYVSRAFHTA TGGRPGPVVV ALPKDMLKDA
AEGPLAPPFQ AVEAAPGAED LASLAALLAE AKSPFLVLGG SRWTEQAYAD IRRFTEAFDL
PVATSYRRLP LFDPLHPNYA GDLGLAANPK LVARAKAADL MIVLGGRLGE VASQTYSLLD
IPAPRTRLVH IHPGAEELGR VYVPHLGITA APARMAAALA RLDAPASVPW AAETRAAHDA
YLAWSQTPTP QPGPVNLGQV MVHLREALPE DAILCNGAGN YAAWIHRFYR FRRLATHMAP
TSGSMGYGVP AAVAMKRIFP DRTVISINGD GDFLMNGQEF ATAVQYGLNI VCIVADNASY
GTIRMHQERD FPGRVLATDL VNPDFAAYAR AFGGVGFTVE RTEDFPAALE EALAARRPAI
IHVKFSVDAI TPGLSLTAIR EKALAGD