Gene Mext_3121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3121 
Symbol 
ID5835393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3472417 
End bp3473457 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content69% 
IMG OID641368921 
Productthreonine aldolase 
Protein accessionYP_001640580 
Protein GI163852537 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.021379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.710874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGAAC AGCAATTCGC CAGCGACAAC TATGCCGGGA TCTGCCCGGA GGCCCTCACC 
GCGATGCAAG CGGCCAATGC CGGCCACGCC CCCGCCTACG GCGCCGATTC CTGGACGCAG
GCCGCGGCCG ACGGCTTCCG GCGGTTGTTC GAGACCGATT GCGAGGTGTT CTTCGTCTTC
AACGGCACCG CCGCCAACTC GCTGGCGCTG GCCTCGCTCT GCCAATCCTA TCACAGCGTG
ATCTGTGCGG ACTCGGCGCA TATCGAGACC GACGAATGCG GTGCGCCGGA GTTCTTCTCC
AACGGGTCGA AGCTGCTCAC CGCCGCGACC GATGACGGGA AGCTCACGCC CGAGATCGTG
CGGACGATCG CCGGCAAGCG CTCCGACATC CATTTCCCGA AACCGCGGGC GGTGACGCTG
ACCCAGGCGA CGGAGACGGG GCGGGTCTAC AGCCTCGACG AGATCGCGGC CATCTCGGGC
GTTTGCCGCG CTTTGGGCCT GCGCCTGCAC ATGGACGGCG CGCGCTTTGC CAATGCCTGC
GCCTCGCTCG ACGCCTCCCC GGCCGAACTG ACCTGGAAGG CCGGCGTCGA CGTGCTCTGC
TTCGGCGGCA CCAAGAACGG CATGGCGGTC GGCGAGGCGG TGATCTTCTT CGACCGAAAA
CTCGCGGAGG ATTTCGACTA TCGCTGCAAG CAGGCGGGCC AGCTCGCCTC GAAGATGCGC
TACCTCTCGG CGCCCTGGGT CGGCATGCTG GAGAACGGGG CGTGGCTCGA CAACGCCCGC
CACGCCAATG AATGTGCCCG ACGGCTGGTG CGAGGGATCG CCGACGTGCC GGGCATCTCG
CTCGCGGCAC CGGTGGAGGC CAACGGCGTG TTCCTGAACC TGCCCGCGCC GGTCCAAGAG
GGGCTGCGCG CCCGCGGCTG GCAGTTCTAC GGCTTCATCG GCGGGGCGTC CCGCTTCATG
TTCGCCTGGG ATTCAGAGCC GGCGCGGGTC GATGCGCTGG CGCAGGACAT CCGCCTCTGC
GCCCTGCCCG CGGCGGCGTG A
 
Protein sequence
MAEQQFASDN YAGICPEALT AMQAANAGHA PAYGADSWTQ AAADGFRRLF ETDCEVFFVF 
NGTAANSLAL ASLCQSYHSV ICADSAHIET DECGAPEFFS NGSKLLTAAT DDGKLTPEIV
RTIAGKRSDI HFPKPRAVTL TQATETGRVY SLDEIAAISG VCRALGLRLH MDGARFANAC
ASLDASPAEL TWKAGVDVLC FGGTKNGMAV GEAVIFFDRK LAEDFDYRCK QAGQLASKMR
YLSAPWVGML ENGAWLDNAR HANECARRLV RGIADVPGIS LAAPVEANGV FLNLPAPVQE
GLRARGWQFY GFIGGASRFM FAWDSEPARV DALAQDIRLC ALPAAA