Gene Mext_3917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3917 
Symbol 
ID5834121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4353138 
End bp4354187 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID641369708 
ProductHpcH/HpaI aldolase 
Protein accessionYP_001641359 
Protein GI163853316 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2301] Citrate lyase beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.243397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGC CGCGCCGCTT CTTCCAGCCC CTCGCCGCGG GTGCGCCCGA GCCGTTCCGC 
GAACTGCCGA TCAAGCTCGA GCGGATGATC CACTTCGTGC CGCCGCACAA CGAGAAGGTC
CGCGCCCGCG TGCCCGAACT CGCCAAGACG GTCGATGTGG TGCTCGGCAA CCTGGAGGAC
GCGGTCCCCG CCGACCAGAA GGAGGCGGCG CGCAAGGGCT TCGTCGAGAT GGCCCGCGCC
ACCGATTTCG CAGCCTCCGG CACCGGCCTG TGGACGCGCA TCAATGCCCT GAACTCGCCC
TGGATCCTCG ACGACCTGTT CACCATCGTC GCCGAGGTCG GCGCGAAGCT CGACGTGGTG
ATGGTGCCGA AGGTCGAGGG CCCCTGGGAC ATCCACTACA TCGACCAGTT GCTGGCCCAG
CTCGAGGCGC GCCACGGCGT GACCAAGCCG ATCCTCGTCC ACGCCATCCT CGAGACCGCC
GAAGGCGTGG CCAACGTCGA CGCCATCGCC TCCGCCTCGC CGCGCATGCA CGGCATGAGC
CTCGGGCCGG CCGATCTCGC GGCGTCCCGC GGCATGAAGA CCACCCGCGT CGGCGGCGGT
CACCCGGATT ACCGCGTCCT GTCCGATCCC AAGGGCGATG CCGAGCGGGC GTCCGCCCAG
CAGGATCTGT GGCACTACAC CATCGCCAAG ATGGTCGATG CCTGCATGGC CAACGGCATC
AAGGCGTTCT ACGGCCCGTT CGGCGACTTC TCCGATTCGG CCGCCTGCGA GGTGCAGTTC
CGCAACGCCT TCCTGATGGG CTGCGCCGGC GCCTGGACCC TGCATCCGAG CCAGGTCGCC
CTGGCCAAGA CCGTGTTCGC CCCCGATCCG GCCGAGGTGA ACTTCGCCTC CCGCATCGTC
GAGGCGATGC CCGACGGCAC CGGCGCGGTG ATGATCGACG GCAAGATGCA GGACGACGCC
ACCTGGAAGC AGGCCAAGGT CATCGTCGAT CTCGCCCGGC TCGTGGCCGA GAAGGATCCG
GATCTCGCCA AGGTCTACAA TCTGCCCTGA
 
Protein sequence
MKLPRRFFQP LAAGAPEPFR ELPIKLERMI HFVPPHNEKV RARVPELAKT VDVVLGNLED 
AVPADQKEAA RKGFVEMARA TDFAASGTGL WTRINALNSP WILDDLFTIV AEVGAKLDVV
MVPKVEGPWD IHYIDQLLAQ LEARHGVTKP ILVHAILETA EGVANVDAIA SASPRMHGMS
LGPADLAASR GMKTTRVGGG HPDYRVLSDP KGDAERASAQ QDLWHYTIAK MVDACMANGI
KAFYGPFGDF SDSAACEVQF RNAFLMGCAG AWTLHPSQVA LAKTVFAPDP AEVNFASRIV
EAMPDGTGAV MIDGKMQDDA TWKQAKVIVD LARLVAEKDP DLAKVYNLP