Gene Mext_4206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4206 
Symbol 
ID5833242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4680222 
End bp4681655 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content71% 
IMG OID641369996 
Productaldehyde dehydrogenase 
Protein accessionYP_001641646 
Protein GI163853603 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.124463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCTCG ACGTTTCGCT TCACATCGCC GGCCGCTGGC GCCCCGGTGG CGGCGGCGAG 
ACCCTGGCCG TTCTCAATCC GGCCACGGGC GAGGCGATCG GCCGGGTCGC GGTGGCGACG
CGGGCCGATC TCGACGAGGC GTTGGAGGCG GTCGAGCGGG GCTTTGCCGC GTGGCGGCGG
GTCTCCGCCT TCGACCGGTC CAAGGTGCTG CGCCGGGCCG CCGCGCTGAT GCGCGAGCGC
GCGGAGGACA TCGCCCGCAC CATGACCGTC GAGCAGGGCA AGCCGCTCGC CGAATCACGG
GTCGAGACCG GGGTGGCGGC CGACATCATC GAGTGGTTCG CCGAAGAGGG CCGGCGCGCC
TACGGCCGGG TCATCCCCGC CCGCGCGGAG GGCGTGTTGC AGATCGTCAC CCGCGAGCCG
GTCGGGCCGG TGGCGGCCTT CACGCCCTGG AACTTCCCGA TCAATCAGGC GGTGCGAAAG
CTTTCGGCGG CGCTCTGCAC CGGCTGCCCC GTCATCCTCA AGGGACCGGA GGACACGCCG
GCCTCCTGCG CCGAACTGGT GCGCGCCTTC CTCGATGCGG GTGTGCCCGG CGACGCGCTT
GCCTTGGTCT ACGGCGATCC GGCCGAGATT TCCGGCTATC TCATCCCGCA CCCGGTGATC
CGCAAGATCA CCTTCACCGG CTCGACCGCG GTCGGCAAAC AGCTCGCGGC GCTCGCGGGC
CAGCACATGA AGCGGGCGAC GATGGAACTC GGCGGCCATG CCCCGGCGAT CGTGTTCGAC
GACGCCGATA TCGAGACCGC CGTGCGGGTG CTGTCCGCCA ACAAGTACCG CAATGCCGGC
CAGGTCTGCG TCGCCCCGAC CCGCTTCCTC GTGCAGGAGA GGGTGTACGA CCGCTTCATC
GACGGGTTCG TGGCCGCCTC GAAGGCGCTC AAGGTCGGCG ACGGGCTCGA TCCCGAAACG
CAGATGGGCC CGCTCGTCCA CGGCCGCCGG GTCGAGGCGA TGGAGGCGTT CGTCGCCGAT
GCCGAGGCGA AGGGCGCGCG CCTCCTCACC GGCGGCAGCC GCATCGGCAA TCGCGGCCAC
TTCTTCGAGC CCACCGTCTT CGCCGACGTG CCGCTGGAGG CCCGGATCAT GAACGAGGAG
CCGTTCGGGC CGATCGCGGC GATCCGACGC TTTTCCGACG ACGACGAGGC GCTCACCGAG
GCCAACCGTC TGCCCTACGG GCTCGCGGCC TACGCCTATA CCCGCTCCGG CACCCGGGCG
AACCGGATCG GGGCGGGGAT CGAGGCCGGC ATGATCTCGA TCAACCACCA CGGCATCGCG
CTGCCCGAGA CGCCCTTCGG CGGCGTGAGG GATTCCGGCT ACGGCAGCGA GGGCGGCTCG
GAGGCGATCG AGGCCTATCT CACGACGAAA TTCGTGACGC AGGCCAACGC CTGA
 
Protein sequence
MDLDVSLHIA GRWRPGGGGE TLAVLNPATG EAIGRVAVAT RADLDEALEA VERGFAAWRR 
VSAFDRSKVL RRAAALMRER AEDIARTMTV EQGKPLAESR VETGVAADII EWFAEEGRRA
YGRVIPARAE GVLQIVTREP VGPVAAFTPW NFPINQAVRK LSAALCTGCP VILKGPEDTP
ASCAELVRAF LDAGVPGDAL ALVYGDPAEI SGYLIPHPVI RKITFTGSTA VGKQLAALAG
QHMKRATMEL GGHAPAIVFD DADIETAVRV LSANKYRNAG QVCVAPTRFL VQERVYDRFI
DGFVAASKAL KVGDGLDPET QMGPLVHGRR VEAMEAFVAD AEAKGARLLT GGSRIGNRGH
FFEPTVFADV PLEARIMNEE PFGPIAAIRR FSDDDEALTE ANRLPYGLAA YAYTRSGTRA
NRIGAGIEAG MISINHHGIA LPETPFGGVR DSGYGSEGGS EAIEAYLTTK FVTQANA