Gene Mext_3423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3423 
Symbol 
ID5831843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp3797926 
End bp3799449 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content67% 
IMG OID641369222 
Productaldehyde dehydrogenase 
Protein accessionYP_001640880 
Protein GI163852837 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0352428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGC CCGAATTCCT GGCGGATGCC AAGACGAAGT CGCCGTTTTC GGCGCGCTAC 
GACAACTTCA TCGGCGGCCA GTGGGTCGCG CCGGCGAGCG GCCGCTACTT CGAGAACACC
TCCCCCATCA CCGGCAAGGT GATCTGCGAG GTCGCCCGCT CAGAGGCGGC GGACATCGAG
CGGGCGCTCG ACGCCGCGCA CGCCGCCAAG GATGCCTGGG GCCGCACCGC ACCGGCCGAG
CGCGCCCGCA TCCTCAACAA GATCGCCGAC CGGATGGAGG ACAACCTCGA TCTGATCGCG
CTGGCCGAGA CCTGGGACAA CGGCAAGCCG ATCCGCGAGA CCACCGCCGC CGACATCCCG
CTCGCCATCG ACCACTGGCG CTACTTCGCC AGCTGCGTCC GCGCCCAGGA AGGCGCGATC
TCCGAGATCG ACCACGACAC GGTGGCCTAT CACTTCCACG AGCCGCTCGG CGTCGTCGGC
CAGATCATCC CGTGGAACTT CCCGATCCTG ATGGCGGTGT GGAAGCTGGC GCCTGCGATC
GCCGCCGGCA ACTGCGTGGT GCTCAAGCCC GCCGAGCAGA CCCCCGCCTC GATCCTCGTG
GTGATGGAGC TGATCGGCGA CCTGCTGCCG CCGGGCGTCA TCAACGTCGT CAACGGCTTC
GGCCTGGAGG CCGGCAAGCC GCTCGCCTCG AACCCGCGCA TCGCCAAGAT CGCCTTCACC
GGTGAGACGA CCACCGGCCG CCTCATCATG CAGTACGCCT CGCAGAACCT GATCCCGGTG
ACGCTGGAGC TGGGCGGCAA GTCGCCGAAC ATCTTCTTCG GGGATGTCGT CAACGAGGAT
GACGACTTCT TCGACAAGGC GCTCGAAGGC TTCACCATGT TCGCCCTCAA CCAGGGCGAA
GTCTGCACCT GCCCGAGCCG CGCGCTCGTG CACGAGTCGA TCTACGACCG CTTCATCGAG
CGCGCGATCA AGCGCGTCGA GGCGATCACC CAGGGCTCGC CGCTCGATCC GGCGACGATG
ATCGGCGCGC AGGCCTCCTC GGAGCAGCTC GAGAAGATTC TCAGCTACGT CGATATCGGC
CGCCAGGAAG GCGCCGAGTG CCTCACGGGC GGCGCCCGCG GCACCCGCGA GGGCGATCTG
GCCGACGGCT TCTACATGCA GCCGACGGTG TTCAAGGGCC ACAACAAGAT GCGGATCTTC
CAGGAGGAGA TCTTCGGGCC CGTCCTCTCG GTCACGACCT TCAAGGACGA CGAGGAGGCG
CTCTCCATCG CCAACGACAC GCTCTACGGC CTCGGCGCCG GCGTGTGGAC CCGCGACGGA
ACCCGCGCCT ACCGCTTCGG CCGCGCCATC CAGGCCGGCC GCGTCTGGAC GAACTGCTAC
CACGCCTATC CGGCGCACGC GGCCTTCGGC GGCTACAAGC AGTCCGGCAT CGGCCGTGAG
ACCCACAAGA TGATGCTCGA CCACTACCAG CAGACCAAGA ACATGCTGGT CAGCTACTCC
TCGAAGAAGC TCGGGTTCTT CTAA
 
Protein sequence
MNKPEFLADA KTKSPFSARY DNFIGGQWVA PASGRYFENT SPITGKVICE VARSEAADIE 
RALDAAHAAK DAWGRTAPAE RARILNKIAD RMEDNLDLIA LAETWDNGKP IRETTAADIP
LAIDHWRYFA SCVRAQEGAI SEIDHDTVAY HFHEPLGVVG QIIPWNFPIL MAVWKLAPAI
AAGNCVVLKP AEQTPASILV VMELIGDLLP PGVINVVNGF GLEAGKPLAS NPRIAKIAFT
GETTTGRLIM QYASQNLIPV TLELGGKSPN IFFGDVVNED DDFFDKALEG FTMFALNQGE
VCTCPSRALV HESIYDRFIE RAIKRVEAIT QGSPLDPATM IGAQASSEQL EKILSYVDIG
RQEGAECLTG GARGTREGDL ADGFYMQPTV FKGHNKMRIF QEEIFGPVLS VTTFKDDEEA
LSIANDTLYG LGAGVWTRDG TRAYRFGRAI QAGRVWTNCY HAYPAHAAFG GYKQSGIGRE
THKMMLDHYQ QTKNMLVSYS SKKLGFF