Gene M446_3859 details


Gene Information

Locus tag: M446_3859
Symbol:
ID: 6131998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 4305206
End bp: 4306699
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 70%
IMG OID: 641644024
Product: aldehyde dehydrogenase
Protein accession: YP_001770666
Protein GI: 170742011
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
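
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4305206, 4306699)
print(length, protein_length(length))  # 1494 497
```

Under these assumptions the result agrees with the listed gene length of 1494 bp and protein length of 497 aa.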


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGATC AAACCCTAGG CAGCTGGAGC GAGCGGGCAC GGGCCCTCTC TATCCGCAAC 
GACGCGTTCG TCGAGGGCCG CTTCGTACCT GCGGCATCGG GCCGGACGTT CGACTGCGTC
TTCCCGGGCA CCGGACGGCG GGTGAGCCAG GTGGCGGCCT GCGAAGCGGA CGACGTCGAT
CGCGCGGTCC GCTCGGCCCG GCGCGCCTTC GAGGCGGGAT CCTGGTCGCG GATGGCGCCG
GCCGATCGCA AGCGCGTCAT GCTGCGCTTC GCGGACCTCC TCCTGGCGAA CCGCGACGAA
CTCGCGCTGC TGGAGACCCT GAACGTCGGC AAGCCCATCA CGAGCGCGCT GTCCGGAGAC
ATCCCGAGCG CGGCGAACTG CATCGCGTTC TACGGCGAGG CGATCGACAA GATCTACGGC
GAGGTCGCCC CCGCGCCCGC CGACTTCACC ACCCTGGTGA CGCGCGAGCC CCTCGGGGTG
GTCGCGGCCG TGGTGCCGTG GAACTACCCC CTGTCGATGA CGGCCTGGAA GCTCGGCCCC
GCCCTGGCCG CCGGGAACTC GGTCGTCGTG AAGCCGGCCG AGCAGTCGCC GTTCACGGCG
CTGCGGATCG CCGAACTCGC GATGGAGGCC GGACTCCCGC CGGGGGTGCT CAACGTCGTG
CCGGGCCTGG GCGAGACGGC CGGCCGAGCG CTCGGCCTCC ACATGGACGT CGACTGCGTT
ACCTTCACCG GATCGACGGA GGTCGGGAAG CTCTTCCTGC AATATGCGGG ACGATCGAAC
GCGAAGCGGG TGAGCCTCGA ACTCGGCGGC AAGTCGCCCC AGATCGTCAT GGCGGATTGC
GCGGATCTCG ACGCCGCCGC GCAGGCGGTC GCCGCCGGGA TCTTCACCAA TGCCGGGCAG
GTCTGCAACG CGGGCTCGCG GCTGATCGTC CAGGAGAGCG TCCGCGAGGA ACTGCTCGAG
AAGGTGGTGG CCCGCGCCCG CGCGCTCAAG CCCGGCGACC CGCTCGATCC CGAGACCCGG
CTGGGGCCGC TGGTCAGCGA GCCCCAGATG GAGCGCGTGC TCGGCTACAT CCGGAAGGGC
CAGGAGGCAG GCGCGGCGGT CGTCGCCGGC GGCGGGCGCA CCCTGCTCGA CACCGGCGGC
TACTTCGTCG AGCCGACCGT GTTCGACCGC GTCGAGAACC GCATGGCGAT CGCCCAGGAG
GAGATCTTCG GGCCGGTCCT CTCCACGATC TCCGTGTCCG GCTTCGACGA GGCGATCGCC
GTCGCGAACG ACACGATCTA CGGCCTCGCC GCTTCGATCT GGACGACTGA CCTGACCAAG
GCGCACCGGG CGGCCCGCGC GATCCGGTCC GGCGTCGTCT ACGTGAACTG CTTCGACAAA
GGGTCGATGT CCGTGCCCTT CGGCGGCTTC AAACAGTCCG GCTTCGGACG CGACAAGTCC
TTGCACGCCA TCGACAAGTA CATGGACCTG AAGGCGGTCT GGTTCGCGAC CTGA
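
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before it can be analysed. The sketch below, with illustrative helper names, shows one way to clean such text and compute the GC fraction; applied to the full sequence, the result can be compared with the GC content listed under Gene Information.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short demonstration input.
first_line = "ATGACGGATC AAACCCTAGG CAGCTGGAGC GAGCGGGCAC GGGCCCTCTC TATCCGCAAC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```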
 
Protein sequence
MTDQTLGSWS ERARALSIRN DAFVEGRFVP AASGRTFDCV FPGTGRRVSQ VAACEADDVD 
RAVRSARRAF EAGSWSRMAP ADRKRVMLRF ADLLLANRDE LALLETLNVG KPITSALSGD
IPSAANCIAF YGEAIDKIYG EVAPAPADFT TLVTREPLGV VAAVVPWNYP LSMTAWKLGP
ALAAGNSVVV KPAEQSPFTA LRIAELAMEA GLPPGVLNVV PGLGETAGRA LGLHMDVDCV
TFTGSTEVGK LFLQYAGRSN AKRVSLELGG KSPQIVMADC ADLDAAAQAV AAGIFTNAGQ
VCNAGSRLIV QESVREELLE KVVARARALK PGDPLDPETR LGPLVSEPQM ERVLGYIRKG
QEAGAAVVAG GGRTLLDTGG YFVEPTVFDR VENRMAIAQE EIFGPVLSTI SVSGFDEAIA
VANDTIYGLA ASIWTTDLTK AHRAARAIRS GVVYVNCFDK GSMSVPFGGF KQSGFGRDKS
LHAIDKYMDL KAVWFAT
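
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G); translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons. The `translate` helper is illustrative, reads from the first base, and does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
print(translate("ATGACGGATC AAACCCTAGG CAGCTGGAGC GAGCGGGCAC GGGCCCTCTC TATCCGCAAC"))
# MTDQTLGSWSERARALSIRN
```

The output matches the first 20 residues of the protein sequence above.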