Gene M446_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3802 
Symbol 
ID6134749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp4233939 
End bp4235099 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content71% 
IMG OID641643970 
Productaldo/keto reductase 
Protein accessionYP_001770614 
Protein GI170741959 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0192807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGTC GCAGTTTCCT GCAGGGCGCG GTGCTCGGGG CGGGCCTCGC GGCCGGCGGC 
TCGGCCGGCG CCGCGTCCCC GGCGGGGCCC GGCCCGGCGC TCCCGTCCGC GGGGCCGCTG
CCGCCCGCGG CGGCCGCGCT CGCCGGGGCG AACCGGCCCG AGGATCCGGC GGCGCTGCCC
TTCGTGACCG ATCCGGGGGA GAGGCGGGGC GAGATGCTCT ACCGGCCCCT CGGCCGCACC
GGGGTCACCG TCTCGGCGAT CGGCATGGGC GGGTTCCACC TCGGCAAGAA GGCGCTCAGC
GATGCCGAGG CGACGCGGCT GATCCATCAG GGCGTCGACC GCGGCATCAC CTTCATGGAC
AATTGCTGGG ACTACAACGA GGGCAGGTCC GAGGAGCGGA TGGGCGCGGC GCTCGCCGAG
GGCGGTTACC GGGCCAAGGT CTTCCTGATG TCGAAGATGG ACGGCCGGAC CAAGAAGGAG
GCGGCCGCCC AGATCGACAC CTCGCTCAAG CGCCTGCGCA CGGACCGCAT CGACCTCGTC
CAGCACCACG AGATCCTGCG CTACGACGAT CCCGACCGGG TCTTCGCCGA GGGCGGGGCC
ATGGAGGCCT TCATCGAGGC GCGCCAGGCC GGCAAGCTGC GCTTCATCGG CTTCACGGGC
CACAAGGACC CGCGCATCCA CCTGCAGATG CTGGAGGTCG CGGCCGAGCG GGGCTTCCGC
TTCGACACCG TGCAGATGCC CCTCAACGTG CTCGACGCGC AGTTCCGCAG CTTCGCGCAC
CTCGTGCTGC CCTCCCTGGT GGCGCAGGGG ATCGGCGTGC TCGGGATGAA GACCTTCGGC
GACGGGGTCA TCCTCAAGAG CAACGCCCCG ATCCGGCCGA TCGAGTACCT CCACTTCAAC
CTCAACCTGC CGACCTCCGT GGTGATCACC GGCATCCAGA GCCAGCGCGA CCTCGACCAG
GCCTTCGAGG CGGTGAAGAG CTTCCGGCCG ATGGACAAGG CGGCGGTGGC GGAACTGCTC
GCCCGCGCTC GACCCTACGC GCTCGAGGGC AAGTACGAGC TGTTCAAGAC GAGTTCGACC
TTCGACGGCA CCGCCAAGAA CGCCGCCTGG CTCGGCGGCG AGGCCGAGGG CGTGCAATCC
CTCGCCCCGA CCATGGAATA G
 
Protein sequence
MERRSFLQGA VLGAGLAAGG SAGAASPAGP GPALPSAGPL PPAAAALAGA NRPEDPAALP 
FVTDPGERRG EMLYRPLGRT GVTVSAIGMG GFHLGKKALS DAEATRLIHQ GVDRGITFMD
NCWDYNEGRS EERMGAALAE GGYRAKVFLM SKMDGRTKKE AAAQIDTSLK RLRTDRIDLV
QHHEILRYDD PDRVFAEGGA MEAFIEARQA GKLRFIGFTG HKDPRIHLQM LEVAAERGFR
FDTVQMPLNV LDAQFRSFAH LVLPSLVAQG IGVLGMKTFG DGVILKSNAP IRPIEYLHFN
LNLPTSVVIT GIQSQRDLDQ AFEAVKSFRP MDKAAVAELL ARARPYALEG KYELFKTSST
FDGTAKNAAW LGGEAEGVQS LAPTME