Gene Amir_4646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4646 
Symbol 
ID8328844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5531325 
End bp5532425 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content76% 
IMG OID644945092 
Productmycothiol-dependent formaldehyde dehydrogenase 
Protein accessionYP_003102324 
Protein GI256378664 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR03451] mycothiol-dependent formaldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGGA CGGTCAGTGC TGTGGTGGCG CCGGGTGGCG GCAAACCCGC CGAGCTGGTG 
GAGGTCGTGG TGCCGGACCC CGGTCCGGAC GAGGTGACCG TCCGGGTGCT GGCGTCGGGG
GTGTGCCACA CCGACCTGCA CTACCGGGAC GGCGTCATCG CCGCCGAGGG CCCGTACCTG
CTGGGCCACG AGGCCTCCGG GATCGTGGAG CGGGTCGGGC CGGGCGTGCG CGACGTCAAG
CCCGGCGACT TCGTGGTGCT CAACTGGCGG GCGGTGTGCG GGCGGTGCCG GGCGTGCAGG
CGCGGGCGGG CCGAGGCGTG CGTGGACGAC CGCACCGCCA CCACCCCGAT GACCCTGCTC
GACGGGACGC CGCTCACCCC GGCGCTGGGC ATCGGCGCGT TCACCGAGCT GACCCTGGTG
CACAGCGGCC AGTGCACCCC GGTGAACCCG GCGGCGGACC CGGCGGTGGT GTGCCTGCTC
GGGTGCGGGG TCATGTCGGG GCTGGGCGCG GCGATGAACA CCGGCGGCGT GCGGGTCGGC
GACACGGTCG CGGTGATCGG GGTCGGCGGG GTCGGCGGCG CGGCGGTCGT GGGCGCGCGG
CTGGCCGGGG CGACGACGGT CGTGGCGGTG GACCGCGACG AGCGCAAGCG CGCCGTCGCG
CACGAGCTGG GCGCGACCGA CTTCGTGCAC GCGGCCGAGG GCGTGGACGT GGTCGCGCGG
GTGCGCGAGC TGACCGGCGG GCTGGGCGCC GACGTGGTCG TCGACGCCGC CGGGTCCGAG
CAGACCTGGC GGCAGGCGTT CTACGCGCGG GCGCTGGGCG GCACGTTCGT CCTGGTCGCC
AGGCCGGACG CGTCGATGCG GCTGGAGCTG CCGCTGCTGG ACGCGTTCCT GCGCAACGGC
ACGTACCGCA CGAGCTGGTA CGGCGACTGC CTCCCGTCCC GCGACTTCCC GCCGCTGGTC
GAGCTGTTCC TCCAGGACCG CCTGCCGCTG CGCCGGTTCG TGTCCGAGCG GATCGGGCTC
GGCGACGTGG AACGCGCGTT CGAGTCGATG CGCAGGGGCG ACGTGCTGCG CAGCGTGGTC
CTGGTGGACG GCGCGCGCTA G
 
Protein sequence
MSRTVSAVVA PGGGKPAELV EVVVPDPGPD EVTVRVLASG VCHTDLHYRD GVIAAEGPYL 
LGHEASGIVE RVGPGVRDVK PGDFVVLNWR AVCGRCRACR RGRAEACVDD RTATTPMTLL
DGTPLTPALG IGAFTELTLV HSGQCTPVNP AADPAVVCLL GCGVMSGLGA AMNTGGVRVG
DTVAVIGVGG VGGAAVVGAR LAGATTVVAV DRDERKRAVA HELGATDFVH AAEGVDVVAR
VRELTGGLGA DVVVDAAGSE QTWRQAFYAR ALGGTFVLVA RPDASMRLEL PLLDAFLRNG
TYRTSWYGDC LPSRDFPPLV ELFLQDRLPL RRFVSERIGL GDVERAFESM RRGDVLRSVV
LVDGAR