Gene Mmcs_1644 details


Gene Information

Locus tag: Mmcs_1644
Symbol:
ID: 4110479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1781842
End bp: 1782867
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 68%
IMG OID: 638030764
Product: zinc-binding alcohol dehydrogenase
Protein accession: YP_638810
Protein GI: 108798613
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
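
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1781842
end_bp = 1782867

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop
# codon, which does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 1026
print(protein_length_aa)  # 341
```

Both results agree with the Gene Length (1026 bp) and Protein Length (341 aa) fields.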


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.14571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGCGA TCGTTCTCAA CGGCACCAAC GACGTCGGTC TGACGTCGGT GCCAGACCCG 
GCGCCGCAGG ACGGTGAGGT CATCATCGAA GTGGCGGCGA CAGGGCTGTG TGGAACTGAC
CTCCACGAGT ATGTCGCGGG GCCCACCTTC TCGCAGCCGC CGGTGGTGCT CGGTCACGAG
GTCTCGGGCC GGATCGTGGA GGTCGGAGCG GGCGTCGACC AATCCCGCAT CGGGGAGGGC
GCCGCGGTGA TCCCGATGGA TTTCTGCGGG AGCTGCCACT ACTGCCACCG GTCGCTCTAC
CACTTGTGCC AGCGCCCAGG ATGGATCGGC TTCACCCGAA ACGGAGGCCT CGCGAACTAC
GTCGCAGTGC CCTCTCGGCT CGCAGTCCGA GTGCCGGACG TGGTGGACCT CGAGGAGGCG
GCGCTGACCG AGCCGACGGC GGTGGCGTTC CACGCGGTGC GGCGAGCGGA ACTGCTCCTC
GGCGAAACGG TGATGGTCCT CGGTGCCGGG GCACTCGGGC TCACCGTGAT CCAGTGCGCA
CGCGCGGCCG GAGCTGCGCG AATCTTCGTC ACGGAACCAA GCGGCGTGCG GGCCAGCCTG
GCGCGCGACC TCGGCGCCAC GTTGGTGCTC GATCCGCATG ACCCCGGGAC CACCGCGTGC
ATCCTGGAGG AGACCCGCGG TGTAGGGGTG GACGTGGTCT TCCATGTGGC GGGCAGCGCG
GAGGCGTTCA CACAGGGCCT GGACTGCCTC CGCAAACAGG GCCGTTTCAT GGAGATGTCG
TCGTGGGCCG GCGCGGCCTC GCTCGATGTC AACCGCCATC TGCTCAAGGA GATTCAGCTC
CGGATGGTTT TCGGTTACGA CATGTTCGAC GATTTCCCGG CCGTTCTCGC CCTGATCGCC
GACGGAAAAC TCGCGCTCGC GCCGCAAATC ACCGCTCGAG TCCCGCTGGA CCGCGCCGTC
AAGGAGGGAT TGGGCGGGCT ATTGGAGGGC CGGGAGGGTC TGGTCAAGGT GCTGGTGAAG
CCGTGA
 
Protein sequence
MEAIVLNGTN DVGLTSVPDP APQDGEVIIE VAATGLCGTD LHEYVAGPTF SQPPVVLGHE 
VSGRIVEVGA GVDQSRIGEG AAVIPMDFCG SCHYCHRSLY HLCQRPGWIG FTRNGGLANY
VAVPSRLAVR VPDVVDLEEA ALTEPTAVAF HAVRRAELLL GETVMVLGAG ALGLTVIQCA
RAAGAARIFV TEPSGVRASL ARDLGATLVL DPHDPGTTAC ILEETRGVGV DVVFHVAGSA
EAFTQGLDCL RKQGRFMEMS SWAGAASLDV NRHLLKEIQL RMVFGYDMFD DFPAVLALIA
DGKLALAPQI TARVPLDRAV KEGLGGLLEG REGLVKVLVK P
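
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons; that case is ignored here because this gene starts with ATG. Only the first row of the gene sequence is used as sample input; the full sequence can be pasted in the same way.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGGAAGCGA TCGTTCTCAA CGGCACCAAC GACGTCGGTC TGACGTCGGT GCCAGACCCG"

print(translate(first_row))  # MEAIVLNGTNDVGLTSVPDP
print(f"{gc_content(first_row):.0%}")
```

The translated fragment matches the first 20 residues of the protein sequence. Running `gc_content` on the complete gene sequence gives a value to compare against the 68% reported in the record.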