Gene Mmcs_5456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5456 
Symbol 
ID4114541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008147 
Strand
Start bp36115 
End bp37146 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content63% 
IMG OID638034611 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_642612 
Protein GI108802416 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value0.077084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGCAG CTATTTATCA CGGCCGCGAA GACGTGCGGA TTGAAGAGCT TCCCGACCCG 
AGTCCACGTG CGGGCGAAGT GGTCATCGAG GTGGCCCGAG CGGGGATCTG CGGCACAGAC
CTGCATGAGT ACATCGCCGG TCCGATGCAT GCTGCGCCAG GGGTCGTCAT TGGACACGAA
TATTCCGGGA CGGTGGTCGG CGTCGGGTCC GGCGTGCGCG AGTTCACCGA AGGTGACCGA
GTCTGCGGCG TCGGCGTTTT CGGCTGCGGT GAATGCGGCT TCTGCAAACA GGGCGCTGAA
GCACTCTGCG GAGCAGTCGG TTTCATCGGC TTCGCTGTCA ACGGAGCGCT TGCCCGCTAC
GCGTCGTTGC CGACGAAGGC GTTGTTTCGC ATCCCGGATG AGATCAGTCT CGCCGAGGCT
GCCGTCGTTG AACCGATCGC GTCGGCGTAC CACGCTGTCC GCCGTAGCGG GTTGGCAGCG
GGTGGGACCG TCTTCATCGC CGGCGCTGGC CCGATCGGCC TCGCCCTGGT GCAGTTCTCA
CTTGCCAAAG GTGCGACTCA GGTCATCGTC AATGAGGTTT CGGCGACGCG CCGCGTTGCC
GCTCACCGGG TCGGGGCTAC GCGCGTAATC GATCCTCTGG CGGAGGATGC AGTTGAGGTG
GTTCGGACGT TGACGAACGG AAACGGCGTC GACATCTCCT TCGACGCTGC GGGCGTACAA
CCCGCGCTTG ATGCGGCGCT GGGCGTTCTG CGCCCCCGCG GTCGATTGAT GGTCGTGGCG
ATTTGGGAGG CCCCGGCCGG TATCGACATC AATCGAAGCG TCATGCGCGA AGCAGATATT
GGCTTCTCAT TTTGTTACGA AGCGCAAAGG CAGGTCCCGG CGATTCTTGA CTTACTAGCT
ACGGGGGCGA TCAACCTCGG TGAACTCATC ACCGACGAGA TCCCGTTGGA TGCCGTGGTC
AGTCAGGGTC TCGAGGAACT GCGCGTCAAC CGCGATGCAC ACGTAAAAAT TCTGATCGAC
CCATCGGCAT AG
 
Protein sequence
MRAAIYHGRE DVRIEELPDP SPRAGEVVIE VARAGICGTD LHEYIAGPMH AAPGVVIGHE 
YSGTVVGVGS GVREFTEGDR VCGVGVFGCG ECGFCKQGAE ALCGAVGFIG FAVNGALARY
ASLPTKALFR IPDEISLAEA AVVEPIASAY HAVRRSGLAA GGTVFIAGAG PIGLALVQFS
LAKGATQVIV NEVSATRRVA AHRVGATRVI DPLAEDAVEV VRTLTNGNGV DISFDAAGVQ
PALDAALGVL RPRGRLMVVA IWEAPAGIDI NRSVMREADI GFSFCYEAQR QVPAILDLLA
TGAINLGELI TDEIPLDAVV SQGLEELRVN RDAHVKILID PSA