Gene Mmcs_4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4075 
Symbol 
ID4112905 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4343587 
End bp4344615 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID638033218 
Productaldo/keto reductase 
Protein accessionYP_641236 
Protein GI108801039 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.48002 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTATC GGCGGGTTGG CGAATCGGGT CTGACGGTGT CGGAGATCAG TTTCGGCGCA 
GCGACATTCG GCGGGGTCGG CGACTTCTTC GGCGCCTGGG GCGATACCGG CGTCGAGGGC
GCGCGTCGCA TCGTGGACAT CTGCCTGGAG GCTGGTGTCA CGCTGTTCGA CACCGCGGAC
GTGTACTCCG ACGGCGCCTC GGAGGAGGTG CTCGGCGAAG CCCTGCGCGG CCGGCGCGAC
CGGGTGCTCA TCTCCACCAA GGCCGCGCTG CCCACCTCGA CCGGCTGGGG CACCTCACGC
GCTCGGTTGC TGCGTGCGGT CGAGGATGCG CTGCGGCGGT TGCGGACCGA CCGCATCGAC
CTGTTCCAAC TGCACGGCTA CGACTCGGGG ACGCCGATCG AGGAAGTCGT GGCGACCCTC
GACACGCTGC TCACGCAGGG CAAGGTGCGC TACACCGGCG TGTCGAACTT CTCCGGATGG
CAGTTGATGA AATCGCTGGC GGTCGCCGAC GGCGCACACC GCACCCGCCA CATCGCCCAT
CAGGTCTACT ACTCGCTCGT CGGGCGGGAT TACGAATGGG AACTCATGCC GCTGGGCCTT
GCCGAGGGCG TCGGCGCGCT GGTGTGGAGT CCGCTGGGCT GGGGACGGCT CACCGGCCGG
ATCCGGCGCG GACGACCGCT GCCCGAGCGC AGCCGCCTGC ACGCGACCGC TGACGCGGGT
CCGCCCGTCG ACGAGGATCG GCTCTACGCC GTCGTCGACA CCCTCGACGA CATCGCCGCG
GAGACCGGAC GCACCGTGGC GCAGATCGCG CTCAACTGGC TCCTGCGGCG GCCGACCGTC
GCCTCGGTGA TCATCGGAGC CCGCAACGAG GAACAGCTGC GCGAGAACCT GGGCGCCGTC
GGCTGGCGAC TCGACGACGA GCAGATCGCC CGGCTGGACG CGGTCAGCGC CCGGGAGGCG
CCGTATCCCT ACTTCCCGTA CCGCAGGCAG GAAGGTTTCG CACTGCTCGA TCCGCCGGTG
GCGGGTTAG
 
Protein sequence
MEYRRVGESG LTVSEISFGA ATFGGVGDFF GAWGDTGVEG ARRIVDICLE AGVTLFDTAD 
VYSDGASEEV LGEALRGRRD RVLISTKAAL PTSTGWGTSR ARLLRAVEDA LRRLRTDRID
LFQLHGYDSG TPIEEVVATL DTLLTQGKVR YTGVSNFSGW QLMKSLAVAD GAHRTRHIAH
QVYYSLVGRD YEWELMPLGL AEGVGALVWS PLGWGRLTGR IRRGRPLPER SRLHATADAG
PPVDEDRLYA VVDTLDDIAA ETGRTVAQIA LNWLLRRPTV ASVIIGARNE EQLRENLGAV
GWRLDDEQIA RLDAVSAREA PYPYFPYRRQ EGFALLDPPV AG