Gene Mkms_4151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4151 
Symbol 
ID4612091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4379531 
End bp4380559 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content70% 
IMG OID639793835 
Productaldo/keto reductase 
Protein accessionYP_940133 
Protein GI119870181 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTATC GGCGGGTTGG CGAATCGGGT CTGACGGTGT CGGAGATCAG TTTCGGCGCA 
GCGACATTCG GCGGGGTCGG CGACTTCTTC GGCGCCTGGG GCGATACCGG CGTCGAGGGC
GCGCGTCGCA TCGTGGACAT CTGCCTGGAG GCTGGTGTCA CGCTGTTCGA CACCGCGGAC
GTGTACTCCG ACGGCGCCTC GGAGGAGGTG CTCGGCGAAG CCCTGCGCGG CCGGCGCGAC
CGGGTGCTCA TCTCCACCAA GGCCGCGCTG CCCACCTCGA CCGGCTGGGG CACCTCACGC
GCTCGGTTGC TGCGTGCGGT CGAGGATGCG CTGCGGCGGT TGCGGACCGA CCGCATCGAC
CTGTTCCAAC TGCACGGCTA CGACTCGGGG ACGCCGATCG AGGAAGTCGT GGCGACCCTC
GACACGCTGC TCACGCAGGG CAAGGTGCGC TACACCGGCG TGTCGAACTT CTCCGGATGG
CAGTTGATGA AATCGCTGGC GGTCGCCGAC GGCGCACACC GCACCCGCCA CATCGCCCAT
CAGGTCTACT ACTCGCTCGT CGGGCGGGAT TACGAATGGG AACTCATGCC GCTGGGCCTT
GCCGAGGGCG TCGGCGCGCT GGTGTGGAGT CCGCTGGGCT GGGGACGGCT CACCGGCCGG
ATCCGGCGCG GACGACCGCT GCCCGAGCGC AGCCGCCTGC ACGCGACCGC TGACGCGGGT
CCGCCCGTCG ACGAGGATCG GCTCTACGCC GTCGTCGACA CCCTCGACGA CATCGCCGCG
GAGACCGGAC GCACCGTGGC GCAGATCGCG CTCAACTGGC TCCTGCGGCG GCCGACCGTC
GCCTCGGTGA TCATCGGAGC CCGCAACGAG GAACAGCTGC GCGAGAACCT GGGCGCCGTC
GGCTGGCGAC TCGACGACGA GCAGATCGCC CGGCTGGACG CGGTCAGCGC CCGGGAGGCG
CCGTATCCCT ACTTCCCGTA CCGCAGGCAG GAAGGTTTCG CACTGCTCGA TCCGCCGGTG
GCGGGTTAG
 
Protein sequence
MEYRRVGESG LTVSEISFGA ATFGGVGDFF GAWGDTGVEG ARRIVDICLE AGVTLFDTAD 
VYSDGASEEV LGEALRGRRD RVLISTKAAL PTSTGWGTSR ARLLRAVEDA LRRLRTDRID
LFQLHGYDSG TPIEEVVATL DTLLTQGKVR YTGVSNFSGW QLMKSLAVAD GAHRTRHIAH
QVYYSLVGRD YEWELMPLGL AEGVGALVWS PLGWGRLTGR IRRGRPLPER SRLHATADAG
PPVDEDRLYA VVDTLDDIAA ETGRTVAQIA LNWLLRRPTV ASVIIGARNE EQLRENLGAV
GWRLDDEQIA RLDAVSAREA PYPYFPYRRQ EGFALLDPPV AG