Gene Mkms_2707 details

Gene Information

Locus tag: Mkms_2707
Symbol:
ID: 4615993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 2830531
End bp: 2831601
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 67%
IMG OID: 639792373
Product: aldo/keto reductase
Protein accession: YP_938692
Protein GI: 119868740
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
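
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Coordinates as listed in the record.
start_bp = 2830531
end_bp = 2831601

# Assumption: both endpoints are part of the gene (inclusive coordinates).
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1071
print(protein_length)  # 356
```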


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0109998
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCGGAG ACCTGTTGAT CGGTGACGCG CCCTTCTCCC ACGACCCCTG GGTGGCCAGG 
CGGGAACGCT ACGATTCGAT GCCGTACCGG CGGGTGGGTG ACTCCGGGCT GCTGCTGCCC
GCGATCTCGC TGGGCCTCTG GTACAACTTC GGCGACAACC GGCCGTTCGA CGTGCAACGC
GAGGTGCTGC GGTACGCCTT CGACCGCGGC ATCACGCATT TCGATCTCGC CAACAACTAC
GGCCCGCCGT ACGGTTCGGC CGAGGAGAAC TTCGGCCGGA TGCTGCGCCG GGACTTCAAG
CCGTATCGCA ACGAGTTGAT CGTCTCGACC AAGGCCGGCT GGGACATGTG GCCCGGACCG
TACGGGCAGC TCGGCGGCCG GGCCTACCTG CTCGCCAGCC TCGACGAATC ACTCGACCGT
CTCGGCCTCG ACTACGTCGA CATCTTCTAC TCGCACCGCA TCGATCCGAC GACACCGCTC
GAGGAGACCG TCGGCGCACT CGACACCGCG GTGCGAGCCG GTAAGACCCG CTACGTCGGG
GTCTCGTCGT ATTCGGCGGC CAAGACCGCC GAAGCGGCCG CGATCGCGAG ACGTCTCGGC
ACTCCGTTGG TGATCCACCA GCCGTCGTAC TCACTGCTGA ACCGGTGGAT CGAGGGCGAC
CTCACCACCG AACTCCGCAA CGCCGGCATG GGTGCGATCG CGTTCACCGC ACTGGCCCAG
GGTCTGCTGA CCGACCGCTA CCTGCAGTCC GACCCGAGCG AGATCGACCG TGCCACAGCA
CGACCCACGT TCAACGACGA GCACATCACC GACCGGGTGC GCGAGCAGCT GCGGGGTCTG
GCCGGCATCG CCGAACGTCG TGGACAGTCG CTGGCCCAAC TCGCGCTGGC GTGGGTGCTC
CGTGACCCGA CCGTCGCATC CACACTCGTC GGCGCGTCGA GCGTCGCGCA GCTCGAAGAG
AACCTCGGCG CCCTCGACAA CCTCGACTTC ACCGCCGACG AGCTCGCCGA AATCGACCGG
TACGCAACCG AATCCGGAAT CGACCTGTGG CGAGAGAGCT CCGATGTCTA G
 
Protein sequence
MSGDLLIGDA PFSHDPWVAR RERYDSMPYR RVGDSGLLLP AISLGLWYNF GDNRPFDVQR 
EVLRYAFDRG ITHFDLANNY GPPYGSAEEN FGRMLRRDFK PYRNELIVST KAGWDMWPGP
YGQLGGRAYL LASLDESLDR LGLDYVDIFY SHRIDPTTPL EETVGALDTA VRAGKTRYVG
VSSYSAAKTA EAAAIARRLG TPLVIHQPSY SLLNRWIEGD LTTELRNAGM GAIAFTALAQ
GLLTDRYLQS DPSEIDRATA RPTFNDEHIT DRVREQLRGL AGIAERRGQS LAQLALAWVL
RDPTVASTLV GASSVAQLEE NLGALDNLDF TADELAEIDR YATESGIDLW RESSDV
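
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative and is not the database's own pipeline: the function names `clean`, `gc_content`, and `translate` are hypothetical, and it assumes the gene sequence is listed 5' to 3' on the coding strand. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code.

```python
# Codon table built from the NCBI layout: bases in the order T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases (raises on an empty sequence)."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon.

    Only unambiguous A, C, G, T bases are handled; anything else raises KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGTCCGGAG ACCTGTTGAT CGGTGACGCG CCCTTCTCCC ACGACCCCTG GGTGGCCAGG"
print(translate(first_line))  # MSGDLLIGDAPFSHDPWVAR
```

Passing the full gene sequence to `translate` and `gc_content` in the same way gives values to compare with the protein sequence and the GC content listed in the record.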