Gene Mkms_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1117 
Symbol 
ID4614495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1203069 
End bp1204733 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content69% 
IMG OID639790793 
ProductHAD family hydrolase 
Protein accessionYP_937120 
Protein GI119867168 
COG category[R] General function prediction only 
COG ID[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR01484] HAD-superfamily hydrolase, subfamily IIB 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.293003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.951396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGATTTT TCAAGGTCGT GGCCGTCGAC ATCGACGGCA CCCTCACGTC GAACGGTGCG 
CTGTCCTCGG CGGCCGTGCG CGCGATCCGC GACGCCCGGC TGAACGGAAC GCAGGTGGTG
CTGGTGACCG GTCGCATCGG GCGCGAGTTG CAGGCGGAGT TCCCGGATCT CTCCGACCAC
GTCGACGCAG TGGTGCTCGA GAACGGTGCG GTTGCGGTGG TCGACGGCCG ATCGGTGGCG
CTCGCACCGC CGGTGGATCC CGCGCTCGAC GCGGAACTGA GTGCGCGTGG AATACCGTTC
CGCCGAGGGG AGACCCTCAT CGCCGCGGAC GGTCAGTACG CCGCGGCCAC CGTGGAGGCG
ATCGGAGAAC TCGGTCTGGA CTGCCAGATC ATCCGTAACC GGGGTGCGCT GATGGTGCTG
CCTGCGGGCG TCACCAAGGG GACCGGTCTG TGCGGCGTGC TGGCCCGGAT GAACCGCTCC
CCGCACAACA CGATCGCGAT CGGAGACGCC GAGAACGACG TGTCGATGAT GGCGGCCGCC
GAACTCGGTG TGGCCGTCGC CAATGCGCCC CCCTCGGTGA AGGCCCACGC CGACGAGGTG
TTGAGCGAGA GCGACGGCGA GGGGGTCGCC GGGATACTCA CCGGACCGAT CTTGAGCGGA
GCGCGTCGCT GGTGCCCGAT GCGACGGTGG ATCGACATCG GTGCATTCGA CGACGGTGCG
CCGACTCGCC TCCCGGGCAG TCAGGGGCGG ATCGTGGTCA CCGGCCCGGC CGGTTCGGGG
AAGAGCCACA TCATCGGATT GATGGCGGAA CGCTGGATTC TCGCCGGGTA CGGGGTGCTG
GTGGTCGACC CCGAGGGCGA TCACACGCAA CTGGCGACGC TCGACCACGT GGCCGCCGTC
GACAGCCGCT ACCACCTGCC GCAACCGTCC GATCTCGTCG CCATGCTGCA TCCCAGCAGC
AGCGCGGTGG TGGATCTGTC CGCGCTGTCC ACCGACGAGA AGAGAGACTA CGTGCACCGG
TTGCGCCCTG TCGTCGAGGC GCACCGGGAA CAGTACGGCT TCCCGCACTG GACCGTCTAC
GACGAAGCGC ATCTGCTCGG CCCGGGGCAG GAGGTCCGGT GGGTTCGCCG GGGCGGATAC
GTGCTGTCCT CGTTCGTCGC CGCGGCGCTG CCGGCCGACG AGATCGACGC CAGTGACGTG
GTGGTCGAGA TGGAAGACTC CGATCAGAGT CCGTCCGCAT TGCATCCGGC GCCGCGCGCC
GCGGTGCGCT ACGGCGGAAA TCCCAAGCGG GCGTTCGCGG TTGCCGAGCG CCGCACGGCA
CACGTCAGGC ACCGGCACAA GTACGCCGAT GTCGCGCTGC CCAGAGAGCG GCGGTTCTAC
TTCCATTCCG TGGACGGTCA GTCGATTGCG CCCGCGGGCA CCATGGAGGA GTTCGGCGCC
GCGCTGCGAA GGCTCACCCC CCAGGCGCTG GAATTTCACC TCGAGCGCGG AGATTTCTCC
CGGTGGCTGG AGCGCATCAT CAACGACAGG AAACTGGCCG CGGAGGTGGC GTCGTGGGAG
GACGAGATGG CGGCGCATCG GGCGGCGGAG GTCGAACGGG TCCGCCAACA GCTCATCCGT
GCTGTTCGTG ACCGTTATCT CGACGACCGC GGTGCGCTGG ACTGA
 
Protein sequence
MGFFKVVAVD IDGTLTSNGA LSSAAVRAIR DARLNGTQVV LVTGRIGREL QAEFPDLSDH 
VDAVVLENGA VAVVDGRSVA LAPPVDPALD AELSARGIPF RRGETLIAAD GQYAAATVEA
IGELGLDCQI IRNRGALMVL PAGVTKGTGL CGVLARMNRS PHNTIAIGDA ENDVSMMAAA
ELGVAVANAP PSVKAHADEV LSESDGEGVA GILTGPILSG ARRWCPMRRW IDIGAFDDGA
PTRLPGSQGR IVVTGPAGSG KSHIIGLMAE RWILAGYGVL VVDPEGDHTQ LATLDHVAAV
DSRYHLPQPS DLVAMLHPSS SAVVDLSALS TDEKRDYVHR LRPVVEAHRE QYGFPHWTVY
DEAHLLGPGQ EVRWVRRGGY VLSSFVAAAL PADEIDASDV VVEMEDSDQS PSALHPAPRA
AVRYGGNPKR AFAVAERRTA HVRHRHKYAD VALPRERRFY FHSVDGQSIA PAGTMEEFGA
ALRRLTPQAL EFHLERGDFS RWLERIINDR KLAAEVASWE DEMAAHRAAE VERVRQQLIR
AVRDRYLDDR GALD