Gene Mkms_2997 details


Gene Information

Locus tag: Mkms_2997
Symbol:
ID: 4610827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3129101
End bp: 3130105
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 73%
IMG OID: 639792663
Product: HAD family hydrolase
Protein accession: YP_938981
Protein GI: 119869029
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.411928
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.553212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGCAAC ATGATTGCCT GCTCCTCGAT CTGGATGGCA CGGTGTTCCG CGGCCACGAA 
CCGACCACCG GAGCCGTCGA GAGCCTCGCC GGGCTGAGCG CGCGGGTCCT CTACGTCACC
AACAACGCCT CACGCAGCCC GGGCGACGTC GCAGGCCACC TCGTCGAGTT GGGGTTCCAC
GCCGACGCCG CCGACGTGGT GACCAGCGCC CAGAGCGCAG CGCACCTGCT CGCCGCCCAA
CTTCCCGCCG GGGCGCGGGT GCTGGTCGTG GGCACCGAGG CGTTGGCGGC CGAGGTCGAC
CTCGTGGGAC TGCAGCCGGT CCGGCAGTTC GCCGACGATC CCGCCGCCGT TGTGCAGGGG
CACAACCCGG AGACGGCGTG GGCGGACCTC GCCGAGGCGG CGCTGGCCCT GCGCGCCGGG
GCCCTGTGGG TGGCGGCCAA TGTCGATCTG ACCCTGCCGT CCGAGCGCGG ACTGTTGCCT
GGCAACGGTT CGATGGTCGC CGCCCTGCAG GCCGCGACCG CCCGCGAACC TCAGGTCGCC
GGCAAGCCGC AGCCGACGCT GATGCGGGAT GCGTTGAGCC GGGGCGACTT TCACACACCG
CTGGTTGTCG GTGACCGTCT GGACACCGAC ATCGCCGGCG CCAACGCGGC GTCGTTGCCG
AGCCTGATGG TGCTCAGCGG TGTCAGCACC GCCGACGAGG TGCTGCGTGC GGTGCCCCAG
GAGCGGCCCG ACTACATCGC CGAGGATCTG CGCTCCCTGG ACGCACCGGC CGACGACCTG
CGGGTCGGTC CCCACCCCGG CTGGCGCATC GAGGTCGATG GCGCGGACGT GACCGTCCAC
GCCGACGGCG TCGACCGCGG GGACGACCTC TCGGTGCTGC GTGCGACGGC CCACGTGGTG
TGGCAGTCGG ACCTGGCGGG CACGCCGTTC GCGGTCCGCG CGGGTGACGA CACCGCGGCC
GCCGCGCTGC AACGGTGGTC GCTGCTCACC GCCGCGATCG ACTAG
 
Protein sequence
MQQHDCLLLD LDGTVFRGHE PTTGAVESLA GLSARVLYVT NNASRSPGDV AGHLVELGFH 
ADAADVVTSA QSAAHLLAAQ LPAGARVLVV GTEALAAEVD LVGLQPVRQF ADDPAAVVQG
HNPETAWADL AEAALALRAG ALWVAANVDL TLPSERGLLP GNGSMVAALQ AATAREPQVA
GKPQPTLMRD ALSRGDFHTP LVVGDRLDTD IAGANAASLP SLMVLSGVST ADEVLRAVPQ
ERPDYIAEDL RSLDAPADDL RVGPHPGWRI EVDGADVTVH ADGVDRGDDL SVLRATAHVV
WQSDLAGTPF AVRAGDDTAA AALQRWSLLT AAID
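
Consistency check (illustrative)

The derived fields in this record (gene length, protein length, GC content, and the protein sequence under translation table 11) can be recomputed from the coordinates and the gene sequence. The following is a minimal Python sketch, not the database's own method. All function and variable names are hypothetical, the codon table is the standard NCBI ordering that table 11 shares with table 1, and it assumes that an initiator codon such as GTG is read as methionine at the first position.

```python
# Illustrative helpers for checking a gene record; all names are hypothetical.

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11 (assumed to encode Met
# when they occupy the first codon position).
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_table11(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in TABLE11_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# Coordinates and the first 60 bases are copied from the record above.
length_bp = gene_length(3129101, 3130105)
protein_aa = length_bp // 3 - 1  # one codon is the stop codon
first_60 = "GTGCAGCAAC ATGATTGCCT GCTCCTCGAT CTGGATGGCA CGGTGTTCCG CGGCCACGAA"

print(length_bp, protein_aa)
print(translate_table11(first_60))
```

With the values above, the sketch prints 1005 and 334, matching the Gene Length and Protein Length fields, and the translated fragment MQQHDCLLLDLDGTVFRGHE, matching the first 20 residues of the protein sequence. Passing the full gene sequence to gc_percent gives the GC content for comparison with the GC content field.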