Gene Mmcs_1605 details

Gene Information

Locus tag: Mmcs_1605
Symbol:
ID: 4110441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1740535
End bp: 1741563
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 69%
IMG OID: 638030726
Product: histidinol-phosphate phosphatase
Protein accession: YP_638772
Protein GI: 108798575
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family
TIGRFAM ID: [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family
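
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1740535, 1741563)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed Gene Length (1029 bp) and Protein Length (342 aa).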


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTCGTCG ACCGTCGGGC CGTTGATCAC CCCGGAGATC GTGACGAACT GCTCCCCCGT 
CACGTCATCC GGACGGGGGC TGACGCCGGT CACCAGCAGA GTCCCCTGAG CGAGTTCGCC
TCGCGGACCC CGGCTCCGCA TGATCCACGG CGCGGCCAGC ACGGCGATCG CCCCGATCAA
CAACAGGAGC ACAGCGAATT CCCACACACG GCCATGGTAG GACTTGCAGA CATGAGCACC
GGTTCCTCCA GCGTGGCCGA CGATCTGGCG TTGGCGTTGC GGCTCGCCGA CCATGCCGAC
GCCGTCACCG TCGACCGGTT CCGCGCACTG GACCTCCACG TCGAGACCAA ACCCGATCTC
ACGCCCGTGA CCGACGCCGA CCGTTCCGTG GAGAACGATC TGCGGCGCGC ACTCGCCGGG
GAGCGCGGCG ACGACTCGGT CCTCGGTGAA GAGTTCGGTG GCACAGCTGT TTTCAGCGGC
CGGCAGTGGG TGATCGACCC GATCGACGGG ACGAAGAACT TCGTGCGGGG TGTCCCGATC
TGGGCGACGC TGATCTCGCT GCTGAACGAC GGGGTGCCGG TGGTCGGCGT GGTCAGTGCG
CCAGCCCTGC ACCGTCGCTG GTGGGCGGCC GATGGGCTGG GCGCCTTCGT CACGGTCTCC
GGCGAGTCGC CGCGGCGGCT GTCGGTGTCG AAGGTGGCCG AACTGGATTC GGCCAGCCTG
TCGTTCTCCA GCCTGTCCGG GTGGGCCAAG CGCGGTCTGC GAGACCGATT CATCGACCTC
ACGGACGCCG TCTGGCGCGT CCGAGGGTTC GGCGACTTCT TCTCCTACTG CCTGTTGGCC
GAGGGGGCGG TGGACATCGC GGCCGAACCC GAGGTGTCGC TGTGGGATCT GGCCGCGATC
GACATTCTCG TGCGCGAGGC CGGTGGGACG TTCACGAATC TCGACGGCGC GGCGGGTCCG
CACGGAGGCA GCGTGGTCGC CTCCAACGGC CTGCTCCACG ACGCGGCGCT GGGCCACCTC
TCGCGTTAA
 
Protein sequence
MLVDRRAVDH PGDRDELLPR HVIRTGADAG HQQSPLSEFA SRTPAPHDPR RGQHGDRPDQ 
QQEHSEFPHT AMVGLADMST GSSSVADDLA LALRLADHAD AVTVDRFRAL DLHVETKPDL
TPVTDADRSV ENDLRRALAG ERGDDSVLGE EFGGTAVFSG RQWVIDPIDG TKNFVRGVPI
WATLISLLND GVPVVGVVSA PALHRRWWAA DGLGAFVTVS GESPRRLSVS KVAELDSASL
SFSSLSGWAK RGLRDRFIDL TDAVWRVRGF GDFFSYCLLA EGAVDIAAEP EVSLWDLAAI
DILVREAGGT FTNLDGAAGP HGGSVVASNG LLHDAALGHL SR
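
The protein sequence can be related back to the gene sequence through the listed translation table (11). The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes the NCBI table 11 codon assignments, which match the standard code for amino acids, and assumes the table 11 alternative start codons (such as GTG) are read as M at the first position. The names `translate_cds` and `gc_fraction` are hypothetical helpers.

```python
# Codon table laid out in NCBI order: bases T, C, A, G for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Start codons assumed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a table 11 start codon as M and stopping at a stop codon.

    Raises ValueError on a length that is not a multiple of 3 or on bases outside T, C, A, G.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is translated as methionine
        if residue == "*":
            break  # stop codon ends the protein
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCTCGTCG ACCGTCGGGC CGTTGATCAC"))  # MLVDRRAVDH
```

The first ten codons give MLVDRRAVDH, matching the start of the protein sequence: the initial GTG is read as M rather than V. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.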