Gene Mkms_3586 details

Gene Information

Locus tag: Mkms_3586
Symbol:
ID: 4611516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3774621
End bp: 3776201
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 70%
IMG OID: 639793262
Product: prohead peptidase
Protein accession: YP_939570
Protein GI: 119869618
COG category: [R] General function prediction only
COG ID: [COG3740] Phage head maturation protease
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
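
The coordinates and lengths listed above agree with each other, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either point explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3774621, 3776201)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both results match the Gene Length and Protein Length fields of the record.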


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.286744
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0669671
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACACGC GGGCAGTCGA GTTGACCGAG GTCCGCACCG ACGACGACGC CGGGACTTTT 
ACCGGTCTCG CCGCCGGGTA CGACAACGTC GACACGCACG GCACCGTTCT ACAGCGCGGC
GCGTTCGCAT CCTCGCTCGC CGGTGGCGGC GTCGTTCCGT TGTTCTGGGA ACACGGCCAC
GACGATCCGC GGGCCATCGT CGGCGAGGTG ACCGCGGCCG TTGAGACCAC CCGCGGGCTG
GAAATCGTCG GCAAGCTCGA CACCGACACC GAACGCGGCG CCGCCGCTTA CCGGGCGGTC
AAAGGCCGAC GTATCCGCGG TCTGTCGGTC GGGATGCGCC CGACGCAGCG GCGCGGGGCG
AGCATCATCG CCGCCGACCT CTGCGAAATC TCGCTGGTCA TGCGCCCCAG CAACAGCCGC
GCGCTCGTTG AGTCGGTCCG GTCGGCCGAC GACGCGCTTC AAACCCGGGC GGCCAGCGCG
GTCGCCACTT TCGAGACCAT CGCAAAGGAC ACCACCATGC CCGAGACCAT CACCACCGAA
CGCCGCGACG AGCTCGTGGC CGAGACCCGC GGGCTCGTGG CCGCGGCTCA GGGCCGCACC
TTGAGCGCTG AGGAAGTCGC CACCGTCGAG ACCAACACCG AGGCGATCCG CCGCCACGAC
GAGCAGGCGT TGGAGACGCG CAACGACGCG CAGGCGGCGC GGCTGGCTCA GGCGCTCGGC
CAGGCCATCG ACACCCGTTC GGGCGGTCGG CAGTCGCCGT TCATGCTCAG CGCCGACAAC
GTCACCACGC TCGAGACCGC GCGCAAGCGC TTCGAGAACA TCACCGTTCT CGAGACCCGC
GCGGCGCTGG CGACCACCGA CATGGGCACC GCTCGCGAGT ACGGCCCGAA CGGCCTGCAG
GCGCCGCGGT CGCTGTGGCG TTCGGCCGGC ATCCCGACGA CCGCACCGGA CGGGTACAGC
GGCGTCGTTC CGCAGTTCAC GCTGCCCGGT GGCGCGGTGC TCGTCGGTGA GGGCGTCGAC
CACCAGGAGT TCGACGGCGT CAACCCCGAC GCGGTGACGA TCGGCCGTGC CGGTGCGTGG
TCGACGCTGA CCTCCGAAGC GCTGCTATCC ACGAGCATCA CCGAGGTTTC GGCCGCGCAC
GCGCGCATCA TCGCCCGCAA CGTTGACCGT GCGACGGTGG CGAAGATCGA GGACGCCAGC
CCGGACACGA TGAGCATTGA TCAGGCGTTG GTGACGGTGG CTGCCGAATG CGCCTGCGAT
GTCAGCGACT TGTGGATTGT CGGTGCGCCG GCCGCGGTGG CGGCGCTCGT CGGCAATGCG
ACCTTCACGC CCGCCAACGG CGGCGACGCA GAGTCCTACG CATCCCGCTA CGGCGGTGCG
GCGGTGTACC CGACGACCTC GGCGACCGCG GACACGCTGA CGGTGTTCCA TCCGCAGAGC
TTCCGCGCGT TCGCGTCGCC ATTGTCGTCG GGCGTGTTCG TGGATCCGAA GTCGGGCAAG
CAGGACTTTG GTCAGTGGAT GTTCTACGGG CTCGGACAGG CGCTCGTGGG CGCCGCGATC
ACCGTGGACA CCACCCCATA G
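
The sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed. `clean` and `gc_fraction` are hypothetical helper names, and the excerpt is only the first line of the gene sequence, so its value is not expected to equal the 70% reported for the whole gene.

```python
def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only; paste all lines to check the whole gene.
excerpt = "ATGCACACGC GGGCAGTCGA GTTGACCGAG GTCCGCACCG ACGACGACGC CGGGACTTTT"
print(len(clean(excerpt)))             # 60
print(f"{gc_fraction(excerpt):.0%}")   # GC content of the excerpt only
```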
 
Protein sequence
MHTRAVELTE VRTDDDAGTF TGLAAGYDNV DTHGTVLQRG AFASSLAGGG VVPLFWEHGH 
DDPRAIVGEV TAAVETTRGL EIVGKLDTDT ERGAAAYRAV KGRRIRGLSV GMRPTQRRGA
SIIAADLCEI SLVMRPSNSR ALVESVRSAD DALQTRAASA VATFETIAKD TTMPETITTE
RRDELVAETR GLVAAAQGRT LSAEEVATVE TNTEAIRRHD EQALETRNDA QAARLAQALG
QAIDTRSGGR QSPFMLSADN VTTLETARKR FENITVLETR AALATTDMGT AREYGPNGLQ
APRSLWRSAG IPTTAPDGYS GVVPQFTLPG GAVLVGEGVD HQEFDGVNPD AVTIGRAGAW
STLTSEALLS TSITEVSAAH ARIIARNVDR ATVAKIEDAS PDTMSIDQAL VTVAAECACD
VSDLWIVGAP AAVAALVGNA TFTPANGGDA ESYASRYGGA AVYPTTSATA DTLTVFHPQS
FRAFASPLSS GVFVDPKSGK QDFGQWMFYG LGQALVGAAI TVDTTP
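
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that, not necessarily how the record was generated. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted; since this gene starts with ATG, that difference does not matter here. `translate` is an illustrative name, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order: each position cycles through T, C, A, G,
# with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    bases = "".join(seq.split()).upper()  # remove display spacing
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))


first_line = "ATGCACACGC GGGCAGTCGA GTTGACCGAG GTCCGCACCG ACGACGACGC CGGGACTTTT"
last_line = "ACCGTGGACA CCACCCCATA G"
print(translate(first_line))  # MHTRAVELTEVRTDDDAGTF
print(translate(last_line))   # TVDTTP*
```

The first output matches the first twenty residues of the protein sequence, and the second matches its last six residues followed by the TAG stop codon.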