Gene Mkms_3562 details

Gene Information

Locus tag: Mkms_3562
Symbol:
ID: 4611492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3753331
End bp: 3754911
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 70%
IMG OID: 639793238
Product: prohead peptidase
Protein accession: YP_939546
Protein GI: 119869594
COG category: [R] General function prediction only
COG ID: [COG3740] Phage head maturation protease
TIGRFAM ID: [TIGR01543] phage prohead protease, HK97 family
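
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3753331
END_BP = 3754911
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in one untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Under the stated assumptions both checks agree with the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span evaluates to 1581 bp and the expected protein length to 526 aa, matching the listed values.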


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.530159
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACACGC GGGCAGTCGA GTTGACCGAG GTCCGCACCG ACGACGACGC CGGGACTTTT 
ACCGGTCTCG CCGCCGGGTA CGACAACGTC GACACGCACG GCACCGTTCT ACAGCGCGGC
GCGTTCGCAT CATCGCTCGC CGGTGGCGGC GTCGTTCCGT TGTTCTGGGA ACACGGCCAC
GACGATCCGC GGGCCATCGT CGGCGAGGTG ACCGCGGCCG TTGAGACCAC CCGCGGGCTG
GAAATCGTCG GCAAGCTCGA CACCGACACC GAACGCGGCG CCGCCGCTTA CCGGGCGGTC
AAAGGCCGAC GTATCCGCGG TCTGTCGGTC GGGATGCGCC CGACGCAGCG GCGCGGGGCG
AGCATCATCG CCGCCGACCT CTGCGAAATC TCGCTGGTCA TGCGCCCCAG CAACAGCCGC
GCGCTCGTTG AGTCGGTCCG GTCGGCCGAC GACGCGCTTC AAACCCGGGC GGCCAGCGCG
GTCGCCACTT TCGAGACCAT CGCAAAGGAC ACCACCATGC CCGAGACCAT CACCACCGAA
CGCCGCGACG AGCTCGTGGC CGAGACCCGC GGGCTCGTGG CCGCCGCTCA GGGCCGCACG
CTGACCGCCG AGGAAGTCGC CACCATCGAG ACCAACACCG AGACGATCCG CCGCCACGAC
GAGCAGGCGT TGGAGACGCG CAACGACGCG CAGGCGGCGC GGCTGGCTCA GGCGCTCGGC
CAGGCCATCG ACACCCGTTC GGGCGGTCGG CAGTCGCCGT TCATGCTCAG CGCCGACAAC
GTCACCACGC TCGAGACCGC GCGCAAGCGC TTCGAGAACA TCACCGTTCT CGAGACCCGC
GCGGCGCTGG CGACCACCGA CATGGGCACC GCTCGCGAGT ACGGCCCGAA CGGTCTGCAG
GCGCCGCGGT CGCTGTGGCG TTCGGCCGGT ATCCCGACGA CCGCGCCCGA CGGGTACAGC
GGCGTGGTCC CGCAGTTCAC GCTGCCCGGT GGGGCGGTGC TCGTCGGTGA GGGCGTCGAC
CACCAGGAGT TCGACGGGGT GAACCCCGAC GCGGTGACGA TCGGCCGTGC CGGTGCGTGG
TCGACACTGA CCTCCGAGGC GCTGCTATCC ACGAGCATCA CCGAGGTTTC GGCCGCCCAC
GCGCGCATCA TCGCCCGCAA CGTTGACCGT GCGACGGTGG CGAAGATCGA AGATGCCAGC
CCGGACACGA TGAGCATTGA TCAGGCGTTG GTGACGGTGG CTGCCGAATG CGCCTGCGAT
GTCAGCGACT TGTGGATTGT CGGTGCGCCG GCCGCGGTGG CGGCGCTCGT CGGCAATGCG
ACCTTCACGC CCGCCAACGG CGGGGACGCA GAGTCCTACG CATCCCGCTA CGGCGGTGCG
GCGGTGTACC CGACGACCTC GGCGACCGCG GGCACGCTGA CGGTGTTTCA TCCCCAGAGT
TTCCGCGCGT TCGCGTCGCC GCTGTCGTCG GGCGTGTTCG TGGATCCGAA GTCGGGCAAG
CAGGACTTCG GTCAGTGGAT GTTCTTCGGG CTCGGTCAGG CGCTCGTGGG CGCCGCGATC
ACCGTGGACA CCACCCCATA G
 
Protein sequence
MHTRAVELTE VRTDDDAGTF TGLAAGYDNV DTHGTVLQRG AFASSLAGGG VVPLFWEHGH 
DDPRAIVGEV TAAVETTRGL EIVGKLDTDT ERGAAAYRAV KGRRIRGLSV GMRPTQRRGA
SIIAADLCEI SLVMRPSNSR ALVESVRSAD DALQTRAASA VATFETIAKD TTMPETITTE
RRDELVAETR GLVAAAQGRT LTAEEVATIE TNTETIRRHD EQALETRNDA QAARLAQALG
QAIDTRSGGR QSPFMLSADN VTTLETARKR FENITVLETR AALATTDMGT AREYGPNGLQ
APRSLWRSAG IPTTAPDGYS GVVPQFTLPG GAVLVGEGVD HQEFDGVNPD AVTIGRAGAW
STLTSEALLS TSITEVSAAH ARIIARNVDR ATVAKIEDAS PDTMSIDQAL VTVAAECACD
VSDLWIVGAP AAVAALVGNA TFTPANGGDA ESYASRYGGA AVYPTTSATA GTLTVFHPQS
FRAFASPLSS GVFVDPKSGK QDFGQWMFFG LGQALVGAAI TVDTTP
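
The relationship between the gene sequence and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the pipeline used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it assumes the reading frame begins at the first base. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("ATGCACACGC GGGCAGTCGA GTTGACCGAG"))  # MHTRAVELTE
    # Final 21 bases, ending in the TAG stop codon.
    print(translate("ACCGTGGACA CCACCCCATA G"))  # TVDTTP
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide data.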