Gene Mmcs_3514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3514 
Symbol 
ID4112346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp3740139 
End bp3741719 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content70% 
IMG OID638032649 
Productprohead peptidase 
Protein accessionYP_640677 
Protein GI108800480 
COG category[R] General function prediction only 
COG ID[COG3740] Phage head maturation protease 
TIGRFAM ID[TIGR01543] phage prohead protease, HK97 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACGC GGGCAGTCGA GTTGACCGAG GTCCGCACCG ACGACGACGC CGGGACTTTT 
ACCGGTCTCG CCGCCGGGTA CGACAACGTC GACACGCACG GCACCGTTCT ACAGCGCGGC
GCGTTCGCAT CCTCGCTCGC CGGTGGCGGC GTCGTTCCGT TGTTCTGGGA ACACGGCCAC
GACGATCCGC GGGCCATCGT CGGCGAGGTG ACCGCGGCCG TTGAGACCAC CCGCGGGCTG
GAAATCGTCG GCAAGCTCGA CACCGACACC GAACGCGGCG CCGCCGCTTA CCGGGCGGTC
AAAGGCCGAC GTATCCGCGG TCTGTCGGTC GGGATGCGCC CGACGCAGCG GCGCGGGGCG
AGCATCATCG CCGCCGACCT CTGCGAAATC TCGCTGGTCA TGCGCCCCAG CAACAGCCGC
GCGCTCGTTG AGTCGGTCCG GTCGGCCGAC GACGCGCTTC AAACCCGGGC GGCCAGCGCG
GTCGCCACTT TCGAGACCAT CGCAAAGGAC ACCACCATGA CCGAGCCCAT CACCACCGAA
CGCCGCGACG AGCTCGTGGC CGAGACCCGC GGGCTCGTGG CCGCGGCTCA GGGCCGCACG
CTGACCGCCG AGGAAGTCGC CACCATCGAG ACCAACACCG AGACGATCCG CCGCCACGAC
GAGCAGGCGT TGGAGACGCG CAACGACGCG CAGGCGGCCA ACATCGCCCG CGCGCTCGGT
CAGGCCATCG ACACCCGTTC GGGCGGTCGG CAGTCGCCGT TCATGCTCAG CGCCGACAAC
GTCACCACGC TCGAGACCGC GCGCAAGCGC TTCGAGAACA TCACCGTTCT CGAGACCCGC
GCGGCGCTGG CGACCACCGA CATGGGCACC GCTCGCGAGT ACGGCCCGAA CGGCCTGCAG
GCGCCGCGGT CGCTGTGGCG TTCGGCCGGC ATCCCGACGA CCGCACCGGA CGGGTACAGC
GGCGTCGTTC CGCAGTTCAC GCTGCCCGGT GGCGCGGTGC TCGTCGGTGA GGGCGTCGAC
CACCAGGAGT TCGACGGCGT CAACCCCGAC GCGGTGACGA TCGGCCGTGC CGGTGCGTGG
TCGACACTGA CCTCCGAAGC GCTGCTATCC ACGAGCATCA CCGAGGTTTC GGCCGCGCAC
GCGCGCATCA TCGCCCGCAA CGTTGACCGT GCGACGGTGG CGAAGATCGA GGACGCCAGC
CCGGACACGA TGAGCATTGA TCAGGCGTTG GTGACGGTGG CTGCCGAATG CGCCTGCGAT
GTCAGCGACT TGTGGATTGT CGGTGCGCCG GCCGCGGTGG CGGCGCTCGT CGGCAATGCG
ACCTTCACGC CCGCCAACGG CGGCGACGCA GAGTCCTACG CATCCCGCTA CGGCGGTGCG
GCGGTGTACC CGACGACCTC GGCGACCGCG GACACGCTGA CGGTGTTCCA TCCGCAGAGC
TTCCGCGCGT TCGCGTCGCC ATTGTCGTCG GGCGTGTTCG TGGATCCGAA GTCGGGCAAG
CAGGACTTCG GTCAGTGGAT GTTCTACGGG CTCGGACAGG CGCTCGTGGG CGCCGCGATC
ACCGTGGACA CCACCCCATA G
 
Protein sequence
MHTRAVELTE VRTDDDAGTF TGLAAGYDNV DTHGTVLQRG AFASSLAGGG VVPLFWEHGH 
DDPRAIVGEV TAAVETTRGL EIVGKLDTDT ERGAAAYRAV KGRRIRGLSV GMRPTQRRGA
SIIAADLCEI SLVMRPSNSR ALVESVRSAD DALQTRAASA VATFETIAKD TTMTEPITTE
RRDELVAETR GLVAAAQGRT LTAEEVATIE TNTETIRRHD EQALETRNDA QAANIARALG
QAIDTRSGGR QSPFMLSADN VTTLETARKR FENITVLETR AALATTDMGT AREYGPNGLQ
APRSLWRSAG IPTTAPDGYS GVVPQFTLPG GAVLVGEGVD HQEFDGVNPD AVTIGRAGAW
STLTSEALLS TSITEVSAAH ARIIARNVDR ATVAKIEDAS PDTMSIDQAL VTVAAECACD
VSDLWIVGAP AAVAALVGNA TFTPANGGDA ESYASRYGGA AVYPTTSATA DTLTVFHPQS
FRAFASPLSS GVFVDPKSGK QDFGQWMFYG LGQALVGAAI TVDTTP