Gene Mkms_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0031 
Symbol 
ID4615598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp34576 
End bp35811 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content58% 
IMG OID639789708 
Producttype I restriction-modification system specificity subunit 
Protein accessionYP_936040 
Protein GI119866088 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.404176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.032892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTGT GGCGGGAGTC TGTGCTCGGA GATCTATGCA CGAGAGTGAC GGTCGGGCAC 
GTCGGAAAGA TGGCCACCGA GTACGTTCCG GACGGCGTCC CTTTCCTCCG GTCACAAAAC
GTGCGGCCTT TCGTGATTGA CAAGCGCGGC TTGCTCTACA TCGGTGACGA CTTCAACGCA
AAGCTGCGCA AATCGGCGCT CACTGCGGGT GACGTCGTTA TCGTCCGCAC GGGATATCCG
GGAACGGCAG CTGTCGTCCC CGAGGATCTT GATGGATCCA ACTGCGCCGA TCTTGTTGTC
ATTACACCGT CAGACGCATT GAATCCTCAC GTGCTTGCAG CGCTCTTCAA CTCGGTCTAC
GGGCAGCACG CGGTCAGTTC GCAATTAGTT GGCTCTGCGC AACAGCACTT CAACGTTGGC
TCGGCCAAGA CGATGCGGGT CCGACTGCCC GATCGTGCTG AGCAGGACCA CATCGCAGCA
GTCCTCTGTT CGATCAATGA CTTGATCGAA AACAACCGAC GACGTGTGGA GGTTTTGGAG
GGGATGGCGC GGACCATCTA CCGCGAGTGG TTCGTGAAAT TCCGCTACCC AGGCAACGAA
GGCGTCCCTC TTGTCGACTC TGCGCTGGGC CCAGCACCGA AGGGGTGGGA AGTCGCGAAT
CTATTCGACG CTGCTGACGT CGGCTTTGGG TACTCATTCA AGTCTCCCCG GTTTTCGAAT
TCTGGTCCAT TCCAGGTGAT TCGGATCCGC GACATCCCAG TCGGCATCTC AAGGACATAT
ACCGATGAAG CAGCAGATCC GCGCTACGCC GTCTATGACG ATGACGTGCT TATAGGTATG
GACGGTGACT TCCACATGAC GGTCTGGACT GGTGAAGACG CGTGGCTGAA CCAGCGAGTC
ACCCGCCTTC GCCCGAGGCT CGGGCTGTCC GCGCTTCATC TATTGCTCGC GATCGAGGAG
CAGATCAAAG ACTGGAACCG CGCAATTGTT GGCACGACTG TGGCGCATCT AGGTAAGAAG
CATCTCCAAC TTGTCAACGT CCTCGTGCCG AATGATGCAG TACGCATAGA CGCATCTGTC
GTGTTTGCGC CCATCATGGA GGAGCGTCGT GCGCTCATCC AATCAAGTCG GCGGCTCGCC
GCTCTTCGCG ACCTCCTGCT TCCGAAGCTG GTCAGCGGAC AGATCGACGT TTCCGCACTC
GACTTGGATG CAGTGGTTGG AGAACAGGTG GCGTGA
 
Protein sequence
MTVWRESVLG DLCTRVTVGH VGKMATEYVP DGVPFLRSQN VRPFVIDKRG LLYIGDDFNA 
KLRKSALTAG DVVIVRTGYP GTAAVVPEDL DGSNCADLVV ITPSDALNPH VLAALFNSVY
GQHAVSSQLV GSAQQHFNVG SAKTMRVRLP DRAEQDHIAA VLCSINDLIE NNRRRVEVLE
GMARTIYREW FVKFRYPGNE GVPLVDSALG PAPKGWEVAN LFDAADVGFG YSFKSPRFSN
SGPFQVIRIR DIPVGISRTY TDEAADPRYA VYDDDVLIGM DGDFHMTVWT GEDAWLNQRV
TRLRPRLGLS ALHLLLAIEE QIKDWNRAIV GTTVAHLGKK HLQLVNVLVP NDAVRIDASV
VFAPIMEERR ALIQSSRRLA ALRDLLLPKL VSGQIDVSAL DLDAVVGEQV A