Gene Mkms_3850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3850 
Symbol 
ID4611785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4064342 
End bp4065601 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID639793530 
Productrestriction endonuclease S subunits-like protein 
Protein accessionYP_939833 
Protein GI119869881 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.554786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTTGGG CTCAGGAGGT AACGCTCGCT GAACTCGCCG AGGGCGGGCT GTTCTCTGAT 
GGAGACTGGG TTGAATCGAA GGATCAAGAT GCGAGCGGTG ACGTACGGTT GACGCAGTTA
GCCGACGTGG GCGTTGGCGA ATTTCGTGAT CGCTCTGACC GCTGGATGCG GCGTGATCAA
GCACACCGTC TTCGTTGCAC GTTCCTGGAG GGTGACGACG TTCTCATTGC TCGGATGCCT
GATCCGATTG GACGTTCGTG CTTGGTGCCT TCGAGCGTCG GATCGGCCGT AACTGTTGTT
GATGTCGCAA TCCTCCGACT TGCGCGGCGA GACGCAAACC CTAGGTACGT CATGTGGGCA
CTAAACTCCC CGAGGTTCCA CTCCAAGGTT GTCGCCTTGC AGTCGGGCAC TACAAGGAAG
CGAATATCCA GAAAAAATCT TGCGTCACTC ACGATTCCAT TACCAACCCT CGACGAACAA
AATCGCATTG TCGACCTTCT CGAAGACCAC CTGTCGCGCC TCGATGCTGC TGAGTCCTCG
CTCCGACTCG CGATGCAAAA AGCTGATGCG ATGACGACTG CATCTCTCGA TCGGCAAACG
ACAGCGGGGT CCAGGGCTTG GCGCGATACA ACCATCGGAG CGATGGCGGA GTTGGTCGAG
TATGGATCAA GTGCGAAATG TGCTGGACAA GCCGCTGACT CCGACGTCCC TGTGCTCCGC
ATGGGCAATA TCCAGAATGG GAAGATCAAT TGGACTGGAT TGAAGTACTT GCCCGCTGGC
CACGCGGAGT TCCCGAAGCT GCTGCTGCAA TCGGGTGATC TGGTTTTCAA TCGTACGAAC
AGCGCTGAGC TAGTGGGTAA GTCAGCGGTC TTCGAAGACA CTCGTGCCGC GTCATTCGCG
TCGTATCTGA TACGAGTGAG GTTCGGCCAG GAAGTGAATC CTGCGTGGGC GAACATGGTC
ATCAACAGCC CGGCAGGCCG ACGATATGTG AAGTCGGTTG CATCGCAGCA AGTTGGTCAG
GCGAACGTGA ATGGGACGAA ATTGAAAGCG TTTCCGTTAC CGTTGCCACC CCTGGATGAG
CAATGTCGTC GCGTCCGTGC TCATGATGAG GTCGTGGTGA GTCGCGAACG ACTTCACCAT
CAAATTGCAG ATTTGGTGGT GAGGGCCGCT GGTCTCCGTC GCGCCCTCCT CGCCGCAGCA
TTCACCGGCC GCCTGACGAA CTCTGCGGAA GGACTTCTCG AAGAACTCGA ATCCGTATGA
 
Protein sequence
MSWAQEVTLA ELAEGGLFSD GDWVESKDQD ASGDVRLTQL ADVGVGEFRD RSDRWMRRDQ 
AHRLRCTFLE GDDVLIARMP DPIGRSCLVP SSVGSAVTVV DVAILRLARR DANPRYVMWA
LNSPRFHSKV VALQSGTTRK RISRKNLASL TIPLPTLDEQ NRIVDLLEDH LSRLDAAESS
LRLAMQKADA MTTASLDRQT TAGSRAWRDT TIGAMAELVE YGSSAKCAGQ AADSDVPVLR
MGNIQNGKIN WTGLKYLPAG HAEFPKLLLQ SGDLVFNRTN SAELVGKSAV FEDTRAASFA
SYLIRVRFGQ EVNPAWANMV INSPAGRRYV KSVASQQVGQ ANVNGTKLKA FPLPLPPLDE
QCRRVRAHDE VVVSRERLHH QIADLVVRAA GLRRALLAAA FTGRLTNSAE GLLEELESV