Gene Mmcs_3777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_3777 
Symbol 
ID4112608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4029860 
End bp4031119 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content57% 
IMG OID638032916 
Productrestriction endonuclease S subunits-like protein 
Protein accessionYP_640939 
Protein GI108800742 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTTGGG CTCAGGAGGT AACGCTCGCT GAACTCGCCG AGGGCGGGCT GTTCTCTGAT 
GGAGACTGGG TTGAATCGAA GGATCAAGAT GCGAGCGGTG ACGTACGGTT GACGCAGTTA
GCCGACGTGG GCGTTGGCGA ATTTCGTGAT CGCTCTGACC GCTGGATGCG GCGTGATCAA
GCACACCGTC TTCGTTGCAC GTTCCTGGAG GGTGACGACG TTCTCATTGC TCGGATGCCT
GATCCGATTG GACGTTCGTG CTTGGTGCCT TCGAGCGTCG GATCGGCCGT AACTGTTGTT
GATGTCGCAA TCCTCCGACT TGCGCGGCGA GACGCAAACC CTAGGTACGT CATGTGGGCA
CTAAACTCCC CGAGGTTCCA CTCCAAGGTT GTCGCCTTGC AGTCGGGCAC TACAAGGAAG
CGAATATCCA GAAAAAATCT TGCGTCACTC ACGATTCCAT TACCAACCCT CGACGAACAA
AATCGCATTG TCGACCTTCT CGAAGACCAC CTGTCGCGCC TCGATGCTGC TGAGTCCTCG
CTCCGACTCG CGATGCAAAA AGCTGATGCG ATGACGACTG CATCTCTCGA TCGGCAAACG
ACAGCGGGGT CCAGGGCTTG GCGCGATACA ACCATCGGAG CGATGGCGGA GTTGGTCGAG
TATGGATCAA GTGCGAAATG TGCTGGACAA GCCGCTGACT CCGACGTCCC TGTGCTCCGC
ATGGGCAATA TCCAGAATGG GAAGATCAAT TGGACTGGAT TGAAGTACTT GCCCGCTGGC
CACGCGGAGT TCCCGAAGCT GCTGCTGCAA TCGGGTGATC TGGTTTTCAA TCGTACGAAC
AGCGCTGAGC TAGTGGGTAA GTCAGCGGTC TTCGAAGACA CTCGTGCCGC GTCATTCGCG
TCGTATCTGA TACGAGTGAG GTTCGGCCAG GAAGTGAATC CTGCGTGGGC GAACATGGTC
ATCAACAGCC CGGCAGGCCG ACGATATGTG AAGTCGGTTG CATCGCAGCA AGTTGGTCAG
GCGAACGTGA ATGGGACGAA ATTGAAAGCG TTTCCGTTAC CGTTGCCACC CCTGGATGAG
CAATGTCGTC GCGTCCGTGC TCATGATGAG GTCGTGGTGA GTCGCGAACG ACTTCACCAT
CAAATTGCAG ATTTGGTGGT GAGGGCCGCT GGTCTCCGTC GCGCCCTCCT CGCCGCAGCA
TTCACCGGCC GCCTGACGAA CTCTGCGGAA GGACTTCTCG AAGAACTCGA ATCCGTATGA
 
Protein sequence
MSWAQEVTLA ELAEGGLFSD GDWVESKDQD ASGDVRLTQL ADVGVGEFRD RSDRWMRRDQ 
AHRLRCTFLE GDDVLIARMP DPIGRSCLVP SSVGSAVTVV DVAILRLARR DANPRYVMWA
LNSPRFHSKV VALQSGTTRK RISRKNLASL TIPLPTLDEQ NRIVDLLEDH LSRLDAAESS
LRLAMQKADA MTTASLDRQT TAGSRAWRDT TIGAMAELVE YGSSAKCAGQ AADSDVPVLR
MGNIQNGKIN WTGLKYLPAG HAEFPKLLLQ SGDLVFNRTN SAELVGKSAV FEDTRAASFA
SYLIRVRFGQ EVNPAWANMV INSPAGRRYV KSVASQQVGQ ANVNGTKLKA FPLPLPPLDE
QCRRVRAHDE VVVSRERLHH QIADLVVRAA GLRRALLAAA FTGRLTNSAE GLLEELESV