Gene Mmcs_0023 details


Gene Information

Locus tag: Mmcs_0023
Symbol:
ID: 4108911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 28661
End bp: 29896
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 58%
IMG OID: 638029149
Product: type I restriction-modification system specificity subunit
Protein accession: YP_637201
Protein GI: 108797004
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
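
The length fields in this record follow from the coordinates. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a complete CDS: codons minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(28661, 29896)
print(length)                     # 1236
print(protein_length_aa(length))  # 411
```

Both values agree with the Gene Length and Protein Length fields of this record.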


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.834107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGTGT GGCGGGAGTC TGTGCTCGGA GATCTATGCA CGAGAGTGAC GGTCGGGCAC 
GTCGGAAAGA TGGCCACCGA GTACGTTCCG GACGGCGTCC CTTTCCTCCG GTCACAAAAC
GTGCGGCCTT TCGTGATTGA CAAGCGCGGC TTGCTCTACA TCGGTGACGA CTTCAACGCA
AAGCTGCGCA AATCGGCGCT CACTGCGGGT GACGTCGTTA TCGTCCGCAC GGGATATCCG
GGAACGGCAG CTGTCGTCCC CGAGGATCTT GATGGATCCA ACTGCGCCGA TCTTGTTGTC
ATTACACCGT CAGACGCATT GAATCCTCAC GTGCTTGCAG CGCTCTTCAA CTCGGTCTAC
GGGCAGCACG CGGTCAGTTC GCAATTAGTT GGCTCTGCGC AACAGCACTT CAACGTTGGC
TCGGCCAAGA CGATGCGGGT CCGACTGCCC GATCGTGCTG AGCAGGACCA CATCGCAGCA
GTCCTCTGTT CGATCAATGA CTTGATCGAA AACAACCGAC GACGTGTGGA GGTTTTGGAG
GGGATGGCGC GGACCATCTA CCGCGAGTGG TTCGTGAAAT TCCGCTACCC AGGCAACGAA
GGCGTCCCTC TTGTCGACTC TGCGCTGGGC CCAGCACCGA AGGGGTGGGA AGTCGCGAAT
CTATTCGACG CTGCTGACGT CGGCTTTGGG TACTCATTCA AGTCTCCCCG GTTTTCGAAT
TCTGGTCCAT TCCAGGTGAT TCGGATCCGC GACATCCCAG TCGGCATCTC AAGGACATAT
ACCGATGAAG CAGCAGATCC GCGCTACGCC GTCTATGACG ATGACGTGCT TATAGGTATG
GACGGTGACT TCCACATGAC GGTCTGGACT GGTGAAGACG CGTGGCTGAA CCAGCGAGTC
ACCCGCCTTC GCCCGAGGCT CGGGCTGTCC GCGCTTCATC TATTGCTCGC GATCGAGGAG
CAGATCAAAG ACTGGAACCG CGCAATTGTT GGCACGACTG TGGCGCATCT AGGTAAGAAG
CATCTCCAAC TTGTCAACGT CCTCGTGCCG AATGATGCAG TACGCATAGA CGCATCTGTC
GTGTTTGCGC CCATCATGGA GGAGCGTCGT GCGCTCATCC AATCAAGTCG GCGGCTCGCC
GCTCTTCGCG ACCTCCTGCT TCCGAAGCTG GTCAGCGGAC AGATCGACGT TTCCGCACTC
GACTTGGATG CAGTGGTTGG AGAACAGGTG GCGTGA
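
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and to recompute a GC percentage comparable to the GC content field (reported as 58% for this gene). It is an illustration only: the helper names are hypothetical, and the start codon set is an assumption limited to the most common bacterial start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}     # stop codons in translation table 11
COMMON_STARTS = {"ATG", "GTG", "TTG"}   # assumption: most common starts only


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Check length divisible by 3, a common start codon, and a stop codon."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


print(gc_percent("ATGC GGCC"))                 # 75.0
print(looks_like_complete_cds("ATGAAA TGA"))   # True
```

Passing the full gene sequence above as one string to `gc_percent` would give a value to compare against the reported GC content.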
 
Protein sequence
MTVWRESVLG DLCTRVTVGH VGKMATEYVP DGVPFLRSQN VRPFVIDKRG LLYIGDDFNA 
KLRKSALTAG DVVIVRTGYP GTAAVVPEDL DGSNCADLVV ITPSDALNPH VLAALFNSVY
GQHAVSSQLV GSAQQHFNVG SAKTMRVRLP DRAEQDHIAA VLCSINDLIE NNRRRVEVLE
GMARTIYREW FVKFRYPGNE GVPLVDSALG PAPKGWEVAN LFDAADVGFG YSFKSPRFSN
SGPFQVIRIR DIPVGISRTY TDEAADPRYA VYDDDVLIGM DGDFHMTVWT GEDAWLNQRV
TRLRPRLGLS ALHLLLAIEE QIKDWNRAIV GTTVAHLGKK HLQLVNVLVP NDAVRIDASV
VFAPIMEERR ALIQSSRRLA ALRDLLLPKL VSGQIDVSAL DLDAVVGEQV A
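
The protein sequence corresponds to the gene sequence read codon by codon with translation table 11. The sketch below is a minimal translator under stated assumptions: the codon-to-amino-acid assignments use the standard layout with bases ordered T, C, A, G, translation stops at the first stop codon, and alternative start codons are not converted to methionine (this gene starts with ATG, so that does not matter here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence, ignoring whitespace, up to the first stop."""
    dna = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACGGTGT GGCGGGAGTC TGTGCTCGGA"))  # MTVWRESVLG
```

The output matches the first 10 residues of the protein sequence above.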