Gene MCA1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1890 
Symbol 
ID3104012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2032106 
End bp2033356 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content55% 
IMG OID637171047 
Producttype I restriction-modification system, S subunit 
Protein accessionYP_114325 
Protein GI53803791 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGAGG AGTGGACAGA GGCTCGCATC GATGAGCTGG GCAATGGGCG CCGTCCGGTA 
TTGAAGGCCG GTCCATTCGG CTCCTCTGTG ACAAAGGCGA CGTACAAAAC ATCGGGCTAT
AAGGTCTATG GGCAACAGGA AGTCGTAGCA AAGGACCCTA ATGCCGAAGC GTATTTCGTT
TCCGAAGCCA CGTTTACTCG TCACAAGAGC TGTGCAGTCA AGCCTGGCGA CATTCTGATG
ACGATGATGG GCACCATAGG CCGCGTGTAC CGAGTCCCGG AAGGAGCACC TGAAGGCATC
ATCAACCCTC GGCTTGTTCG CCTCGCTTTC GACACTTCTC GGATACTGTC CGAGTATGCC
GAGGTTGCAT TGGAACAGCC TTCGCTTCAG CGACTGCTGG ACCGAAGAAG TCACGGCGGC
ACGATGCAAG GTCTGAACCT GGAAGCGCTT GCATCAATTC GGCTCCTGCT TCCCCCGCTT
CCTGAGCAAC GCAAGATTGT TGAGATTCTG CGCACATGGG ACACCGCCAT CGAAACCACC
GAGCGCCTGA TCGCGGCGAA GGAGCGGTTC TATGCCCATG AACTCTCGCG CCTCATCAGC
CGCGGCCAGC ATCCGCGGCG GCCAAACGGC GATTCCGCAA GCGAAGCTTC TGAGCCTGAT
CGTGGTAGCC AATGGCGCAC AGTTTCGCTC TCAGACATTG CTACCGTGTG GAAGGGTCAA
CAGCTCAATA AAGAGCACAT GGAGGAGAGT GGGGCTTACT ACGTGCTCAA TGGCGGAATT
AACCCATCCG GCAGGACTAA TGATTGGAAT TGCGAAGCAA AAACCATCAC CATAAGTAGC
GGCGGCAATT CTTGCGGGTT TATCAATCTA AACCTAGAAA GGTTTTGGTG TGGCGGGGAC
TGCTTCGCAC TCAAGCAAAT TTCTCCTTTG GTTGATGTTG ACTATTTGTT TTTCTACCTA
AAAAGCCGGC AGCATCAAAT GATGGCACTG CGGACTGGGT CTGGAATTCC CCATATTTAT
CGCTCGGATA TTGAGTCCTT CCCGGTCATT CTTCCCGACC TCGCCACCCA AACCGCCATC
GCCCGCTATC TCACTGCACT GCGCGAAGAA ATCACGTTGC TTTCCCGTTC CCTCGGCGCC
CTCAAACGCC AAAAACGCGG CCTGATGCAA AAGCTGTTGA CCGGCCAATG GCGGGTGCCG
CTCGCGCAAG ATGCCACATC GCCCGACGCT CTTCAGGAGG TCACCCCATG A
 
Protein sequence
MREEWTEARI DELGNGRRPV LKAGPFGSSV TKATYKTSGY KVYGQQEVVA KDPNAEAYFV 
SEATFTRHKS CAVKPGDILM TMMGTIGRVY RVPEGAPEGI INPRLVRLAF DTSRILSEYA
EVALEQPSLQ RLLDRRSHGG TMQGLNLEAL ASIRLLLPPL PEQRKIVEIL RTWDTAIETT
ERLIAAKERF YAHELSRLIS RGQHPRRPNG DSASEASEPD RGSQWRTVSL SDIATVWKGQ
QLNKEHMEES GAYYVLNGGI NPSGRTNDWN CEAKTITISS GGNSCGFINL NLERFWCGGD
CFALKQISPL VDVDYLFFYL KSRQHQMMAL RTGSGIPHIY RSDIESFPVI LPDLATQTAI
ARYLTALREE ITLLSRSLGA LKRQKRGLMQ KLLTGQWRVP LAQDATSPDA LQEVTP