Gene MCA0836 details


Gene Information

Locus tag: MCA0836
Symbol:
ID: 3103154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 880101
End bp: 881351
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 56%
IMG OID: 637170039
Product: type I restriction-modification system S subunit
Protein accession: YP_113333
Protein GI: 53805024
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
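
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that contributes no amino acid. The function names `cds_length` and `protein_length` are illustrative and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = cds_length(880101, 881351)
print(gene_len)                  # 1251
print(protein_length(gene_len))  # 416
```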


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.444813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGGTG AATGGAAGGA ATGCTCGCTC GGCGATGTGA TCGAGTTGAA GCGTGGATAC 
GACCTGCCAC AAAAAGACCG CTTGCCTGGT GACGTTCCGC TGGTCTCATC TTCTGGGGTC
ACGGACACCC ACGCAAAGGC AATGGTTAAG GGACCGGGGG TCGTGACTGG GCGATATGGA
ACATTGGGAC AAGTGTTTTA TGTTGAGCAG GATTTTTGGC CGCTGAACAC CACGCTTTAT
GTGCGCGACT TCAAGGGTAA CGATCCGCGC TTCATTAGCT ATTTTCTGCG CGATGTCGAC
TTCCATGCCT ATTCAGACAA GGCTGCCGTT CCCGGCCTGA ATCGCAACCA CCTACATCAA
GCAAAGGTTC GGATTCCCAG TGACCCCAAC GAACAACGCG CCATCGCCCA CATCCTTGGC
ACGCTGGACG ACAAGATCGA ACTCAACCGC CGCCAGAACG AGACGCTGGA GGCGATGGCC
CGCGCCTTGT TCAAGGCATG GTTCGTGGAC TTCGAGCCGG TGCGCGCCAA ATGTAGGGGC
GACCGGCCGG TCGCCCCTAC GGGGTGGCAA TGGCCGCAAC ACATCCTCGA CCTCTTCCCC
GACCGCCTCG TCGAATCGGA ACTTGGGGAG ATTCCGGAGG GGTGGCGTGT GTTTTCGTTC
GGCGATGTGG CGGAGCAAGG AAAGGGTTTC GTAAATCCAA GCAGGGAACC TGGAGAGAGG
TTTACGCACT ACAGTCTTCC TGCTTTTGAT GCGGGGAAGA TGCCTGTCAT TGAACCAGGC
GAATCAATCA AAAGTAACAA GACTCCAGTT CCAGATGGCG CAGTATTGGT ATCAAAGCTG
AACCCGCACA TTCCGCGCAT CTGGCTTGTC GGTGAGGCTG GCAATAGGGC GGTCTGTTCG
ACTGAGTTTA TTGTTTGGAC TCCGAAATCC CCAGCACAAA GTGCCTTTGT GTATTGCCTT
GCCTCATCGC CGGAATTCGT CGGTGCCATG TGCCAGCTGG TAACAGGAAC ATCGAACAGC
CACCAACGCG TCAAGCCCGA TCAGTTACGG GAAATACGTG TCTTCGCAGG TAACGAGAAT
GTCGTCGAGA CCTTCTCCAA GACGGCAGAG CCGTTGATGG ATCAGTTTTT ACAAAATACT
CGGCAATCCC GCATCCTCGC CCAACTGCGC GACACCCTGC TACCCAAACT TATTTCTGGC
GAGCTGCGCG TGAAGGATGC CGAGGCGTTC CTGAAGGAGC GGGGGCTGTG A
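
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of 10 bases).
first_line = "ATGGCGGGTG AATGGAAGGA ATGCTCGCTC GGCGATGTGA TCGAGTTGAA GCGTGGATAC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this first line only
```

Passing all of the sequence lines to `clean_sequence` as one string gives the input needed to compare against the reported gene length and GC content.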
 
Protein sequence
MAGEWKECSL GDVIELKRGY DLPQKDRLPG DVPLVSSSGV TDTHAKAMVK GPGVVTGRYG 
TLGQVFYVEQ DFWPLNTTLY VRDFKGNDPR FISYFLRDVD FHAYSDKAAV PGLNRNHLHQ
AKVRIPSDPN EQRAIAHILG TLDDKIELNR RQNETLEAMA RALFKAWFVD FEPVRAKCRG
DRPVAPTGWQ WPQHILDLFP DRLVESELGE IPEGWRVFSF GDVAEQGKGF VNPSREPGER
FTHYSLPAFD AGKMPVIEPG ESIKSNKTPV PDGAVLVSKL NPHIPRIWLV GEAGNRAVCS
TEFIVWTPKS PAQSAFVYCL ASSPEFVGAM CQLVTGTSNS HQRVKPDQLR EIRVFAGNEN
VVETFSKTAE PLMDQFLQNT RQSRILAQLR DTLLPKLISG ELRVKDAEAF LKERGL
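
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a simplified illustration, not the pipeline used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGCGGGTG AATGGAAGGA ATGCTCGCTC"))  # MAGEWKECSL
```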