Gene MCA0274 details


Gene Information

Locus tag: MCA0274
Symbol:
ID: 3102312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 258432
End bp: 261440
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 61%
IMG OID: 637169495
Product: type I restriction-modification system, R subunit, putative
Protein accession: YP_112808
Protein GI: 53802459
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
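
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database tooling.

```python
# Values copied from the Gene Information table.
START_BP = 258432
END_BP = 261440
GENE_LENGTH_BP = 3009
PROTEIN_LENGTH_AA = 1002


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 261440 - 258432 + 1 gives 3009 bp, and 3009 / 3 - 1 gives 1002 aa, which agrees with the table.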


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.273589
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACGA CTGACACCAG CGAAAAGGGC CTCGAAGCCC TGATCGTGCG CGACCTGGTC 
GCCAGCGGCT ACGTACAGGG CCATGCCGCG GACTACAACC GCGATGTGGC GCTGGACGTG
ACCCAGTTGC TCGCCTTTCT TCGGGCGACG CAGCCGAAAG TTGTCGAAAC GCTGAACCTG
GGCGCCGAAG GCATTCAGCG CACCCAGTTT TTGCACCGCC TGCAGGGCGA GATCACCAAG
CGCGGCGTGG TGGACTGTTT GCGCCGCGGT GTTAGCCACG GCCCGGTACA CGTGGACCTT
TACAAGCTGT TGCCCACCCC GGGCAATGCT GCCGCTGCCG AGGCCTTCGG CAAGAACATT
TTCAGCGTCA CGCGGCAGGT GCGCTACAGC AACGACGAGT CCCAGCGCGC GCTGGATATG
GTGATTTTCA TCAACGGCCT GCCGGTGCTC ACCTTCGAGC TGAAGAACTC GCTCACCAAG
CAGACCGTCG CGGACGCCAT CGTGCAGTAC CAGACCGATC GCAACCCGGA TGAGCTGCTG
TTCCAACTCG GCCGCTGCGT TGCCCACATG GCGGTGGACG ACGCCGAGGT GCGCTTTTGC
ACTCACCTGA CCGGCAAGAC CTCCTGGTTC CTGCCTTTCA ACCAAGGTTG GAACAGCGGC
GCAGGCAACC CACCGAATCC TCATGGTCTG AAGACCGACT ATCTATGGAA GCAGGTGCTG
GTGAAGGAGT CGCTGGCTAA CATCATCGAG AACTACGCGC AGCTGGTAGA GGAGGAAGCG
GAAGACGCCA ATGGCCGCAA GCGCAAGACG CGCAAACAGA TCTTCCCCCG GTACCACCAA
CTTCGCACTG TTCGCGCCCT GTTGCGGCGC AGCCAAGCGG ATGGTGTCGG CAAGCGTTAC
CTGATCCAGC ATTCAGCCGG CAGCGGCAAG AGCAACACGA TTGCCTGGCT GGCCCATCAG
CTGGTGGAAC TGAAGACGGC GGCGGATGCG GGGCAGGCCC AGTTCGACTC CGTCATCGTC
ATCACCGACC GCCGTGCGCT GGACACGCAG ATTGCCCGCA CGATCCGGTC CTACGACCAT
GTGGCCTCGA TCTACGGTCA TTCGGAAAGC GCCGAGGAAC TGCGCACTTT CCTGCGCCGA
GGCAAGAAGA TCATCGTCAC CACGGTGCAG AAGTTCCCGT TCATCCTGGA CGAGCTGGGG
GATCTCGGCG ACAAGAAGTT CGCGCTGTTG ATCGACGAGG CCCACTCCAG CCAGGGCGGC
AAGACGACGG CCAAGATGCA TCTGGCCTTG TCCGGACAAG CCGCGGAAGG CGGCGAGGAC
GAAGAGGAAG AATCGGTCGA GGACAAGGTC AACGCCTTGA TCGAATCCCG CAAGATGCTG
GCCAACGCCA GCTACTACGC CTTCACGGCC ACGCCCAAGA CAAAGACCTT GGAGCTCTTC
GGCGAGCGTC AGGTTGTCGG CGACACGGTG CAGTTCCGCT CGCCCGAGGA GCTGACGTAT
ACCACCAAGC AGGCGATTCA GGAAGGCTTC ATCCTCGACG TGATCGCCAA CTACACCCCG
GTGTCGAGCT TCTATCACAT TGCCAAGACC GTCGAGCACG ATCCGGAGGT GGACAAAGCC
AAGGCGCTGA AGAAGATTCG GCGCTACGTG GAATCCCACG ACAAGGCGAT CCGTCGCAAG
GCGGAGATCA TGGTCGATCA CTTTATTGAA CAGGTGATCG GCGCCAAGAA GATCGGCGGC
AAGGCGCGCG CGATGATCGT CTGCAACGGA ATCGCACGGG CCATCGACTA CTTCCGTGAG
GTTTCAGACT ACCTTCGCGA GATCAAGAGC CCATACAAGG CCATCGTGGC GTACTCCGGC
GATTTCGAGG TCGGCGGGGT GAAGAAGACC GAGGCCGATC TCAACGGATT CCCGAGCAAG
GACATTCCGG CCAAGCTCAG GCAAGACCCG TATCGCTTCC TGATCGTCGC GAACAAGTTC
GTCACCGGCT TCGACGAGCC GCTGCTGCAC ACCATGTATG TGGACAAACC CCTGGCGGGC
GTCCTGGCCG TGCAGACGCT CTCGCGACTG AACCGCGCGC ATCCGCAGAA GGCCGACACG
TTCGTGCTCG ACTTCGCCGA CAACGCGGAG GCCGTCAAGG CGGCCTTCCA GGAGTACTAC
CGCGCCACGA TCCAGGAGGG GGAGACCGAT CCGAACAAGC TGCACGACCT GAAGAGCGAT
CTGGACGCCC AGCAGGTCTA CAGCTGGCAA CAGGTCGAAG ACCTCGTCGC GCAGTACCTT
GGCGGAGCGG AGCGGGATCA GCTCGATCCA ATCCTCGATG CCTGCGTCGC GGAGTACGTC
GAGAAGCTCT CTGAAGACGA CCAAGTGAAG TTCAAGGGCA AGGCAAAGGC CTTCGTCCGC
AGCTACGGCT TCCTGGCGGC CATTCTGCCC TACGGGCATC CGGCATGGGA GAAGCTGTCG
ATCTTCCTCA ACTTCCTGAT TCCGAAGCTG CCTGCCCCCA AGGAGGAGGA TCTGTCCAAG
GGTGTGCTGG AGGCCATCGA CATGGACAGC TACCGCGCTC AGGCCCAGGC GTCCATGCGC
ATGGCGATGG ATGACGCAGA TGCCTTTGTC GAACCTCCAC CCCCCGGAGG TAGCGGAGGC
AGCGGCGAAC CAGAGCTGGA CAGGCTGTCG AACATCATCA AGCAGTTCAA CGACCTGTTC
GGCAACATCG AGTGGCATGA CGCCGACAAG ATTCGCAAGG TCGTCACCGA AGAGATTCCG
GCGCGCGTCG CGCAGGACAA GGCCTACCAG AACGCACAGG CGAACTCGGG CAAGCAAAAC
GCCAGGCTGG AGCATGACAA AGCGCTCAAC CGCGTGGTGC TGGAGCTGCT CGACGACCAC
ACCGAACTCT TCAAGCAGTT CAGCGACAAC CCGAACTTCA AGCGCTGGCT GGCGGACATG
GTGTTCGACT CGACCTACCG CCCAGGACAG AAACCGTCAG TGCCGCCTCA ATCGGGCGCC
CAAGCCTGA
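
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before its length or GC content can be recomputed and compared with the values in the Gene Information section. A minimal sketch follows; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence, pasted exactly as printed.
    first_row = (
        "ATGAGCACGA CTGACACCAG CGAAAAGGGC "
        "CTCGAAGCCC TGATCGTGCG CGACCTGGTC"
    )
    seq = clean_sequence(first_row)
    print(len(seq), round(100 * gc_fraction(seq)), "% GC")
```

Pasting the full sequence in place of `first_row` would let the result be compared with the listed gene length and GC content.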
 
Protein sequence
MSTTDTSEKG LEALIVRDLV ASGYVQGHAA DYNRDVALDV TQLLAFLRAT QPKVVETLNL 
GAEGIQRTQF LHRLQGEITK RGVVDCLRRG VSHGPVHVDL YKLLPTPGNA AAAEAFGKNI
FSVTRQVRYS NDESQRALDM VIFINGLPVL TFELKNSLTK QTVADAIVQY QTDRNPDELL
FQLGRCVAHM AVDDAEVRFC THLTGKTSWF LPFNQGWNSG AGNPPNPHGL KTDYLWKQVL
VKESLANIIE NYAQLVEEEA EDANGRKRKT RKQIFPRYHQ LRTVRALLRR SQADGVGKRY
LIQHSAGSGK SNTIAWLAHQ LVELKTAADA GQAQFDSVIV ITDRRALDTQ IARTIRSYDH
VASIYGHSES AEELRTFLRR GKKIIVTTVQ KFPFILDELG DLGDKKFALL IDEAHSSQGG
KTTAKMHLAL SGQAAEGGED EEEESVEDKV NALIESRKML ANASYYAFTA TPKTKTLELF
GERQVVGDTV QFRSPEELTY TTKQAIQEGF ILDVIANYTP VSSFYHIAKT VEHDPEVDKA
KALKKIRRYV ESHDKAIRRK AEIMVDHFIE QVIGAKKIGG KARAMIVCNG IARAIDYFRE
VSDYLREIKS PYKAIVAYSG DFEVGGVKKT EADLNGFPSK DIPAKLRQDP YRFLIVANKF
VTGFDEPLLH TMYVDKPLAG VLAVQTLSRL NRAHPQKADT FVLDFADNAE AVKAAFQEYY
RATIQEGETD PNKLHDLKSD LDAQQVYSWQ QVEDLVAQYL GGAERDQLDP ILDACVAEYV
EKLSEDDQVK FKGKAKAFVR SYGFLAAILP YGHPAWEKLS IFLNFLIPKL PAPKEEDLSK
GVLEAIDMDS YRAQAQASMR MAMDDADAFV EPPPPGGSGG SGEPELDRLS NIIKQFNDLF
GNIEWHDADK IRKVVTEEIP ARVAQDKAYQ NAQANSGKQN ARLEHDKALN RVVLELLDDH
TELFKQFSDN PNFKRWLADM VFDSTYRPGQ KPSVPPQSGA QA
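
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a simplified illustration, not the database's own method. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses the same assignments as the standard code.
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks and last nine bases of the gene sequence.
    print(translate("ATGAGCACGA CTGACACCAG CGAAAAGGGC"))
    print(translate("CAAGCCTGA"))
```

The first 30 bases translate to MSTTDTSEKG and the final nine bases to QA followed by a stop, matching the start and end of the protein sequence shown above.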