Gene MCA0838 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0838 
Symbol 
ID3102849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp881968 
End bp885600 
Gene Length3633 bp 
Protein Length1210 aa 
Translation table11 
GC content66% 
IMG OID637170041 
Producttype I restriction-modification system, R subunit 
Protein accessionYP_113334 
Protein GI53805023 
COG category[L] Replication, recombination and repair
[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
[COG2827] Predicted endonuclease containing a URI domain 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTTTC TGTCGGAGGC CGAGGTCGAA AGCGCCGTGC TCGATCAGTT TCGCGCGCTC 
GGGTACTGCA TCGAACGCGA GGAGGACATC GGCCCCGACG GCCATCGCCC CGAACGCGAG
AGTCACGATG AGGTCGTGCT CAAGAAGCGT CTTGAGGACG CCGTGGCGCG CATCAACCCC
GCCCTGCCAT TGGAGGCGCG CCAGGATGCC ATCCGCAGGG TGACGCAGTC GGAACTGCCC
GTCCTGCTCG AAGAAAACCG CCGGCTGCAC CGGTTTTTGA CCGAAGGCGT GGATGTCGAG
TACTACGCCA TTGATGGCAC CCTGACGGCG GGCAAGGTGC GGCTGATCGA CTTCGAGCAG
CCGGGAAATA ATGACTGGCT GGCGGTGCGC CAGTTCGTGG TGATCAGCGG ACAGAACAGC
CGGCGGCCCG ATGTGGTGGT GTTCGTCAAC GGCCTGCCGC TGGCGGTGAT CGAGCTGAAG
GCGCCGGGGT CGGAGCAGGC GACTCTCAAA GGCGCCTTCA ACCAGCTGCA GACCTACAAG
GCGCAGATCG CGCCGCTGTT TCGCAGCAAC GCGCTGCTGA TCGCCTCCGA CGGCTTGCAG
GCGCGGGTGG GGTCGCTGTC GGCCGACCTG GAACGCTTCA TGCCCTGGCG CCTGCCTGCC
GCGCTGTGCT CGGCGCAGGC AGGCACCACC GACGGTACGC AGGTTGCGCC GAAAGGCGCG
CCCGAGCTCT CAACGCTGAT CGAGGGCGTG TTCGAGCATC GGCGGCTGCT CGATCTGCTC
TCGCACTTCA CGGTGTTTGG CGAGACCGGT TCGGGGCTCG TCAAGATCAT CGCCGGCTAC
CACCAGTTCC ACGCGGTGAA AAAGGCGGTG GAGCAGACGG TCCGCGCCAT GCCGCCGGCC
AATGTCGCAA AGCAGGACCC CGCCGACTAC GGCCTGCCCT CGGCGCGGGA GCAGAAGCCC
GGCGACCGGC GCGTGGGCGT GATCTGGCAC ACGCAGGGCT CCGGCAAGAG CCTGCTGATG
GCCTTCTACG CCGGCATGCT GGTCAAACAC CCTTTGCTCG AAAACCCGAC GCTGGTGGTC
ATCACCGACC GCAACGATCT GGACGACCAA CTCTTTGCGA CCTTCTCGAT GTGCCGCGAC
CTGATTCGGC AGACGCCGGT GCAGGCGGAG AGCCGCGAGC ATCTGCAGGC GCTGCTCAAC
CGGGCATCGG GCGGTGTGAT CTTTACCACG CTGCAGAAGT TCGGTCCCCT CTCCCCCGGC
CCCTCTCCCA ACGGGAGAGG GGAGGATACA GGCCCCGCTC CCAATGTAAG GGAGGAACAT
GCAAGCCCCT CTCCCTCTGG GAGAGGGGTT GGGGTGAGGG ACACGCCCCC TCCACTTACC
CTGCGGCGCA ACGTGGTGGT CATCGCCGAC GAGGCGCACC GCAGCCAGTA TGGCTTCAAG
GCCAAGGTGG ACGCCAGGAC CGGCGAGATT TCCTACGGCT TTGCCAAGTA CCTGCGCGAT
GCGCTGCCGA ATGCCTCGTT CATCGGCTTC ACCGGCACGC CCATCGAGGC CGATGACGTC
AATACTCCGG CGGTGTTCGG CCACTACATC GACATCTACG ACATCAGCCG CGCGGTGGAA
GACGGCGCGA CGGTGCCGAT CTACTACGAA TCCAGGCTCG CGCGCATCGA ACTCGACGAG
GACGAAAAAC CCAAGATCGA CGCCGAGATC GAGGAGATTC TGGAGGACGA GGAGGAGCCC
GCCCGCGAGC GCGCCAAGCA GAAGTGGGCG ACAGTGGAGG CGCTGGTCGG CAGCGACAGG
CGCCTGGCAC AGGTGGCGCA GGACATCGTG CAGCACTTCG AGGCCCGCGT GCAGGCGCTC
TCGGGCAAGG CGATGATCGT CTGCATGAGC CGGCGCATCT GCGTGGCGCT CTACGACGAA
ATTGTCCGGT TGCGCCCCGA CTGGCACAGT GCGGATGACA AGGCTGGCGC GATCAAGATC
GTGATGACGG GGGCGGCAAG CGATCCGCCC GAATGGCAGC CGCACATCGG CAACAAGGCG
CGGCGGGACC TGCTCGCCAA ACGCGCCCGC GACCCCAACG ACCCGTTGAA GCTGGTGATC
GTGCGCGACA TGTGGCTCAC GGGTTTCGAT GCGCCGTGCA TGCACACCAT GTATGTGGAC
AAGCCCATGC GCGGCCACGG GCTGATGCAG GCCATCGCGC GGGTGAACCG CGTGTTCCGC
GACAAGCCGG CGGGGCTGAT CGTCGATTAC ATCGGCATCG CGCAGAACCT CAAGTCGGCG
CTCGCCCAAT ACTCGCCGCG CGACCGTGAG AACACCGGCA TCGACGAGGC CGAGGCCGTC
GCGGTGATGC TGGAAAAGCT CGAGGTCGTG CGGGACATGT TCCACCTGCC TGCGCCGCCG
CCCGGCCAGT TTTGTGCCTA TGTCGTCGAG TGTGAAGACG GCAGCCTTTA CGTCGGTCAC
ACCGAGGATT TGATGCGGCG TTGGCAGGAA CATCGACGCG GAATCGCCGC GGATCATACG
AAGCGGTATG GGGCCAAGCG TATCGCGCAT TTCGAGACAG CCGCCTCACG CGAGGCTGCT
TTGGCGCTGG AACGGGAGTG GAAGACCGGA TTCGGTCGCA AAAGAATCCG CCGGCTGATC
CAGAATGGCG GGGCGCGGCA GGCAGGCGGC TTTGATTACC GCTCCGCACT CAACGGTTCG
CCCCAGGAGC GGCTCTCGAT GATGGCCGGC GCCATCGAGT GGATACTCGA CAAACAGCAG
CAATGGGCGG GAGAGGAGTC CACTCCGGAG GGCAAGAAGG CCGCGCACCG GCGCTTTCTT
GACGCGGTGC TGGCCTTGTC CAAGGCGTTC GCGCTGGCAT CGGCCTCCGA CGAGGCACGC
GCCGTCCGTG AGGAAGTCGG CTTCTTCCAG GCCATCCGGG CCGCGCTCAT CAAGAGCGGC
ACCGGCTCCG GCGTGACCCG GCAGGAGCGC GGCTTCGCCA TCCAGCAGAT CGTCAGCCGC
GCGGTGGTCT CCACCGAGAT CGTGGACATT CTGGCAGCAA GCGGGCTCAA GAGTCCGGAC
ATCTCCATCC TCTCCGACGA GTTCCTCGCC GAAGTCGAGC AGATGGAAAA GAAGAACCTG
GCGCTGGAAG CCTTGAGGAA GCTCCTCAAC GACGGCATAC GCTCGCGCAG CAAAGCCAAC
ATCGTCGAGA CGCGGGCGTT TTCCGAACGG CTGGAGGAGG CGGTCGCGCG CTACCACGCC
AACGCCATCA CCACCGCCGA GGTGCTGCAG GAGCTGATCC AGCTGGCCCG CGACATCCGC
GCGGCCCGAA GCCGGGGCGA GGAAGCGGGA CTCACCGAAG AGGAGATTGC CTTCTACGAT
GCGCTCGCCG AGAACGAGAG CGCGGTCGAG GTGATGGGCG ATGCCAAGCT GCGCGTGATC
GCCCACGAGC TGCTCACGAG CCTGCGCGAG AACGTGACCG TGGACTGGGC GCACCGTGAA
TCGGCACGCG CCCGGATGCG GGTGCTGGTC AAGCGCATCC TGCGCAAGTA CGGTTATCCG
CCCGACCTCC AGGACACTGC CGTACAGACC GTGTTGGCCC AGGCGGAGGC ATTGTCGTCG
GGGTGGCCGG TCTCGCGTGA AGGAAGGATT TGA
 
Protein sequence
MAFLSEAEVE SAVLDQFRAL GYCIEREEDI GPDGHRPERE SHDEVVLKKR LEDAVARINP 
ALPLEARQDA IRRVTQSELP VLLEENRRLH RFLTEGVDVE YYAIDGTLTA GKVRLIDFEQ
PGNNDWLAVR QFVVISGQNS RRPDVVVFVN GLPLAVIELK APGSEQATLK GAFNQLQTYK
AQIAPLFRSN ALLIASDGLQ ARVGSLSADL ERFMPWRLPA ALCSAQAGTT DGTQVAPKGA
PELSTLIEGV FEHRRLLDLL SHFTVFGETG SGLVKIIAGY HQFHAVKKAV EQTVRAMPPA
NVAKQDPADY GLPSAREQKP GDRRVGVIWH TQGSGKSLLM AFYAGMLVKH PLLENPTLVV
ITDRNDLDDQ LFATFSMCRD LIRQTPVQAE SREHLQALLN RASGGVIFTT LQKFGPLSPG
PSPNGRGEDT GPAPNVREEH ASPSPSGRGV GVRDTPPPLT LRRNVVVIAD EAHRSQYGFK
AKVDARTGEI SYGFAKYLRD ALPNASFIGF TGTPIEADDV NTPAVFGHYI DIYDISRAVE
DGATVPIYYE SRLARIELDE DEKPKIDAEI EEILEDEEEP ARERAKQKWA TVEALVGSDR
RLAQVAQDIV QHFEARVQAL SGKAMIVCMS RRICVALYDE IVRLRPDWHS ADDKAGAIKI
VMTGAASDPP EWQPHIGNKA RRDLLAKRAR DPNDPLKLVI VRDMWLTGFD APCMHTMYVD
KPMRGHGLMQ AIARVNRVFR DKPAGLIVDY IGIAQNLKSA LAQYSPRDRE NTGIDEAEAV
AVMLEKLEVV RDMFHLPAPP PGQFCAYVVE CEDGSLYVGH TEDLMRRWQE HRRGIAADHT
KRYGAKRIAH FETAASREAA LALEREWKTG FGRKRIRRLI QNGGARQAGG FDYRSALNGS
PQERLSMMAG AIEWILDKQQ QWAGEESTPE GKKAAHRRFL DAVLALSKAF ALASASDEAR
AVREEVGFFQ AIRAALIKSG TGSGVTRQER GFAIQQIVSR AVVSTEIVDI LAASGLKSPD
ISILSDEFLA EVEQMEKKNL ALEALRKLLN DGIRSRSKAN IVETRAFSER LEEAVARYHA
NAITTAEVLQ ELIQLARDIR AARSRGEEAG LTEEEIAFYD ALAENESAVE VMGDAKLRVI
AHELLTSLRE NVTVDWAHRE SARARMRVLV KRILRKYGYP PDLQDTAVQT VLAQAEALSS
GWPVSREGRI