Gene Information |
Locus tag | MCA0838 |
Symbol | |
ID | 3102849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 881968 |
End bp | 885600 |
Gene Length | 3633 bp |
Protein Length | 1210 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637170041 |
Product | type I restriction-modification system, R subunit |
Protein accession | YP_113334 |
Protein GI | 53805023 |
COG category | [L] Replication, recombination and repair [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases [COG2827] Predicted endonuclease containing a URI domain |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
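The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 881968
END_BP = 885600
GENE_LENGTH_BP = 3633
PROTEIN_LENGTH_AA = 1210


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

# Both computed values should agree with the fields listed in the record.
print(computed_gene == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 885600 - 881968 + 1 = 3633 bp, and 3633 / 3 - 1 = 1210 aa.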
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTTC TGTCGGAGGC CGAGGTCGAA AGCGCCGTGC TCGATCAGTT TCGCGCGCTC GGGTACTGCA TCGAACGCGA GGAGGACATC GGCCCCGACG GCCATCGCCC CGAACGCGAG AGTCACGATG AGGTCGTGCT CAAGAAGCGT CTTGAGGACG CCGTGGCGCG CATCAACCCC GCCCTGCCAT TGGAGGCGCG CCAGGATGCC ATCCGCAGGG TGACGCAGTC GGAACTGCCC GTCCTGCTCG AAGAAAACCG CCGGCTGCAC CGGTTTTTGA CCGAAGGCGT GGATGTCGAG TACTACGCCA TTGATGGCAC CCTGACGGCG GGCAAGGTGC GGCTGATCGA CTTCGAGCAG CCGGGAAATA ATGACTGGCT GGCGGTGCGC CAGTTCGTGG TGATCAGCGG ACAGAACAGC CGGCGGCCCG ATGTGGTGGT GTTCGTCAAC GGCCTGCCGC TGGCGGTGAT CGAGCTGAAG GCGCCGGGGT CGGAGCAGGC GACTCTCAAA GGCGCCTTCA ACCAGCTGCA GACCTACAAG GCGCAGATCG CGCCGCTGTT TCGCAGCAAC GCGCTGCTGA TCGCCTCCGA CGGCTTGCAG GCGCGGGTGG GGTCGCTGTC GGCCGACCTG GAACGCTTCA TGCCCTGGCG CCTGCCTGCC GCGCTGTGCT CGGCGCAGGC AGGCACCACC GACGGTACGC AGGTTGCGCC GAAAGGCGCG CCCGAGCTCT CAACGCTGAT CGAGGGCGTG TTCGAGCATC GGCGGCTGCT CGATCTGCTC TCGCACTTCA CGGTGTTTGG CGAGACCGGT TCGGGGCTCG TCAAGATCAT CGCCGGCTAC CACCAGTTCC ACGCGGTGAA AAAGGCGGTG GAGCAGACGG TCCGCGCCAT GCCGCCGGCC AATGTCGCAA AGCAGGACCC CGCCGACTAC GGCCTGCCCT CGGCGCGGGA GCAGAAGCCC GGCGACCGGC GCGTGGGCGT GATCTGGCAC ACGCAGGGCT CCGGCAAGAG CCTGCTGATG GCCTTCTACG CCGGCATGCT GGTCAAACAC CCTTTGCTCG AAAACCCGAC GCTGGTGGTC ATCACCGACC GCAACGATCT GGACGACCAA CTCTTTGCGA CCTTCTCGAT GTGCCGCGAC CTGATTCGGC AGACGCCGGT GCAGGCGGAG AGCCGCGAGC ATCTGCAGGC GCTGCTCAAC CGGGCATCGG GCGGTGTGAT CTTTACCACG CTGCAGAAGT TCGGTCCCCT CTCCCCCGGC CCCTCTCCCA ACGGGAGAGG GGAGGATACA GGCCCCGCTC CCAATGTAAG GGAGGAACAT GCAAGCCCCT CTCCCTCTGG GAGAGGGGTT GGGGTGAGGG ACACGCCCCC TCCACTTACC CTGCGGCGCA ACGTGGTGGT CATCGCCGAC GAGGCGCACC GCAGCCAGTA TGGCTTCAAG GCCAAGGTGG ACGCCAGGAC CGGCGAGATT TCCTACGGCT TTGCCAAGTA CCTGCGCGAT GCGCTGCCGA ATGCCTCGTT CATCGGCTTC ACCGGCACGC CCATCGAGGC CGATGACGTC AATACTCCGG CGGTGTTCGG CCACTACATC GACATCTACG ACATCAGCCG CGCGGTGGAA GACGGCGCGA CGGTGCCGAT CTACTACGAA TCCAGGCTCG CGCGCATCGA ACTCGACGAG GACGAAAAAC CCAAGATCGA CGCCGAGATC GAGGAGATTC TGGAGGACGA GGAGGAGCCC GCCCGCGAGC GCGCCAAGCA GAAGTGGGCG ACAGTGGAGG CGCTGGTCGG CAGCGACAGG 
CGCCTGGCAC AGGTGGCGCA GGACATCGTG CAGCACTTCG AGGCCCGCGT GCAGGCGCTC TCGGGCAAGG CGATGATCGT CTGCATGAGC CGGCGCATCT GCGTGGCGCT CTACGACGAA ATTGTCCGGT TGCGCCCCGA CTGGCACAGT GCGGATGACA AGGCTGGCGC GATCAAGATC GTGATGACGG GGGCGGCAAG CGATCCGCCC GAATGGCAGC CGCACATCGG CAACAAGGCG CGGCGGGACC TGCTCGCCAA ACGCGCCCGC GACCCCAACG ACCCGTTGAA GCTGGTGATC GTGCGCGACA TGTGGCTCAC GGGTTTCGAT GCGCCGTGCA TGCACACCAT GTATGTGGAC AAGCCCATGC GCGGCCACGG GCTGATGCAG GCCATCGCGC GGGTGAACCG CGTGTTCCGC GACAAGCCGG CGGGGCTGAT CGTCGATTAC ATCGGCATCG CGCAGAACCT CAAGTCGGCG CTCGCCCAAT ACTCGCCGCG CGACCGTGAG AACACCGGCA TCGACGAGGC CGAGGCCGTC GCGGTGATGC TGGAAAAGCT CGAGGTCGTG CGGGACATGT TCCACCTGCC TGCGCCGCCG CCCGGCCAGT TTTGTGCCTA TGTCGTCGAG TGTGAAGACG GCAGCCTTTA CGTCGGTCAC ACCGAGGATT TGATGCGGCG TTGGCAGGAA CATCGACGCG GAATCGCCGC GGATCATACG AAGCGGTATG GGGCCAAGCG TATCGCGCAT TTCGAGACAG CCGCCTCACG CGAGGCTGCT TTGGCGCTGG AACGGGAGTG GAAGACCGGA TTCGGTCGCA AAAGAATCCG CCGGCTGATC CAGAATGGCG GGGCGCGGCA GGCAGGCGGC TTTGATTACC GCTCCGCACT CAACGGTTCG CCCCAGGAGC GGCTCTCGAT GATGGCCGGC GCCATCGAGT GGATACTCGA CAAACAGCAG CAATGGGCGG GAGAGGAGTC CACTCCGGAG GGCAAGAAGG CCGCGCACCG GCGCTTTCTT GACGCGGTGC TGGCCTTGTC CAAGGCGTTC GCGCTGGCAT CGGCCTCCGA CGAGGCACGC GCCGTCCGTG AGGAAGTCGG CTTCTTCCAG GCCATCCGGG CCGCGCTCAT CAAGAGCGGC ACCGGCTCCG GCGTGACCCG GCAGGAGCGC GGCTTCGCCA TCCAGCAGAT CGTCAGCCGC GCGGTGGTCT CCACCGAGAT CGTGGACATT CTGGCAGCAA GCGGGCTCAA GAGTCCGGAC ATCTCCATCC TCTCCGACGA GTTCCTCGCC GAAGTCGAGC AGATGGAAAA GAAGAACCTG GCGCTGGAAG CCTTGAGGAA GCTCCTCAAC GACGGCATAC GCTCGCGCAG CAAAGCCAAC ATCGTCGAGA CGCGGGCGTT TTCCGAACGG CTGGAGGAGG CGGTCGCGCG CTACCACGCC AACGCCATCA CCACCGCCGA GGTGCTGCAG GAGCTGATCC AGCTGGCCCG CGACATCCGC GCGGCCCGAA GCCGGGGCGA GGAAGCGGGA CTCACCGAAG AGGAGATTGC CTTCTACGAT GCGCTCGCCG AGAACGAGAG CGCGGTCGAG GTGATGGGCG ATGCCAAGCT GCGCGTGATC GCCCACGAGC TGCTCACGAG CCTGCGCGAG AACGTGACCG TGGACTGGGC GCACCGTGAA TCGGCACGCG CCCGGATGCG GGTGCTGGTC AAGCGCATCC TGCGCAAGTA CGGTTATCCG CCCGACCTCC AGGACACTGC CGTACAGACC GTGTTGGCCC AGGCGGAGGC ATTGTCGTCG GGGTGGCCGG 
TCTCGCGTGA AGGAAGGATT TGA
|
Protein sequence | MAFLSEAEVE SAVLDQFRAL GYCIEREEDI GPDGHRPERE SHDEVVLKKR LEDAVARINP ALPLEARQDA IRRVTQSELP VLLEENRRLH RFLTEGVDVE YYAIDGTLTA GKVRLIDFEQ PGNNDWLAVR QFVVISGQNS RRPDVVVFVN GLPLAVIELK APGSEQATLK GAFNQLQTYK AQIAPLFRSN ALLIASDGLQ ARVGSLSADL ERFMPWRLPA ALCSAQAGTT DGTQVAPKGA PELSTLIEGV FEHRRLLDLL SHFTVFGETG SGLVKIIAGY HQFHAVKKAV EQTVRAMPPA NVAKQDPADY GLPSAREQKP GDRRVGVIWH TQGSGKSLLM AFYAGMLVKH PLLENPTLVV ITDRNDLDDQ LFATFSMCRD LIRQTPVQAE SREHLQALLN RASGGVIFTT LQKFGPLSPG PSPNGRGEDT GPAPNVREEH ASPSPSGRGV GVRDTPPPLT LRRNVVVIAD EAHRSQYGFK AKVDARTGEI SYGFAKYLRD ALPNASFIGF TGTPIEADDV NTPAVFGHYI DIYDISRAVE DGATVPIYYE SRLARIELDE DEKPKIDAEI EEILEDEEEP ARERAKQKWA TVEALVGSDR RLAQVAQDIV QHFEARVQAL SGKAMIVCMS RRICVALYDE IVRLRPDWHS ADDKAGAIKI VMTGAASDPP EWQPHIGNKA RRDLLAKRAR DPNDPLKLVI VRDMWLTGFD APCMHTMYVD KPMRGHGLMQ AIARVNRVFR DKPAGLIVDY IGIAQNLKSA LAQYSPRDRE NTGIDEAEAV AVMLEKLEVV RDMFHLPAPP PGQFCAYVVE CEDGSLYVGH TEDLMRRWQE HRRGIAADHT KRYGAKRIAH FETAASREAA LALEREWKTG FGRKRIRRLI QNGGARQAGG FDYRSALNGS PQERLSMMAG AIEWILDKQQ QWAGEESTPE GKKAAHRRFL DAVLALSKAF ALASASDEAR AVREEVGFFQ AIRAALIKSG TGSGVTRQER GFAIQQIVSR AVVSTEIVDI LAASGLKSPD ISILSDEFLA EVEQMEKKNL ALEALRKLLN DGIRSRSKAN IVETRAFSER LEEAVARYHA NAITTAEVLQ ELIQLARDIR AARSRGEEAG LTEEEIAFYD ALAENESAVE VMGDAKLRVI AHELLTSLRE NVTVDWAHRE SARARMRVLV KRILRKYGYP PDLQDTAVQT VLAQAEALSS GWPVSREGRI
|
| |
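The GC content field can, in principle, be recomputed from the gene sequence as the share of G and C bases among all bases. The sketch below is an illustration only and assumes the 66% figure was computed over the full gene sequence, which the record does not state. The helper `gc_content` is a hypothetical name. It strips the spaces that separate the 10-base blocks in the listing, and it is demonstrated here on the first two blocks only, not on the whole gene.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGCGTTTC TGTCGGAGGC"
print(gc_content(first_blocks))
```

To check the record's value, the full Gene sequence field would be passed in as one string.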