Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1893 |
Symbol | |
ID | 3104219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 2037119 |
End bp | 2040328 |
Gene Length | 3210 bp |
Protein Length | 1069 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637171050 |
Product | type I restriction-modification system, R subunit |
Protein accession | YP_114328 |
Protein GI | 53803780 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACA TGGACAGCTT TCGCTTCAAC GAGAAATACC TCTCGCAGAT TCCCGCGCTG CAACTCTTCA TCAACCTGGG CTATGCCTAC CTCACGCCCG AGCAGGCGCT TGCCGCCCGC AGCGGCAAGC CTGGGCAGGT GCTGCTGGAA GAAGTGCTGC GCGAGCGGCT GAAGAAAATC AACCGTATCC AGTACAAGGG GCAGAGCTAC CTGTTTTCCG AGGAGAACAT CCAGACCGCC ATCCAGCGCC TGAAGAACGT CAAGTACGAC GGTTTGCTGA AAACCAACGA GGCCATCTAT GACCTGCTGA CGCTGGGCGT GGCGCTGGAG CAATCCATCG AGGGCGACAG CAAGAGCTTT ACGCTCAACT ACATCGACTG GAAGAACCCA GCCAACAACG TCTACCACGT CACGGCCGAG TTCCCGGTCG AACGCACGCG AAGCAGCGAG ACCTGCCGGC CGGACATCGT GCTGTTTGTC AACGGCATTC CGTTTGCGGT GATCGAGTGC AAGTCGCCCA AGGTGGAGGT CGAGCAGGCC GTGTCGCAGA TGATCCGCAA CCAGCGCGAA GAGTACATCC CCAAGCTGTT CACCTATGCC CAGTTGCTGC TGGCGACCAA CAAGAACGCG GTGCGCTATG CCAGCACCGG AACACCAGCC AAATTTTGGG CGCTGTGGCG TGAAGACATT GCGCCCGAGG TGCTGCGCCG CGTGCTCGAT CGGCCCTTGC CTCTTGAAGC CAAGGCAGGG CTGTTTGACC TGATTTGGGA TGGCCAATCA CTGCACGAAC CCGAATTCGG CAAGTCGTGG GCGATCACCG AACAAGATCG CATGCTGTAT GCCTTGTGCC GCCCAGAGCG GCTGCTCGAT CTGGCGTTTC GTTTCACTGT TTTTGATGCC GGCATCCGCA AGGTTGCCCG CTACCAGCAG TTCTTTGCCA TCCAGCGCGT GCTCGAACGG GTCAAGCACC GGGATGATGT GGGGCGGCGC TTGGGCGGCA TCATCTGGCA CACCCAAGGG TCGGGCAAGT CACTGACCAT GGTGATGCTG GCGCGCGCCC TGGCGCTCGA CCCCGACATC CGCAACCCGC GCATCGTGCT GGTGACGGAT CGGGTGGATT TGGACAAGCA GCTTGGCAAC ACCTTTGCCG CCTGCGGCTT GACGCCGGAT CGGGCGGATA GCGGGCGCCA TCTGCTCGAG CTGGTTTCCG AGAACAAGGC GCATATCGTC ACCACGCTCA TCCACAAGTT TGACAAGGCG CTGTCGATCA AGAAGCACGT CGAGGAATCG GCCGACATCT TCATCCTGGT CGATGAGAGC CACCGCACCA ACTTCGGCGG CTTTGCCGCG CGCATGCGGC AGATGTTCCC GATGGCCTGT TACCTGGGCT TCACCGGCAC GCCGTTGATG AAGAAGGAGA AGAACAACTT CGCCAAGTTC GGCGGCTTGA TCGACACCTA TGCGATCAAT CAGGCCGTCA AGGATGGCGC GGTGGTTCCC CTGCTGTATG AGGCGCGGCA TGTGGAGATG GAGCAGAACG AAAAAGCCAT CGACACCTGG TTCGAGCGCC ACACGCAAGG GCTGACGGAT GCGCAAAAGG CGGACCTGAA GAAGAAATAC AGCCGGGCCG AGATGCTCAA CAAGGCCGAT CAGGTGATCT ACATGCGCGC CTTCGACATC AGCGAGCACT ACCGGCAAAA CTGGCAGGGC ACGGGCTTCA AGGCGCAGCT CGTCGCACCG AGCAAGGCGG CCGCCCTCAC CTACAAGAAG TTTCTTGACG AGATCGGGCA CGTGACGAGC GAAGTCATCA TTTCGCCGCC GGATACCCGC GAAGGCAACG AGGAGACGGA CGACAAGCCA ACCGATGAGG TCGTGGCCTT CTGGGAACGG ATGATGAAGC GCTATGGCTC CGAGGAGGAG TACAACAAGC AGCTCATCAA CCAGTTCAAG TACGGCGACG ATCCGGAAAT CCTGATCGTC GTCGACAAGC TCCTGACCGG CTTTGATGCG CCACGTAACA CGGTGCTGTA CCTGACGCGC AAGCTGCGCG AGCACACCCT ATTGCAGGCC ATCGCCCGGG TGAACCGTCT GTATGACGAT GACGAAGGCA AGCAGCCCAA GGAGTTCGGC TACATCATCG ACTACGCCGG CATTCTGGGC GAGCTCGATC AGGCGCTGAC CGCTTATAGC GCCCTCGACG GCTTCGACGA GGCCGACCTG GCCGGAACCC TGGTCAGCAT CCAGGAGGAA GTGAGGAAGC TGCCGCAGCG CCATGCCGAT CTTTGGGACC TCTTCAAAGA GGTCAAGAAC CAGCACGACG AAGAGGCCTA CGAGCAGCTG CTGGCGGACG AGAAGCTGCG GGAGACGTTT TATGAGCGGC TGGCGGCCTT CGGCAAGACT CTCTCGATTG CGATGTCATC GGAGCAGTTC ATCGCCGTAA CGCCCGAGCA GAAGTTGCAG TCGTACAAGA CCGATCTGAA GCGGTTTACG AACCTGAAGG CGGCTGTCAA ATTGCGCTAT GCCGAGTCGG TGGACTACCG CGACTTCGAA CCCAAGATCA AGAAGCTCTT GGACACGCAC ATTTCGGCGA GCGAAGTGGT TCGGCTGAAT GAGCCCGTGA ACATCTTCGA CGAGCAGGCG TTCAAAGAGA TGGTCGAGGA TCGAGGCGAA GCCAAGACCA CCGCCGCGAA GGCCGACATG ATCGCGCATG CGACGAAGAA GGCCATCAGC GAGCGCCTGG AACAGGACCC CGCGTTCTAC GAGAAGTTCT CGAAACTCAT TCAGCAAGCC ATCGACGACT TCCGCGCCCA ACGCATCTCG GACCTGGAGT ACCTGAACAA GGTCAGCGAG ATCAAGGATG CGGTGGTCAA CCGGCGCAGT GACGACGCAC CAGCGCAGTT GGCCGGCAAC GACAACGCCC TTGCTTTGTA CGGCGTACTC AAGCCCTATG TGAGCGGCCA TGTGAAGGCG GAAGACGTGG CGGCGACGCT TGCTGCCGAT GCGGCCGTCG ATATCTGGTC GATCATTCAG CGCAACAAGA AGGTGGGCTT TTGGGACGAT TTGGACGCAC AACGTCGAAC GATGAACGAG ATCGACGACT ACCTGTACGA CGAGGTCAAG GACAAGAAAG GTGTGCAGCT TTCGACTGGC GAGATGGACG ACATCATCGA GCGGACGATG CAGCTGGCAC GGCACCGGAT GGTGGTATGA
|
Protein sequence | MDDMDSFRFN EKYLSQIPAL QLFINLGYAY LTPEQALAAR SGKPGQVLLE EVLRERLKKI NRIQYKGQSY LFSEENIQTA IQRLKNVKYD GLLKTNEAIY DLLTLGVALE QSIEGDSKSF TLNYIDWKNP ANNVYHVTAE FPVERTRSSE TCRPDIVLFV NGIPFAVIEC KSPKVEVEQA VSQMIRNQRE EYIPKLFTYA QLLLATNKNA VRYASTGTPA KFWALWREDI APEVLRRVLD RPLPLEAKAG LFDLIWDGQS LHEPEFGKSW AITEQDRMLY ALCRPERLLD LAFRFTVFDA GIRKVARYQQ FFAIQRVLER VKHRDDVGRR LGGIIWHTQG SGKSLTMVML ARALALDPDI RNPRIVLVTD RVDLDKQLGN TFAACGLTPD RADSGRHLLE LVSENKAHIV TTLIHKFDKA LSIKKHVEES ADIFILVDES HRTNFGGFAA RMRQMFPMAC YLGFTGTPLM KKEKNNFAKF GGLIDTYAIN QAVKDGAVVP LLYEARHVEM EQNEKAIDTW FERHTQGLTD AQKADLKKKY SRAEMLNKAD QVIYMRAFDI SEHYRQNWQG TGFKAQLVAP SKAAALTYKK FLDEIGHVTS EVIISPPDTR EGNEETDDKP TDEVVAFWER MMKRYGSEEE YNKQLINQFK YGDDPEILIV VDKLLTGFDA PRNTVLYLTR KLREHTLLQA IARVNRLYDD DEGKQPKEFG YIIDYAGILG ELDQALTAYS ALDGFDEADL AGTLVSIQEE VRKLPQRHAD LWDLFKEVKN QHDEEAYEQL LADEKLRETF YERLAAFGKT LSIAMSSEQF IAVTPEQKLQ SYKTDLKRFT NLKAAVKLRY AESVDYRDFE PKIKKLLDTH ISASEVVRLN EPVNIFDEQA FKEMVEDRGE AKTTAAKADM IAHATKKAIS ERLEQDPAFY EKFSKLIQQA IDDFRAQRIS DLEYLNKVSE IKDAVVNRRS DDAPAQLAGN DNALALYGVL KPYVSGHVKA EDVAATLAAD AAVDIWSIIQ RNKKVGFWDD LDAQRRTMNE IDDYLYDEVK DKKGVQLSTG EMDDIIERTM QLARHRMVV
|
| |