Gene MCA1893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1893 
Symbol 
ID3104219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2037119 
End bp2040328 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table11 
GC content60% 
IMG OID637171050 
Producttype I restriction-modification system, R subunit 
Protein accessionYP_114328 
Protein GI53803780 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACA TGGACAGCTT TCGCTTCAAC GAGAAATACC TCTCGCAGAT TCCCGCGCTG 
CAACTCTTCA TCAACCTGGG CTATGCCTAC CTCACGCCCG AGCAGGCGCT TGCCGCCCGC
AGCGGCAAGC CTGGGCAGGT GCTGCTGGAA GAAGTGCTGC GCGAGCGGCT GAAGAAAATC
AACCGTATCC AGTACAAGGG GCAGAGCTAC CTGTTTTCCG AGGAGAACAT CCAGACCGCC
ATCCAGCGCC TGAAGAACGT CAAGTACGAC GGTTTGCTGA AAACCAACGA GGCCATCTAT
GACCTGCTGA CGCTGGGCGT GGCGCTGGAG CAATCCATCG AGGGCGACAG CAAGAGCTTT
ACGCTCAACT ACATCGACTG GAAGAACCCA GCCAACAACG TCTACCACGT CACGGCCGAG
TTCCCGGTCG AACGCACGCG AAGCAGCGAG ACCTGCCGGC CGGACATCGT GCTGTTTGTC
AACGGCATTC CGTTTGCGGT GATCGAGTGC AAGTCGCCCA AGGTGGAGGT CGAGCAGGCC
GTGTCGCAGA TGATCCGCAA CCAGCGCGAA GAGTACATCC CCAAGCTGTT CACCTATGCC
CAGTTGCTGC TGGCGACCAA CAAGAACGCG GTGCGCTATG CCAGCACCGG AACACCAGCC
AAATTTTGGG CGCTGTGGCG TGAAGACATT GCGCCCGAGG TGCTGCGCCG CGTGCTCGAT
CGGCCCTTGC CTCTTGAAGC CAAGGCAGGG CTGTTTGACC TGATTTGGGA TGGCCAATCA
CTGCACGAAC CCGAATTCGG CAAGTCGTGG GCGATCACCG AACAAGATCG CATGCTGTAT
GCCTTGTGCC GCCCAGAGCG GCTGCTCGAT CTGGCGTTTC GTTTCACTGT TTTTGATGCC
GGCATCCGCA AGGTTGCCCG CTACCAGCAG TTCTTTGCCA TCCAGCGCGT GCTCGAACGG
GTCAAGCACC GGGATGATGT GGGGCGGCGC TTGGGCGGCA TCATCTGGCA CACCCAAGGG
TCGGGCAAGT CACTGACCAT GGTGATGCTG GCGCGCGCCC TGGCGCTCGA CCCCGACATC
CGCAACCCGC GCATCGTGCT GGTGACGGAT CGGGTGGATT TGGACAAGCA GCTTGGCAAC
ACCTTTGCCG CCTGCGGCTT GACGCCGGAT CGGGCGGATA GCGGGCGCCA TCTGCTCGAG
CTGGTTTCCG AGAACAAGGC GCATATCGTC ACCACGCTCA TCCACAAGTT TGACAAGGCG
CTGTCGATCA AGAAGCACGT CGAGGAATCG GCCGACATCT TCATCCTGGT CGATGAGAGC
CACCGCACCA ACTTCGGCGG CTTTGCCGCG CGCATGCGGC AGATGTTCCC GATGGCCTGT
TACCTGGGCT TCACCGGCAC GCCGTTGATG AAGAAGGAGA AGAACAACTT CGCCAAGTTC
GGCGGCTTGA TCGACACCTA TGCGATCAAT CAGGCCGTCA AGGATGGCGC GGTGGTTCCC
CTGCTGTATG AGGCGCGGCA TGTGGAGATG GAGCAGAACG AAAAAGCCAT CGACACCTGG
TTCGAGCGCC ACACGCAAGG GCTGACGGAT GCGCAAAAGG CGGACCTGAA GAAGAAATAC
AGCCGGGCCG AGATGCTCAA CAAGGCCGAT CAGGTGATCT ACATGCGCGC CTTCGACATC
AGCGAGCACT ACCGGCAAAA CTGGCAGGGC ACGGGCTTCA AGGCGCAGCT CGTCGCACCG
AGCAAGGCGG CCGCCCTCAC CTACAAGAAG TTTCTTGACG AGATCGGGCA CGTGACGAGC
GAAGTCATCA TTTCGCCGCC GGATACCCGC GAAGGCAACG AGGAGACGGA CGACAAGCCA
ACCGATGAGG TCGTGGCCTT CTGGGAACGG ATGATGAAGC GCTATGGCTC CGAGGAGGAG
TACAACAAGC AGCTCATCAA CCAGTTCAAG TACGGCGACG ATCCGGAAAT CCTGATCGTC
GTCGACAAGC TCCTGACCGG CTTTGATGCG CCACGTAACA CGGTGCTGTA CCTGACGCGC
AAGCTGCGCG AGCACACCCT ATTGCAGGCC ATCGCCCGGG TGAACCGTCT GTATGACGAT
GACGAAGGCA AGCAGCCCAA GGAGTTCGGC TACATCATCG ACTACGCCGG CATTCTGGGC
GAGCTCGATC AGGCGCTGAC CGCTTATAGC GCCCTCGACG GCTTCGACGA GGCCGACCTG
GCCGGAACCC TGGTCAGCAT CCAGGAGGAA GTGAGGAAGC TGCCGCAGCG CCATGCCGAT
CTTTGGGACC TCTTCAAAGA GGTCAAGAAC CAGCACGACG AAGAGGCCTA CGAGCAGCTG
CTGGCGGACG AGAAGCTGCG GGAGACGTTT TATGAGCGGC TGGCGGCCTT CGGCAAGACT
CTCTCGATTG CGATGTCATC GGAGCAGTTC ATCGCCGTAA CGCCCGAGCA GAAGTTGCAG
TCGTACAAGA CCGATCTGAA GCGGTTTACG AACCTGAAGG CGGCTGTCAA ATTGCGCTAT
GCCGAGTCGG TGGACTACCG CGACTTCGAA CCCAAGATCA AGAAGCTCTT GGACACGCAC
ATTTCGGCGA GCGAAGTGGT TCGGCTGAAT GAGCCCGTGA ACATCTTCGA CGAGCAGGCG
TTCAAAGAGA TGGTCGAGGA TCGAGGCGAA GCCAAGACCA CCGCCGCGAA GGCCGACATG
ATCGCGCATG CGACGAAGAA GGCCATCAGC GAGCGCCTGG AACAGGACCC CGCGTTCTAC
GAGAAGTTCT CGAAACTCAT TCAGCAAGCC ATCGACGACT TCCGCGCCCA ACGCATCTCG
GACCTGGAGT ACCTGAACAA GGTCAGCGAG ATCAAGGATG CGGTGGTCAA CCGGCGCAGT
GACGACGCAC CAGCGCAGTT GGCCGGCAAC GACAACGCCC TTGCTTTGTA CGGCGTACTC
AAGCCCTATG TGAGCGGCCA TGTGAAGGCG GAAGACGTGG CGGCGACGCT TGCTGCCGAT
GCGGCCGTCG ATATCTGGTC GATCATTCAG CGCAACAAGA AGGTGGGCTT TTGGGACGAT
TTGGACGCAC AACGTCGAAC GATGAACGAG ATCGACGACT ACCTGTACGA CGAGGTCAAG
GACAAGAAAG GTGTGCAGCT TTCGACTGGC GAGATGGACG ACATCATCGA GCGGACGATG
CAGCTGGCAC GGCACCGGAT GGTGGTATGA
 
Protein sequence
MDDMDSFRFN EKYLSQIPAL QLFINLGYAY LTPEQALAAR SGKPGQVLLE EVLRERLKKI 
NRIQYKGQSY LFSEENIQTA IQRLKNVKYD GLLKTNEAIY DLLTLGVALE QSIEGDSKSF
TLNYIDWKNP ANNVYHVTAE FPVERTRSSE TCRPDIVLFV NGIPFAVIEC KSPKVEVEQA
VSQMIRNQRE EYIPKLFTYA QLLLATNKNA VRYASTGTPA KFWALWREDI APEVLRRVLD
RPLPLEAKAG LFDLIWDGQS LHEPEFGKSW AITEQDRMLY ALCRPERLLD LAFRFTVFDA
GIRKVARYQQ FFAIQRVLER VKHRDDVGRR LGGIIWHTQG SGKSLTMVML ARALALDPDI
RNPRIVLVTD RVDLDKQLGN TFAACGLTPD RADSGRHLLE LVSENKAHIV TTLIHKFDKA
LSIKKHVEES ADIFILVDES HRTNFGGFAA RMRQMFPMAC YLGFTGTPLM KKEKNNFAKF
GGLIDTYAIN QAVKDGAVVP LLYEARHVEM EQNEKAIDTW FERHTQGLTD AQKADLKKKY
SRAEMLNKAD QVIYMRAFDI SEHYRQNWQG TGFKAQLVAP SKAAALTYKK FLDEIGHVTS
EVIISPPDTR EGNEETDDKP TDEVVAFWER MMKRYGSEEE YNKQLINQFK YGDDPEILIV
VDKLLTGFDA PRNTVLYLTR KLREHTLLQA IARVNRLYDD DEGKQPKEFG YIIDYAGILG
ELDQALTAYS ALDGFDEADL AGTLVSIQEE VRKLPQRHAD LWDLFKEVKN QHDEEAYEQL
LADEKLRETF YERLAAFGKT LSIAMSSEQF IAVTPEQKLQ SYKTDLKRFT NLKAAVKLRY
AESVDYRDFE PKIKKLLDTH ISASEVVRLN EPVNIFDEQA FKEMVEDRGE AKTTAAKADM
IAHATKKAIS ERLEQDPAFY EKFSKLIQQA IDDFRAQRIS DLEYLNKVSE IKDAVVNRRS
DDAPAQLAGN DNALALYGVL KPYVSGHVKA EDVAATLAAD AAVDIWSIIQ RNKKVGFWDD
LDAQRRTMNE IDDYLYDEVK DKKGVQLSTG EMDDIIERTM QLARHRMVV