Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_1948 |
Symbol | |
ID | 7094066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | + |
Start bp | 2122339 |
End bp | 2125287 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643465275 |
Product | type III restriction protein res subunit |
Protein accession | YP_002362253 |
Protein GI | 217978106 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 81 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTA AATTCGATCC CTCACTTCAG TACCAGCAGG ATGCCGTCAG CGCCGTCGTT GGGGCGTTCG AGGGGCAGCC CTTCGTGCAG ACCGGGGCAA TGGCGTTTCA GTCGCTTCAG ATCGGCGGTC TGTTTCAGAC GGAGCTGGGG CTGGGCAATC TCCTCAACAT TGGCGATGAG CAAATTCTCG CAAACGTCCG GGCCGTTCAG GAAGCTAATG AGATCGAGAA GGTAATTGCC CTCAACGGGC GTGAGTTCTC AGTCGAGATG GAGACCGGCA CCGGCAAGAC CTACGTCTAT CTTCGGACGA TTTTTGAGCT GAACAAGACC TACGGCTTCA AGAAGTTTAT CATCGTGGTT CCGAGTGTCG CCATTCGTGA GGGCGTGCTT AAGAGCATCG AGGTAACGAA GGAGCACTTC CACACGCTCT ACGACAACGC GCCCTTCGAT CACTTTGTCT ATGACTCAAA GCGTCTAGGC AAAGTGCGCC AGTTCGCGAC CAGCAATCAG ATTCAGATCA TGGTCATCAA CATCCAGTCC TTCCAGAAGG ATGTTGCCGA CAAGGACCTC TCGGAGATGA CCGAGGATGA GCTGAAGAAG CTCAATGTCA TCAACCGTGA GAATGATCGC ATGTCGGGCC GCAGGCCTAT CGAGTTCATT CAGGCGGCCA GCCCCGTTGT CATCATCGAC GAACCCCAGA GCGTCGATAC GACCGAGAAA TCACGGCGGG CGATTGGCAA CCTCAATCCA ATGGCGACGC TGCGTTACAG CGCGACGCAT CGCAATCCCT ATAACCTCCT CTACAAGCTC GACCCGATCA AGGCTTACGA CCTACGGCTC GTGAAGAGGA TCGAAGTCGC ATCCGTCCGG TCGGATGACA ATTTCAATGA TGCATACGTG AAGCTGCTCA AGACAGACAA CAAGACCGGC ATCAAAGCGC AAATCGAGAT TCACAGGGAA GGTGCCACTG GCCCCAAAGC GGCGAAGCTT TGGGTCAAGC AGGGCGATGA CTTGTACGTG AAGTCGGACG AGCGCGACGC TTATCGCGAC GGCTACATCG TGCAGAACAT CGATTGCACT CCGGGCTCCG AATATATCGA GTTCAATCAA GGCCGCTTCC TTGAGCTGGG TCAGGAAGTC GGCGGGCTTG GCGAAGACAT TATGAAGGCT CAGGTCTATG AGACCGTCGA GCAGCATCTA AAGAAGGAGC GCGCCCTGAA GGGCAAGGGC ATCAAGGTGC TCTCGCTGTT CTTCATCGAC CGCGTCGCCA ACTACCGCAT CTACAATGAG GACGGGACGA CCAGCCTTGG CAAGATCGGT CAGTGGTTTG AGGAGGCCTA TCAGCAGCTC ACGGCCAAGC CCATCTACAA GGGCCTTATC CCATTCAGCG TTGCCGATGT TCACAACGGC TACTTCTCGC AGGACAAGCA GGGCCACGCC AAGGACACAC GCGGGAACAC CGCCGACGAT GATGACACTT ACAGCCTCAT CATGCGCGAC AAGGAGCGGC TTCTCGATCC TAACGTTGCA CTGCGCTTCA TCTTCTCCCA CTCCGCCCTG CGCGAGGGCT GGGACAATCC AAATGTGTTC CAGATTTGCA CCCTGAACGA GACGCAATCA GCCGAGCGCA AGCGGCAGGA AATCGGGCGC GGGTTGCGTC TGCCTGTCAA TGAGACCGGC GAGCGCGTTC ATGACGAAAC GATCAATCGT CTGACCGTCA TCGCCAACGA GTCATATGAG GATTTTGCGC GCACGCTTCA GACCGAGTTT GAAGAGGATT TTGGCATCAA GTTCGGAAGG ATCGAGAAGA TCGCTTTCGC AAAGCTCGTG CGACGGGCTG CGGATGGAAC CGATGTCGAA CTCGGGCAGG ACGAGTCCGT GAAGATTTGG CACGAGCTCG TTGCGAAGGG CTACCTAAAT GGCGCGGGCG ATATTCTGGA GAAGTTTGAC CCGAAGAACC CCCATTTCAA ACTGGAAATT TCAGACGCGT TCGCTGATCT CCGGGCGGAA ATCATCGACG AGGTGAACCG CAAGCTCTTC AAGAACCGTA TCGTCAATGT CCGCGATGAG CGCACCCTGA AATTCCGGAA AGAGGTGCAT CTCAGCGCCG ACTTCCAGGC TCTCTGGGAT AAGATCAAAC ATCGCACGCG TTACCGCGTG ACTTTTGAAA CCGCTGCACT GATCGACCGG GCGCTCTCGC GCATCAAGCA GATCGAACCG ATCAAGGCAG CGCGCATCGA GACCACCGTC GTTGAGGTGG ATATTACCGA TGCCGGTGTC TCCGCCGACC GACAGATTTC GTCGCGAGTG AGGGACGTGC AGCAGGTAAA GGTCTTGCCG GACATTCTCG CCTTCCTGCA GAAGGAGACT GAGCTGACCC GCCACACGCT TGCCGAAATC CTCAAGCGCT CGGGGCGGCT CGGCGAGTTC AAGATTAATC CGCAGGCTTT CATGGCAGCC GCTGCGAAGG AAATATCGCG CGCGCTGCAT GACCTGATGC TCGAAGGCAT CAAATACGAG AAGGTCGCAG GCCAGCATTG GGAAATGAGC CGGATCGAGC AGGATGCCGA AGACGGCATC GTCCGGTATC TCGGCAATCT CTATGAGGTT CAAAACCGCG AGAAGTCGCT CTTCGATGCA ATTGTTTATG AATCCGAGGT TGAGAAGCAA TTCGCACGCG ACCTCGACAG CAATGAGAAC GTGAAGCTAT TCGTCAAGCT GCCGTCGTGG TTCAAGATCG ACACGCCCAT CGGCACCTAT AATCCCGACT GGGCCTTTGT GACCGAGCGC GAGGAGAAGC TTTATTTCGT TCGTGAGACG AAGAGCACGC TCGACAGCGA GGAGCGGCGC ACTAAGGAAA ACCAAAAGAT CGCCTGTGGT CGCAAGCATT TCGATTCGCT TGGGGTAGAC TATGCCGTGG TCACTTCGCT TGCAGACGTG GCGATGTGA
|
Protein sequence | MKLKFDPSLQ YQQDAVSAVV GAFEGQPFVQ TGAMAFQSLQ IGGLFQTELG LGNLLNIGDE QILANVRAVQ EANEIEKVIA LNGREFSVEM ETGTGKTYVY LRTIFELNKT YGFKKFIIVV PSVAIREGVL KSIEVTKEHF HTLYDNAPFD HFVYDSKRLG KVRQFATSNQ IQIMVINIQS FQKDVADKDL SEMTEDELKK LNVINRENDR MSGRRPIEFI QAASPVVIID EPQSVDTTEK SRRAIGNLNP MATLRYSATH RNPYNLLYKL DPIKAYDLRL VKRIEVASVR SDDNFNDAYV KLLKTDNKTG IKAQIEIHRE GATGPKAAKL WVKQGDDLYV KSDERDAYRD GYIVQNIDCT PGSEYIEFNQ GRFLELGQEV GGLGEDIMKA QVYETVEQHL KKERALKGKG IKVLSLFFID RVANYRIYNE DGTTSLGKIG QWFEEAYQQL TAKPIYKGLI PFSVADVHNG YFSQDKQGHA KDTRGNTADD DDTYSLIMRD KERLLDPNVA LRFIFSHSAL REGWDNPNVF QICTLNETQS AERKRQEIGR GLRLPVNETG ERVHDETINR LTVIANESYE DFARTLQTEF EEDFGIKFGR IEKIAFAKLV RRAADGTDVE LGQDESVKIW HELVAKGYLN GAGDILEKFD PKNPHFKLEI SDAFADLRAE IIDEVNRKLF KNRIVNVRDE RTLKFRKEVH LSADFQALWD KIKHRTRYRV TFETAALIDR ALSRIKQIEP IKAARIETTV VEVDITDAGV SADRQISSRV RDVQQVKVLP DILAFLQKET ELTRHTLAEI LKRSGRLGEF KINPQAFMAA AAKEISRALH DLMLEGIKYE KVAGQHWEMS RIEQDAEDGI VRYLGNLYEV QNREKSLFDA IVYESEVEKQ FARDLDSNEN VKLFVKLPSW FKIDTPIGTY NPDWAFVTER EEKLYFVRET KSTLDSEERR TKENQKIACG RKHFDSLGVD YAVVTSLADV AM
|
| |