Gene Msil_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1948 
Symbol 
ID7094066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2122339 
End bp2125287 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content56% 
IMG OID643465275 
Producttype III restriction protein res subunit 
Protein accessionYP_002362253 
Protein GI217978106 
COG category[V] Defense mechanisms 
COG ID[COG3587] Restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTA AATTCGATCC CTCACTTCAG TACCAGCAGG ATGCCGTCAG CGCCGTCGTT 
GGGGCGTTCG AGGGGCAGCC CTTCGTGCAG ACCGGGGCAA TGGCGTTTCA GTCGCTTCAG
ATCGGCGGTC TGTTTCAGAC GGAGCTGGGG CTGGGCAATC TCCTCAACAT TGGCGATGAG
CAAATTCTCG CAAACGTCCG GGCCGTTCAG GAAGCTAATG AGATCGAGAA GGTAATTGCC
CTCAACGGGC GTGAGTTCTC AGTCGAGATG GAGACCGGCA CCGGCAAGAC CTACGTCTAT
CTTCGGACGA TTTTTGAGCT GAACAAGACC TACGGCTTCA AGAAGTTTAT CATCGTGGTT
CCGAGTGTCG CCATTCGTGA GGGCGTGCTT AAGAGCATCG AGGTAACGAA GGAGCACTTC
CACACGCTCT ACGACAACGC GCCCTTCGAT CACTTTGTCT ATGACTCAAA GCGTCTAGGC
AAAGTGCGCC AGTTCGCGAC CAGCAATCAG ATTCAGATCA TGGTCATCAA CATCCAGTCC
TTCCAGAAGG ATGTTGCCGA CAAGGACCTC TCGGAGATGA CCGAGGATGA GCTGAAGAAG
CTCAATGTCA TCAACCGTGA GAATGATCGC ATGTCGGGCC GCAGGCCTAT CGAGTTCATT
CAGGCGGCCA GCCCCGTTGT CATCATCGAC GAACCCCAGA GCGTCGATAC GACCGAGAAA
TCACGGCGGG CGATTGGCAA CCTCAATCCA ATGGCGACGC TGCGTTACAG CGCGACGCAT
CGCAATCCCT ATAACCTCCT CTACAAGCTC GACCCGATCA AGGCTTACGA CCTACGGCTC
GTGAAGAGGA TCGAAGTCGC ATCCGTCCGG TCGGATGACA ATTTCAATGA TGCATACGTG
AAGCTGCTCA AGACAGACAA CAAGACCGGC ATCAAAGCGC AAATCGAGAT TCACAGGGAA
GGTGCCACTG GCCCCAAAGC GGCGAAGCTT TGGGTCAAGC AGGGCGATGA CTTGTACGTG
AAGTCGGACG AGCGCGACGC TTATCGCGAC GGCTACATCG TGCAGAACAT CGATTGCACT
CCGGGCTCCG AATATATCGA GTTCAATCAA GGCCGCTTCC TTGAGCTGGG TCAGGAAGTC
GGCGGGCTTG GCGAAGACAT TATGAAGGCT CAGGTCTATG AGACCGTCGA GCAGCATCTA
AAGAAGGAGC GCGCCCTGAA GGGCAAGGGC ATCAAGGTGC TCTCGCTGTT CTTCATCGAC
CGCGTCGCCA ACTACCGCAT CTACAATGAG GACGGGACGA CCAGCCTTGG CAAGATCGGT
CAGTGGTTTG AGGAGGCCTA TCAGCAGCTC ACGGCCAAGC CCATCTACAA GGGCCTTATC
CCATTCAGCG TTGCCGATGT TCACAACGGC TACTTCTCGC AGGACAAGCA GGGCCACGCC
AAGGACACAC GCGGGAACAC CGCCGACGAT GATGACACTT ACAGCCTCAT CATGCGCGAC
AAGGAGCGGC TTCTCGATCC TAACGTTGCA CTGCGCTTCA TCTTCTCCCA CTCCGCCCTG
CGCGAGGGCT GGGACAATCC AAATGTGTTC CAGATTTGCA CCCTGAACGA GACGCAATCA
GCCGAGCGCA AGCGGCAGGA AATCGGGCGC GGGTTGCGTC TGCCTGTCAA TGAGACCGGC
GAGCGCGTTC ATGACGAAAC GATCAATCGT CTGACCGTCA TCGCCAACGA GTCATATGAG
GATTTTGCGC GCACGCTTCA GACCGAGTTT GAAGAGGATT TTGGCATCAA GTTCGGAAGG
ATCGAGAAGA TCGCTTTCGC AAAGCTCGTG CGACGGGCTG CGGATGGAAC CGATGTCGAA
CTCGGGCAGG ACGAGTCCGT GAAGATTTGG CACGAGCTCG TTGCGAAGGG CTACCTAAAT
GGCGCGGGCG ATATTCTGGA GAAGTTTGAC CCGAAGAACC CCCATTTCAA ACTGGAAATT
TCAGACGCGT TCGCTGATCT CCGGGCGGAA ATCATCGACG AGGTGAACCG CAAGCTCTTC
AAGAACCGTA TCGTCAATGT CCGCGATGAG CGCACCCTGA AATTCCGGAA AGAGGTGCAT
CTCAGCGCCG ACTTCCAGGC TCTCTGGGAT AAGATCAAAC ATCGCACGCG TTACCGCGTG
ACTTTTGAAA CCGCTGCACT GATCGACCGG GCGCTCTCGC GCATCAAGCA GATCGAACCG
ATCAAGGCAG CGCGCATCGA GACCACCGTC GTTGAGGTGG ATATTACCGA TGCCGGTGTC
TCCGCCGACC GACAGATTTC GTCGCGAGTG AGGGACGTGC AGCAGGTAAA GGTCTTGCCG
GACATTCTCG CCTTCCTGCA GAAGGAGACT GAGCTGACCC GCCACACGCT TGCCGAAATC
CTCAAGCGCT CGGGGCGGCT CGGCGAGTTC AAGATTAATC CGCAGGCTTT CATGGCAGCC
GCTGCGAAGG AAATATCGCG CGCGCTGCAT GACCTGATGC TCGAAGGCAT CAAATACGAG
AAGGTCGCAG GCCAGCATTG GGAAATGAGC CGGATCGAGC AGGATGCCGA AGACGGCATC
GTCCGGTATC TCGGCAATCT CTATGAGGTT CAAAACCGCG AGAAGTCGCT CTTCGATGCA
ATTGTTTATG AATCCGAGGT TGAGAAGCAA TTCGCACGCG ACCTCGACAG CAATGAGAAC
GTGAAGCTAT TCGTCAAGCT GCCGTCGTGG TTCAAGATCG ACACGCCCAT CGGCACCTAT
AATCCCGACT GGGCCTTTGT GACCGAGCGC GAGGAGAAGC TTTATTTCGT TCGTGAGACG
AAGAGCACGC TCGACAGCGA GGAGCGGCGC ACTAAGGAAA ACCAAAAGAT CGCCTGTGGT
CGCAAGCATT TCGATTCGCT TGGGGTAGAC TATGCCGTGG TCACTTCGCT TGCAGACGTG
GCGATGTGA
 
Protein sequence
MKLKFDPSLQ YQQDAVSAVV GAFEGQPFVQ TGAMAFQSLQ IGGLFQTELG LGNLLNIGDE 
QILANVRAVQ EANEIEKVIA LNGREFSVEM ETGTGKTYVY LRTIFELNKT YGFKKFIIVV
PSVAIREGVL KSIEVTKEHF HTLYDNAPFD HFVYDSKRLG KVRQFATSNQ IQIMVINIQS
FQKDVADKDL SEMTEDELKK LNVINRENDR MSGRRPIEFI QAASPVVIID EPQSVDTTEK
SRRAIGNLNP MATLRYSATH RNPYNLLYKL DPIKAYDLRL VKRIEVASVR SDDNFNDAYV
KLLKTDNKTG IKAQIEIHRE GATGPKAAKL WVKQGDDLYV KSDERDAYRD GYIVQNIDCT
PGSEYIEFNQ GRFLELGQEV GGLGEDIMKA QVYETVEQHL KKERALKGKG IKVLSLFFID
RVANYRIYNE DGTTSLGKIG QWFEEAYQQL TAKPIYKGLI PFSVADVHNG YFSQDKQGHA
KDTRGNTADD DDTYSLIMRD KERLLDPNVA LRFIFSHSAL REGWDNPNVF QICTLNETQS
AERKRQEIGR GLRLPVNETG ERVHDETINR LTVIANESYE DFARTLQTEF EEDFGIKFGR
IEKIAFAKLV RRAADGTDVE LGQDESVKIW HELVAKGYLN GAGDILEKFD PKNPHFKLEI
SDAFADLRAE IIDEVNRKLF KNRIVNVRDE RTLKFRKEVH LSADFQALWD KIKHRTRYRV
TFETAALIDR ALSRIKQIEP IKAARIETTV VEVDITDAGV SADRQISSRV RDVQQVKVLP
DILAFLQKET ELTRHTLAEI LKRSGRLGEF KINPQAFMAA AAKEISRALH DLMLEGIKYE
KVAGQHWEMS RIEQDAEDGI VRYLGNLYEV QNREKSLFDA IVYESEVEKQ FARDLDSNEN
VKLFVKLPSW FKIDTPIGTY NPDWAFVTER EEKLYFVRET KSTLDSEERR TKENQKIACG
RKHFDSLGVD YAVVTSLADV AM