Gene Bpro_1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1944 
Symbol 
ID4015425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp2004590 
End bp2007694 
Gene Length3105 bp 
Protein Length1034 aa 
Translation table11 
GC content60% 
IMG OID637941611 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_548773 
Protein GI91787821 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.31477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAG ACCAACTTGA ACAGGAAACC TTAGCCTGGC TGGCCGATGT GGGATACACC 
CATCGCTATG GTCCCGACAT CGCCCATGAT GGTGCTATGC CCGAGCGCAG CAGCTACCGC
CAGGTGTTGC TGCCGTTTCG CCTGCGCGAG GCCATCCAGA AGCTCAATCC CGGGATCCCT
ACTGCCGCCC GTGAAGATGC CCTCAAGCAG GTGCTTGATC TTGGCATACC CGCATTGCTG
ACGGCCAACC AGCACTTCCA CCGGCTGCTG GTGGCCGGTG TGCCGGTGCG GTACCAGAAG
GACGGTGAAA CTCGGGGCGA CTTTGTGCGC CTGATTGACT GGGCTGATCC TGAGCGCAAC
GAGTGGCTGG CGGTTAACCA GTTCAGCATC AAGGGCCCTC ACCACACACG CCGGCCCGAC
ATTGTCTTGT TCGTCAACGG CCTGCCGCTG GTGCTGCTAG AACTGAAGAA CCCGGCGGAT
AAAAACGCCG ACATCTGGCG AGCTTACGAC CAGATTCAGA CGTACAAAGA GCAGATTCCC
GACGTATTTC AGACTAATGA GGTGCTGGTC ATCACCGACG GTACCGAGGC GTTGATGGGG
TCGTTGTCGG CCGATGCCGA GCGTTTCATG GCCTGGCGCA CCATCGACGG CGTGACGCTG
GATCCCTTGG GCGAGTTCAA CGAACTGCAA ACCCTGGTGC GCGGCGTGCT GGCGCCGGCC
TATTTCCTGG ACTTCCTCCG TTACTTTGTG CTGTTCGAGG ACGATGGCGC CCTGGTCAAA
AAGATCGCCG GTTACCACCA GTTCCACGCT GTGCGTGCTG CCATTCTCCA GGTGGTGGAC
GCATCGCGTC CGGGAGGCTC CCAAAAAGGC GGTGTGGTCT GGCACACCCA GGGTAGCGGC
AAGAGCATCA CCATGACCTG CTTTGCCGCC CGCGTCATGC AGGAACCGGC GATGGAGAAC
CCCACCATCG TCGTCATCAC CGACCGCAAC GACCTGGACG GGCAGTTGTT TGGCGTGTTC
AGCCTGGCAC AAGACCTGTT GCGCGAGCAG CCGGTGCAGG TGAGTACCCG TCAGGATCTA
CGCGCCAAAT TGGCCAACCG ACCTTCTGGC GGCATCGTAT TTGCAACTAT CCAGAAATTC
ATGCCTGGCG AGGACGAGGA CACGTTCCCA ATGCTGTCCG ACAGGCACAA CATCGTCGTG
ATCGCCGATG AGGCTCACCG CACACAATAT GGCTTTGAGG CTAAGCTCAA AGGCAAACAA
GGGCAGGAAA GTTACCAGGG CGGCTATGCC CAGCACTTGC GCGATGCGTT GCCTAACGCG
ACCTTTGTTG CATTTACCGG CACTCCCGTC AGCAGCACCG ACCGCGACAC CCGGGCCGTG
TTCGGCGACT ACATCCATAT CTACGACATG CAACAGGCCA AGGACGATGG CGCCACCGTG
GCCATCTACT ACGAGTCGCG CCTGGCCAAA CTCAGCCTGA TTGAGTCCGA CCTGTCGCAG
ATGGACGAGG AGGTCGACGA ACTGGCCGAG GACGACGAGG AAAGCCAGCA GGCCGTCCTC
AAAAGCCGCT GGGCCGCTCT CGAGAAGGTT GTTGGCGCGG AACCGCGCGT GGCCAGCGTG
GCAGCCGATT TGGTAACTCA CTTCGAGGAA CGCAACAAGG CACAGCCCGG CAAGGCGATG
GTTGTGGCCA TGAGCCGCGA CATCTGCGTC CACCTGTACA ACGAGATCGT AAAGCTGCGC
CCGGACTGGC ACGATGCTGA CCCGGAGAAG GGCGCCGTCA AGATCGTCAT GACCGGCTCG
GCAAGTGACA AACCACTGCT GCGTCCGCAC ATTTACAGCG CCCAGACCAA GAAGCGGCTG
GAAAAGCGCT TCAAGGACCC GGCCGATCCG CTGCGCCTCG TGATCGTGCG TGACATGTGG
CTGACTGGCT TTGACGCCCC TTGCGTGCAT ACCCTGTACG TGGACAAGCC CATGAAGGGC
CATAACCTGA TGCAGGCCAT AGCCCGCGTG AATCGTGTGT TCAAGGACAA GCAAGGCGGC
CTGGTGGTGG ACTATATCGG CATCGGCAAC GAGCTGAAGT CGGCCATGAA GGAATACACC
CAATCAAAGG GCCGTGGCCG TCCCACGGTG GACGCGCATG AGGCGTATAG CGTGCTCGCG
GAAAAGCTTG ATGTGCTGCG CTCCATGCTG CACGGCTTTG ATTACACCGG ATTCTTGACT
GGCGGACACA AAACCCTGGC CGGTGCCGCC AACCATGTGC TGGGCCTCAA CACCGGCTCG
TCCCGGGACG GCAAGAAGCG ATTTGCCGAC ACTGCGCTGG CCATGAGCCA GGCGTTTACG
CTGTGCTGCA CGCTTGACGA AGCCAAGGCA GTGCGCGAGG AAGTGGCCTT TATGCAGGGC
GTGAAGGTCA TCTTGACGAA GAAGGAGCTG ACGGCCAAGA GGCGCACGGA TGAGGCGCGC
GAACTGGCTA TCCGGCAGAT CATCAGCTCG GCGGTGGTGT CCGAGAGTGT GGTGGACATC
TTTGACGCCG TGGGCCTGGA CAAGCCCAAC ATCGGCCTGC TGGATGACGA ATTCCTTGCC
CAGGTCAAAA ACCTGCCGGA AAAGAACCTG GCTGTGGAGC TGCTGGAGCG GCTGCTGGAG
GGGGAAATCA AAAGTCGCTT CACAAGCAAT GTCGTACAGG ACAAGAAGTT CTCGGAGATG
CTGGCCAACG TCATCACTCG CTATCAAAAC CGCTCGATTG AAACAGCGCA GGTCATGGAA
GAGCTGGTGG AAATGGCCAA GAAATTCCGT GAAGCGGCAT CCCGCGGAGA GTCGCTGGGG
CTGACCGAGG ACGAGGTGCG CTTTTACGAT GCGCTGGCTA ACAACGAGTC AGCGGTACGC
GAGCTGACCG ACGAAACCCT CAAGAAGATC GCCCACGAGC TGACCGAGAA CCTGCGTCAG
AACCTGAGCG TGGACTGGTC AGAACGCGAG AGCGTGCGAG CCAAATTGCG CTTGATGGTC
AAGCGCATTC TGCGCAAATA CAAATACCCA CCGGATCTGC AGGACGCTGC TGTGGAGCTG
GTGCTGAAGC AGGCGCAGGT GATGGGGGAG AGTTGGTCGG TGTGA
 
Protein sequence
MTEDQLEQET LAWLADVGYT HRYGPDIAHD GAMPERSSYR QVLLPFRLRE AIQKLNPGIP 
TAAREDALKQ VLDLGIPALL TANQHFHRLL VAGVPVRYQK DGETRGDFVR LIDWADPERN
EWLAVNQFSI KGPHHTRRPD IVLFVNGLPL VLLELKNPAD KNADIWRAYD QIQTYKEQIP
DVFQTNEVLV ITDGTEALMG SLSADAERFM AWRTIDGVTL DPLGEFNELQ TLVRGVLAPA
YFLDFLRYFV LFEDDGALVK KIAGYHQFHA VRAAILQVVD ASRPGGSQKG GVVWHTQGSG
KSITMTCFAA RVMQEPAMEN PTIVVITDRN DLDGQLFGVF SLAQDLLREQ PVQVSTRQDL
RAKLANRPSG GIVFATIQKF MPGEDEDTFP MLSDRHNIVV IADEAHRTQY GFEAKLKGKQ
GQESYQGGYA QHLRDALPNA TFVAFTGTPV SSTDRDTRAV FGDYIHIYDM QQAKDDGATV
AIYYESRLAK LSLIESDLSQ MDEEVDELAE DDEESQQAVL KSRWAALEKV VGAEPRVASV
AADLVTHFEE RNKAQPGKAM VVAMSRDICV HLYNEIVKLR PDWHDADPEK GAVKIVMTGS
ASDKPLLRPH IYSAQTKKRL EKRFKDPADP LRLVIVRDMW LTGFDAPCVH TLYVDKPMKG
HNLMQAIARV NRVFKDKQGG LVVDYIGIGN ELKSAMKEYT QSKGRGRPTV DAHEAYSVLA
EKLDVLRSML HGFDYTGFLT GGHKTLAGAA NHVLGLNTGS SRDGKKRFAD TALAMSQAFT
LCCTLDEAKA VREEVAFMQG VKVILTKKEL TAKRRTDEAR ELAIRQIISS AVVSESVVDI
FDAVGLDKPN IGLLDDEFLA QVKNLPEKNL AVELLERLLE GEIKSRFTSN VVQDKKFSEM
LANVITRYQN RSIETAQVME ELVEMAKKFR EAASRGESLG LTEDEVRFYD ALANNESAVR
ELTDETLKKI AHELTENLRQ NLSVDWSERE SVRAKLRLMV KRILRKYKYP PDLQDAAVEL
VLKQAQVMGE SWSV