Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_4782 |
Symbol | |
ID | 4685973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008761 |
Strand | + |
Start bp | 19345 |
End bp | 22434 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639826771 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_973933 |
Protein GI | 121583507 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 140 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAG ACCAACTCGA ACAAGAAACC CTGGCCTGGC TTCAGGACGT GGGCTACACC TGCCACTGCG GCTACGACAT CGCGCCTGAC GGTCCTGCGC CCGAGCGCAG CAGTTTCAGC CAGGCGCTGC TGCCCTTCCG GCTGCGCGAG GCCATCCACA AGCTCAATCC CGGCATCCCG ACCCCTGCCC GCGAAGACGC CTTCAAGCAG GTTCTTGACT TGGGCATCCC GGCGCTGCTG AGCGCCAACC GGCATTTCCA CAAGCTGCTG GTGGGCGGCG TGCCCGTGGA GTACCAGAAA GACGGCCAGA CCCGGGGCGA CTTCGTGCGC CTGATCGACT GGGCGCAGCC GGCGCGCAAC GAGTTTCTGG CGGTCAACCA GTTTTCCCTC AAAGGTGCGC ACCACACGCG CCGCCCCGAC ATCATCCTGT TCGTCAACGG CCTGCCCTTG GTGCTGCTCG AACTCAAGAA TCCCGCCGAC CTCAACGCCA ATGTGTGGAA AGCCTACGAC CAGATCCAGA CCTACAAGGC GCAGATCCCG GGCGTGTTCG AGTACAACGA AGTGCTGGTG ATTTCGGACG GCACCGAGGC GCTGCTGGGG TCCTTGTCTA GCAGCAGCGA GCGCTTCATG GCCTGGCGCA CGATTGACGG CCAGGCGCTG GACCCGCTGG GCCAATTCAA CGAGCTGCAG ACCCTGGTGC GCGGCGTGCT GGCCCCGGCG TACCTGCTGG ACTACCTGCG CTACTTCGTG CTGTTCGAGG ACGACGGCCA GCTGGCCAAG AAAATCGCCG GCTACCACCA GTTCCATGCG GTCCGCTCGG CCATTACCCA GGTCGTGACC GCCTCTCGCC CCGGCGGCAC CCACAAGGGC GGCGTGGTCT GGCACACCCA GGGCAGTGGC AAGAGCATCA CCATGACCTG CTTTGCTGCT CGCGTGATGC AGGAGCCGGC GATGGAGAAC CCCACCATCG TGGTGATTAC CGACCGCAAC GACCTGGACG GCCAGCTCTT TGGCGTGTTC AGCCTGGCCC AGGATCTGCT GCGCGAGCAG CCGGTGCAGG TCAGCACGCG GCAGGATCTG CGGACCAGGC TGGCGAACCG GCCCTCGGGC GGCATCGTGT TCGCCACCAT CCAGAAATTC ATGCCGGGCG AGGATGAGGA CACCTTCCCT ACCCTGTCCG AGCGCCACAA CATCGTGGTG ATTGCCGACG AGGCGCACCG CACCCAGTAC GGCTTCGAGG CCAAGCTCAA GGGCAAGCCC GGACACGAGA CCTACCAGGT CGGCTACGCC CAGCACCTGC GCGACGCGCT GCCCAACGCC ACCTTCGTGG CCTTCACCGG CACCCCGGTC AGCAGTGAAG ATCGCGACAC GCGCGCCGTG TTCGGCGACT ACATCTCGGT CTATGACATG CAGCAGGCCA AGGAGGACGG CGCCACGGTC GCCATCTACT ACGAGTCGCG CCTGGCCAAG TTGAAGCTCA AGGAAGAAGA TTTTTCGCTG ATCGACGAGG AGGTCGATGA GCTGGCCGAA GACGAGGAGG AAAGCACCCA GGCCAAGCTC AAAAGCCGCT GGGCTGCCCT GGAGAAGGTG GTCGGCGCTG AACCGCGCGT GGCCAGCGTG GCGGCCGACC TCGTGGCGCA TTTCGAGGAG CGCAACAAGG CCCAGAGCGG CAAGGCCATG ACGGTGGCCA TGAGCCGCGA CATCTGCGTG CATCTGTACA ACGAGATCGT CAAGCTGCGC CCGGACTGGC ACGACCCGGA TCCAGAAAAG GGCGCCATCA AGATCGTGAT GACCGGGTCC AGCAGCGACA AGGCGCTGCT GCGCCCGCAC ATCTACAGCG CCCAGGTCAA GAAACGCCTG GAAAAGCGCT TCAAGGATCC GGCCGACCCG CTGCGCCTGG TCATCGTGCG CGACATGTGG CTCACCGGCT TTGACGCGCC GTGCGTGCAT ACCCTCTACG TGGACAAGCC CATGAAGGGC CACAACCTGA TGCAGGCGAT TGCCCGGGTC AACCGCGTGT TCAAGGACAA GCAGGGCGGC CTGGTGGTGG ACTACATCGG CATCGGCAAT GAACTCAAAG CCGCCATGAA GGAATACACC CAGAGCAAAG GCCGGGGTCG GCCCACGGTG GACGCGCATG AAGCGTATAG CGTGCTGGCC GAAAAACTCG ACATCCTGCA AACGATGCTG CACGGCTACG ACTACAGCGG TTTTCTGACC GGCGGCCACA AGGCGCTGGC CGGCGCCGCC AACCATGTGC TGGGCGCCCA GGACGGCAAG AAGCGCTTTG CCGACACGGC CCTACAGATG AGCAAGGCGT TCAGCCTGTG CTGCACGCTG GACGAGGCCA AGGCGGTGCG CGAGGAGGTG GCCTTTTTGC AGGGCGTCAA GGTCATCCTG ACCAAAAAGG ATTTGAGCGC GCAAAAGAAG ACCGACGAGC AGCGCGACCT GGCCATCCGG CAGATCATCA ACTCGGCCGT GGTGTCGGAC AGCGTGGTGG ACATTTTCGA TGCCGTCGGG CTGGACAAGC CCAACATCGG ACTGCTGTCC GACGAGTTCC TGGCGCAGGT GAAAAACCTG CCGGAGAAGA ACCTGGCGGT GGAATTGCTG GAGCGGCTGC TGGAGGGCGA GATCAAGAGC CGGTTTGCCA GCAACGTGGT GCAGGAGAAG AAGTTTTCCG AGCTGCTTGC CGGTGTCATC AAGCGCTACC AGAATCGCTC CATCGAGACC GCCCAGGTCA TGGAGGAGCT GGTGGCGATG GCCAGGAAGT TTCAGGAGGC GGCCAACCGA GGCGAAGCAC TGGGCCTCAC CGAGGACGAG ATCAAGTTTT ATGACGCGCT GGCCACCAAC GAATCGGCCG TGCGGGAACT GACCGATGAA ACCCTCAAGA AGATCGCCCA TGAGCTGACC GAGAACCTGC GCCAAAACCT CAGCGTGGAC TGGTCGGAGC GCGAGAGCGT GCGCGCCAGG CTGCGCCTGA TGGTCAGGCG CATCCTGCGC AAATACAAGT ACCCGCCCGA CCTGCAGGAT GCGGCGGTGG AACTGGTGCT GCAGCAGGCG CAGGCGCTAG GAACCGTATG GATGATCTAG
|
Protein sequence | MTEDQLEQET LAWLQDVGYT CHCGYDIAPD GPAPERSSFS QALLPFRLRE AIHKLNPGIP TPAREDAFKQ VLDLGIPALL SANRHFHKLL VGGVPVEYQK DGQTRGDFVR LIDWAQPARN EFLAVNQFSL KGAHHTRRPD IILFVNGLPL VLLELKNPAD LNANVWKAYD QIQTYKAQIP GVFEYNEVLV ISDGTEALLG SLSSSSERFM AWRTIDGQAL DPLGQFNELQ TLVRGVLAPA YLLDYLRYFV LFEDDGQLAK KIAGYHQFHA VRSAITQVVT ASRPGGTHKG GVVWHTQGSG KSITMTCFAA RVMQEPAMEN PTIVVITDRN DLDGQLFGVF SLAQDLLREQ PVQVSTRQDL RTRLANRPSG GIVFATIQKF MPGEDEDTFP TLSERHNIVV IADEAHRTQY GFEAKLKGKP GHETYQVGYA QHLRDALPNA TFVAFTGTPV SSEDRDTRAV FGDYISVYDM QQAKEDGATV AIYYESRLAK LKLKEEDFSL IDEEVDELAE DEEESTQAKL KSRWAALEKV VGAEPRVASV AADLVAHFEE RNKAQSGKAM TVAMSRDICV HLYNEIVKLR PDWHDPDPEK GAIKIVMTGS SSDKALLRPH IYSAQVKKRL EKRFKDPADP LRLVIVRDMW LTGFDAPCVH TLYVDKPMKG HNLMQAIARV NRVFKDKQGG LVVDYIGIGN ELKAAMKEYT QSKGRGRPTV DAHEAYSVLA EKLDILQTML HGYDYSGFLT GGHKALAGAA NHVLGAQDGK KRFADTALQM SKAFSLCCTL DEAKAVREEV AFLQGVKVIL TKKDLSAQKK TDEQRDLAIR QIINSAVVSD SVVDIFDAVG LDKPNIGLLS DEFLAQVKNL PEKNLAVELL ERLLEGEIKS RFASNVVQEK KFSELLAGVI KRYQNRSIET AQVMEELVAM ARKFQEAANR GEALGLTEDE IKFYDALATN ESAVRELTDE TLKKIAHELT ENLRQNLSVD WSERESVRAR LRLMVRRILR KYKYPPDLQD AAVELVLQQA QALGTVWMI
|
| |