Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_4334 |
Symbol | |
ID | 4685431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008757 |
Strand | + |
Start bp | 245524 |
End bp | 246882 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639826193 |
Product | HNH endonuclease |
Protein accession | YP_973358 |
Protein GI | 121582916 |
COG category | [V] Defense mechanisms |
COG ID | [COG1403] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.069794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.702083 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTTT TTGTATTGGA TCGGAGCGGG CAGCCGGTGA TGCCCTGCAG CGAAAAGCGC GCCCGGCTGC TTCTGCAGAG CAAGCGCGCC AGGGTTCACC GGGTCATGCC GTTCACGATT CGACTGATCG ACCGAAGCCA GGCCGACTGC CTGCTCCAGC CGCTTCGCCT CAAGCTCGAT CCGGGCAGCC GGGCCACGGG CCTGGCACTG GTGCGGGATA TTGAAACCAT TGAGCCCGCC ATGGGTGAGG TCACGCGCGG CGCGGCCGTG GTGAGCCTGC TTGAGCTGGA GCACCGCGGC AGGCAGATAT CAGAAGCGCT CACTGCGCGT CGCCAGATGC GCCGCCGGCG CCGCAACCAG CTGCGCTACC GCGCCCCGCG TTTTTTGAAC CGGGGCAACA AGCAAAAAGG TTGGCTCGCG CCTTCGTTGC AGCACCGGGT CGATACGACG GCGGCCTGGG TAGCACGCAT CCAGCGCTGG GCACCGGTGA CGGCGCTCAG CTCGGAACTG GTGCGCTTTG ACATGCAGCA ATTGCAAAAC ACCGAGATCG AAGGCGCCGA GTACAGCCAA GGCACGCTGG CGGGCTACGA AGTGCGCGAG TACCTGCTGG AGAAGTGGAA GCGCACCTGC GCCTACTGCG ATGCGCAAAA CACGCCGCTG CAGATCGAGC ACATCGAGCC CCGGGCGCGG GGCGGCAGCC ACCGGATCTC CAACCTATGC CTGGCCTGCC AGCCCTGCAA CCAGAAAAAA GCCGCGCGCA CGCTTCAAGA TTTCCTCAAG AAAGACCCCA AGCGCCTGGC GCGCATCCTC GCGCAAGCCC AGCGGCCGCT GCGCGATGCC GCAGCGGTCA ACGTCACGCG CTGGGCGCTG GCCAACGCAC TGAAGACCAC TGGCTTCCCG CTGGAGCTGG CCTCGGGTGG CCGGACCAAA TTCAACCGAT GCACGTTGGA CGTGCCCAAG ACGCACGCGC TGGATGCGGC CTGCGTGGGC CAGGTGGAGG CCATCACTGG CTGGCAGCAG CCTGCGTTCT ACACCTTGAC CATCAAGGCC ATGGGCCGGG GCAGCTACCA GCGCACTCGG CTGGACGCCT ACGGCTTTCC AAGAGGCTAC CTGATGCGTG CCAAGTCGGT CCACGGTTTC CAGACCGGGG ACCGGGTCAA GGCCGTCGTG CCCCAGGGCA AGAAGGTTGG CACGCATGTG GGGCGCGTGG CGATCCGCAA GACGGGCAGC TTCAACATCA CCACGCCGGC TGGCGTGGTT CAGGGCATCA GTCACAAGCA CTGCCGCATC GTTCAGCGGA ACGACGGGTA TGGCTATTTC TTCCACCGGG CCGATTTAAC ACAGGACGCG AACAGGTAA
|
Protein sequence | MAVFVLDRSG QPVMPCSEKR ARLLLQSKRA RVHRVMPFTI RLIDRSQADC LLQPLRLKLD PGSRATGLAL VRDIETIEPA MGEVTRGAAV VSLLELEHRG RQISEALTAR RQMRRRRRNQ LRYRAPRFLN RGNKQKGWLA PSLQHRVDTT AAWVARIQRW APVTALSSEL VRFDMQQLQN TEIEGAEYSQ GTLAGYEVRE YLLEKWKRTC AYCDAQNTPL QIEHIEPRAR GGSHRISNLC LACQPCNQKK AARTLQDFLK KDPKRLARIL AQAQRPLRDA AAVNVTRWAL ANALKTTGFP LELASGGRTK FNRCTLDVPK THALDAACVG QVEAITGWQQ PAFYTLTIKA MGRGSYQRTR LDAYGFPRGY LMRAKSVHGF QTGDRVKAVV PQGKKVGTHV GRVAIRKTGS FNITTPAGVV QGISHKHCRI VQRNDGYGYF FHRADLTQDA NR
|
| |