Gene Information |
Locus tag | Pnap_3261 |
Symbol | |
ID | 4687736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | - |
Start bp | 3462748 |
End bp | 3465753 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639836274 |
Product | hypothetical protein |
Protein accession | YP_983480 |
Protein GI | 121606151 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
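The coordinate and length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(3462748, 3465753)
print(length)                     # 3006
print(protein_length_aa(length))  # 1001
```

Both results match the Gene Length (3006 bp) and Protein Length (1001 aa) fields of the record.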
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.20626 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCACT TCATCATCGG GCTGGGCGGC ACGGGGGGCA AGATCATCCG CGCCCTGCGC AAGAGCCTGT ACCAGGAATT CCACGGCGGG CCGCCGGCCG GCGTCGGCAT CGGCTATCTG TATGTCGATT CCTCCAGCGA AATGATGGCC ATGGACGACC CGACGTGGAA GACGCTGGGC ACCTCGGTGC AGCTGTCCAA GGCCAGCCAG CTCCTGATCA CCGATGCCAA CCTGACCTCG CGGCTCGACA ACCTCGACAG CTATCCGGGC CTGAAGCACT GGCTGGGCAG CCCGCAGGAG TGGCGCGACA TCCTCAATAG CATCGTCGGC GCCACGCTCG GCGGCCAGAA GCGCCGGCTG GGCCGCTTTT TGTTTGCCTG CAAGGCGGAC AAGTACCGCG AACAGGTGCA AACCCAGGTC AAGCTGCTGC AGCAAAGCGG CGAAACCGAC GTGACCTTTC ACATCGTCGT CGGGCTGGCC GGCGGCACCG GCAGCGGCAG CGTGATCGAC GCGGTGGCGC AGCTGCGCGA CCTGTACCCC GACTCGAAAC GCTTTCGCAT CCTGATTTAC GCACTGCTGC CCGACGCCTA TCCGCACCCC AACTGGGACA CGGGCAACTA CCACGCCAAC GGTTTTGCCG CGCTGACCGA ACTCAACGCG ATGTCGGTCG GGGCATACCA GCCTTACGAC GTGACCGGCG TCAAGCAGCG GCTCACGCTC AGCGACCCGT TCAACGGCTG CTACATCTTC GGCAACGAGA ACGAAAACGG GCTGACGGTC GATGTGGACA AGGACTTGCC GGGCATCGTC GCCGACTTTT TGTACCAGAA AATCGTCGCG GTCAACAACG TCAACTGGGC CTCGCTGGGC CGCATGGAGA ACGCGGAAAA CGGCGACGGC TCGCCCGAGA CCGCGCCGCA GGGCCGCATG CCCGAGCGCT CCAAGCGCTT CCTGGCCTTC GGCATCAAGC GGCTGGCGAT TCCCGAAGAG GAAATCAGCG AATACCTGAC CTACAGCTTT GCCCGGCAGG CGGCGCTGCA GCTGCGCTTC AACCACTGGC AGCCGGCGTC GGGCTTCATC GACGAGCCGC GCAAGCTGGA CTTCAACGAG TTCGTGCAGC AAAAGGAAAC GCAGCTGCGC TGGCTGCTGT CGGACGACCA CCTGACGCTG GCGCTCGGCA TCCTGCCCGA GGACGCCGCC AACAAGCGCT GGAAGTCGTT CACCGGCGAG TGGGAGGCGG TGATTCCCAA TTTCAAGTCG CTGGTGCGCG AGCGCGAGCG CGCCACCTGG CTCGACGAGC TGACCAAGCT GTGCGAAAAA CGCTTTCAGG ACGACTACCG CACGCTGGGC GTGGCCGGCT TTTACCGCAC CAAGCTGAAG GCGCGCAAGG ACATGGCGCG CGAGATCCGC ACGCGCATCG AGCAGGAGCT GATCAACGAA TGGAAGGTCG GCGCCAAGTC GGCCTGGGAC GTGTCGCGGC TCCTGATTGC GCTGACCGAA ACGCTGGACG AGCGGCTAAA AGCCTGCGAC GACAACGTGG TGCGTGCGCG CAACGCCGAG GAGCAGGCGC AGTCGCGCGT GCTGGGCAAT CTGCAGAAAT GGTCGGGCAT GAGCCTGCTG TCCAAGCACC TGCTGGGAAC GCCCGACAGC CTGCTGGATG CGCACGGCGT GCATTTGCAG GAAATGTATG TCTATCGGAC CCGAGCCGAA GGCTGGGGCT TTGCCAAGGC GCTGCTGATC GAGGTGATCG CCGAGATCAC CGACCTGAAG GGCGAGGTGG ACCGCGTCGC CACTACCCTG 
CAGCAGGCGC TGAAAAAGTT TGAAGTGGGC ATTCAGGCGC GGCTTAACGA TGCCGGATCG GGCGACCTGC GCCAGCATTT GATCCGCTTT TACGACCCGG TGCAGGTCAA GACCATCAGC CGCCGGCTGG TGCTCGACGA AGCCGAGCAG AAGACGCAAA GCGGCCGGGT GCGCGCCGCG CTGATCGAGA AAATCGGGCC GGATGCGGGC TTTGCGCAGT TCAACCAGCG CGTGCCGGAA TCGGCGTTTC TCGATGTGCT GGAGACGGTG TGCGAGGACA ACGCCCGCAT TGCCCACCAG AACCTGGTGC AAAACCCCAA GGAGCGGCTG CTGGGCGTGT CGATCATCGA CAAGCTGCGT GACCGCTACG GCGCCGACCC GCAGGAACTC AAAAGCTATG TGAACGAGCT GGTCAGCCGC GCCGGCAATT TTGTCGCGCT GGAGCCGCTG GAAATCCACC GCGCCGCGCC GGGCATTCCG CTGGGCGTGC CGACGGCGGT GGGCAAGTTC ACCGTCATCC TGCCGAAGGC GCCGGAACAG GCCGAGTTTT CCCGGGTGCT GAAGGACGCG CTGCGCGAAG CCAAGACCGG CGACGTGGAA ATCATCGAGT CCGATGGCCG GTTGAACGAG ATCACGCTGG TGTCGATCAC CAACCTGCTG CCGCTGCGCT ACCTCAAGCC GCTGAAGTTC CTGGAAGAAA AATACCGCCG CCGCATCGAC ACCGGCGGCG CGCGCGCCCG GCTCGAATTG CACACCGAAG GCGACGGCAA CGCCTGGCCG AGGCTGTTCG TGGCCTCCAG CGCCGAGGTC AAGCTGCAGG CGCTGCCTTA TGTGCTGCTG GCCAAGGCGC TGGGTTTCAT CCATGAAGGC AGGAATCCTG CGACCGGCGC CGACGAAGTG CTGCTGCTGA CCAAGGACGC CGACGGTTTT GACAACGACC CGGTGGCGCT GGGCAAGAGC TTCATGGCCA GCGCCGACAG CATCGACCTG GCGAACCTGC ACACCATCAA GTCGGTGTGC AACGCGATGC TGGCGGGCGC CGCTTATCTG CATCAGGACC GGCGCACCGA AGTGCAGCGG GCCATCCTGG CCGAGGTCGA GGCGGTCAAG GCCGCGCGCG GCGGCAACAT CCAGGACGAA ACCTACCGCC GCTTTCTGGA CGCCGGGCGG CGCGCCGTGG CGATTTTGAA AGGCGAGGCG GCTTGA
|
Protein sequence | MNHFIIGLGG TGGKIIRALR KSLYQEFHGG PPAGVGIGYL YVDSSSEMMA MDDPTWKTLG TSVQLSKASQ LLITDANLTS RLDNLDSYPG LKHWLGSPQE WRDILNSIVG ATLGGQKRRL GRFLFACKAD KYREQVQTQV KLLQQSGETD VTFHIVVGLA GGTGSGSVID AVAQLRDLYP DSKRFRILIY ALLPDAYPHP NWDTGNYHAN GFAALTELNA MSVGAYQPYD VTGVKQRLTL SDPFNGCYIF GNENENGLTV DVDKDLPGIV ADFLYQKIVA VNNVNWASLG RMENAENGDG SPETAPQGRM PERSKRFLAF GIKRLAIPEE EISEYLTYSF ARQAALQLRF NHWQPASGFI DEPRKLDFNE FVQQKETQLR WLLSDDHLTL ALGILPEDAA NKRWKSFTGE WEAVIPNFKS LVRERERATW LDELTKLCEK RFQDDYRTLG VAGFYRTKLK ARKDMAREIR TRIEQELINE WKVGAKSAWD VSRLLIALTE TLDERLKACD DNVVRARNAE EQAQSRVLGN LQKWSGMSLL SKHLLGTPDS LLDAHGVHLQ EMYVYRTRAE GWGFAKALLI EVIAEITDLK GEVDRVATTL QQALKKFEVG IQARLNDAGS GDLRQHLIRF YDPVQVKTIS RRLVLDEAEQ KTQSGRVRAA LIEKIGPDAG FAQFNQRVPE SAFLDVLETV CEDNARIAHQ NLVQNPKERL LGVSIIDKLR DRYGADPQEL KSYVNELVSR AGNFVALEPL EIHRAAPGIP LGVPTAVGKF TVILPKAPEQ AEFSRVLKDA LREAKTGDVE IIESDGRLNE ITLVSITNLL PLRYLKPLKF LEEKYRRRID TGGARARLEL HTEGDGNAWP RLFVASSAEV KLQALPYVLL AKALGFIHEG RNPATGADEV LLLTKDADGF DNDPVALGKS FMASADSIDL ANLHTIKSVC NAMLAGAAYL HQDRRTEVQR AILAEVEAVK AARGGNIQDE TYRRFLDAGR RAVAILKGEA A
|
| |
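The sequences above are displayed in space-separated blocks of 10 characters. As a rough sketch of how such a field might be cleaned before analysis, the snippet below strips the whitespace and computes a GC fraction. The helper names are hypothetical, and the example input is only the first three blocks of the gene sequence, so its GC fraction is not expected to equal the 65% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence from the record.
prefix = clean_sequence("ATGAATCACT TCATCATCGG GCTGGGCGGC")
print(len(prefix))          # 30
print(gc_fraction(prefix))  # GC fraction of this 30-base prefix only
```

Applied to the complete Gene sequence field, `len(clean_sequence(...))` should agree with the Gene Length field if the field was copied without loss.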