Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_2409 |
Symbol | |
ID | 4686892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | + |
Start bp | 2543124 |
End bp | 2545781 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639835419 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_982637 |
Protein GI | 121605308 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGTT CGGCAGCGGG ATTCATGCTG ATAGTTTGCC TCGTGCTGGC CAGTTCCTGC GGCGGCGGCA GCGATGCTTC GGCGCCCGCA CCCGCCGAGG CTGATCCGGT TCAGACGGCC AGTACGGCAG TGGCAAAAAT CAATGCCGCG CAAGGCGTGC TGGAGCTGCC GCTGCTGCAG TCGGGCGCTG CGCTGTATGC CGATGTGCGC ATCCGGTTTT CACTCGACGG AAATTTTGAG CTGCTGTCAT GGCGGCCGGT CACATCGGCT GTTGCACCCG TGGACGCCGA TTTACAGCCG CAAGTTGCAT TGGCAGACCT GCAGCGGTCC AGCCAGCCCC TGAAGCTGAA CATCCGCCGG CTGCACATGG ATGCGGCGGT CTATGAGGCG GTGAGCGTGG AACTTCAAAA CGGCCGCTGG CGCTACCCGC AGCCGCTGGT GCCCGCCACC ACCCTGACCT CGAAAGACCT GCGCGCCAAC CCTGCCCTGT TTGCCCGTGA CGACCATCTG GTCGTGATGG AATCGAGTCC CGGCCGGTCG GAAGAATTTT CGCTGCGGCT GGAGGCCAGG CGCTACCGCT TCTGCATGGA CCCGCAGGAC GAGGGCGCTG ACAGCATTAC TTTGCTCGAT GCAAGCGGCG CCAGCCTGCT GACCATCAAG GCCGGAGGGC CTTGCGCCAC CCTTCAGGCC GAACAGGGCT TGTACAGGAT TCGCCAGCGC TACGGCGGCA CCGGCGCCAG CCGCACGATG TTCATGCGCC GCCAGCCGGC GGGGCAAGCC GTGTTAACCC AGCCGCCCGG CAGCCAGCCG CCGCTCAAGG ACGCTGGTGT TGAAGAGTAC TGGGGAATTC TGGCCAGCTT CATTGACCCG ACGACACAAA AGGCAAGCAG CAGCGGCTTT TTGGGACTGT CGGGCTGGCG GCCTGGCGGC CCGGAGTTTC CCGACATACA AGGGTGCGTG GGATATATCC ATGCCTCGGT CCAGTCATTG AAGGCCAACC ACACCAACAG CTATTCAGCG AGCACCCTGC TGGATGGACT GAATTTTTTC AAGCGCATCA ATGACAGCCA GGGAAACCCC ACGCAGCTCG GGGCACCGCT TTCATGCAAT GGCGGACTGG TGGGCCTGGA GGCATCCCCA ACACTGGCTG TTGCTTTGAT TGCCTCGGGT GACGTTCTCC GCCTCACCGG CACAATTCAG ATCGACAGTT TTTCCACCGC GACCAATGCG TTCATTTTGA AATCTGTCCG AGACGGCTTT GCATCCCAGG CGATGGGCTT TCCCTACGGC GTCAACGGGA TGGTGCCCAA CCGCCCGGAG GAACTAGGCG TCCTGGTGAC TTTGACGCCC GATCAAATAG CGCAATACCA GCCCGAGTAC CGCACCGTGC TGCGCTACCG GCCCGGCGGC TTTGCAGGCA GCAGCCTGCC CGGCCAAGGC CAAGTGGCCT TGTTCAACAC CCAGGACTGC AGCGGCGCCG CCATGGTGGT CGATCACTAC GACCTGCCCG GCATCATGCC CGGCGGCCCG CTGGGCAATT TTGACGGCTC CTTGAAACTC GGCACCCTGA CCACGGCCAC GGTCTATTCG GCCATGTCCC AGCAAGGCGA GCAACAGCAT TTGAACCGGT CTGGCTGCAT TGCATCCGGC TGGGGCAGCG CGGGCTGGAA GGCAGCCTCG ATTGCCATCA ACGTCGATAC CATCGAGATG GTGCTGTCCG ACAAGAAGTG CGAGCAGTGC AATCTGTCGG GCATCGACCT GACGGGCAGG TCGATGCCCG ACGTGAAACT CTACGGCGCC AACCTCAACA ATGCGCATCT GGCCGGCAGC GACCTGTCGG GCGCCGACCT GCGCTACGCC TCGCTGCAGG GCGCGCAACT GCCGAATGCG AATCTGGACG CCGCCAACCT GTGCGCGGCC AACCTCAATG CCGCGCCGTC AACCGCCGGC GTCTCCAATG TCGCGGCCAA CCTGACCGGC GCCTACCTGC GCAACGCCAG CCTGTATGGC AGCAACATGG CCGGGGCCAA CTTCAGCAAT GCGAGTTTTT ACAGCACCAG CCAGGCCGCG TGCCAGCCTT CGGACTGCGG CGCCTACCAA AAGCCGGCGT GCGCCAACGC CTACGGCGCC AAGCTGGACA GCGCCAAATT CAGCAGCGCC TACCTGGTGG GCGTGGACAT GAGCAATGTC GCCGGCAGCG CCGCCGACTT CAGCAATGCC GTGCTCAATG GCGCATCGTT TCGCAATGCC ACGCTGACCC CCGACTTCAA CGGCACGCCG GCCAATTTCA GCAACGCCTT CTTGCAGGGC GCCGATTTCA CCGGCGCCAA CCTCGTCAAT CCGATTTTCA CCAATGCCTA CGGCGACGCC AACCCGAATG GCGGCTGCAT GCAGTTCGAA CTCGGTGCGT CCTACACCTC GTTTCCCGGT TTTGCCGTGC CGACGACGCC GCCGCCCAGC ACCGCATGCG TCAGGTCCGC GCCGGTGCAA ACCTGCGTGC AGTTCACCTA CAACAAGCCC ACGCTCGGCC TGGTGCTGGA CCCGCCGGCC ACGCCCCTGA GCCAGGCGCG CCCGAAAAAC ACCAGCTGCT TTGCCGAGGC GCCGCTGTGC GGAGACCCGT TCATCGGGGT TGGTCCCAAT ACCTGCTGGT CGCCGTGA
|
Protein sequence | MTRSAAGFML IVCLVLASSC GGGSDASAPA PAEADPVQTA STAVAKINAA QGVLELPLLQ SGAALYADVR IRFSLDGNFE LLSWRPVTSA VAPVDADLQP QVALADLQRS SQPLKLNIRR LHMDAAVYEA VSVELQNGRW RYPQPLVPAT TLTSKDLRAN PALFARDDHL VVMESSPGRS EEFSLRLEAR RYRFCMDPQD EGADSITLLD ASGASLLTIK AGGPCATLQA EQGLYRIRQR YGGTGASRTM FMRRQPAGQA VLTQPPGSQP PLKDAGVEEY WGILASFIDP TTQKASSSGF LGLSGWRPGG PEFPDIQGCV GYIHASVQSL KANHTNSYSA STLLDGLNFF KRINDSQGNP TQLGAPLSCN GGLVGLEASP TLAVALIASG DVLRLTGTIQ IDSFSTATNA FILKSVRDGF ASQAMGFPYG VNGMVPNRPE ELGVLVTLTP DQIAQYQPEY RTVLRYRPGG FAGSSLPGQG QVALFNTQDC SGAAMVVDHY DLPGIMPGGP LGNFDGSLKL GTLTTATVYS AMSQQGEQQH LNRSGCIASG WGSAGWKAAS IAINVDTIEM VLSDKKCEQC NLSGIDLTGR SMPDVKLYGA NLNNAHLAGS DLSGADLRYA SLQGAQLPNA NLDAANLCAA NLNAAPSTAG VSNVAANLTG AYLRNASLYG SNMAGANFSN ASFYSTSQAA CQPSDCGAYQ KPACANAYGA KLDSAKFSSA YLVGVDMSNV AGSAADFSNA VLNGASFRNA TLTPDFNGTP ANFSNAFLQG ADFTGANLVN PIFTNAYGDA NPNGGCMQFE LGASYTSFPG FAVPTTPPPS TACVRSAPVQ TCVQFTYNKP TLGLVLDPPA TPLSQARPKN TSCFAEAPLC GDPFIGVGPN TCWSP
|
| |