Gene Pnap_3705 details


Gene Information

Locus tag: Pnap_3705
Symbol:
ID: 4689256
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3945047
End bp: 3948364
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 69%
IMG OID: 639836723
Product: filamentous haemagglutinin outer membrane protein
Protein accession: YP_983922
Protein GI: 121606593
COG category:
COG ID:
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
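
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3945047
end_bp = 3948364
gene_length_bp = 3318
protein_length_aa = 1105

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted in the protein length.
codons = span // 3

print(span == gene_length_bp)           # span matches the stated gene length
print(span % 3 == 0)                    # the CDS is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop codon match the protein length
```

Under these assumptions the three fields agree: 3318 bp is 1106 codons, of which 1105 encode amino acids.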


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.249518
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCACA TTCACCGCTC CATCTGGAAC GACCAGACCG GCACTTTTGT TGCCGTTTCA 
GAAAACACCC GCAGCGCCGG CAAGAAAATC TCGTCCTGCG CCTCGGCGGC TGGCGGCGGC
GCCCACTTCG GCCTGAAAAC CCTGGCGGTT TGCCTGATGC TGGCCTGCGG TGCGGGCGTG
CATGCCCAGC CCACGGGCGG CGTGGTGTCG GCGGGCAGCG CCACCATTGG CGGCACGCCA
GGCGCCATGA CCATCACCCA GACCACGCCC AACGTGGCGA TCAACTGGCA GAGCTTTGGC
ATCAGGGCCG GCGAGTCGGT CCAGTTCGTG CAGCCGGGCA GCAACTCGGT CGCGCTCAAC
CGGGTCGTCG GCGCCGATCC TTCGAGCATC CTGGGCAGCT TGAGTTCCAA CGGCAAGGTG
TTCCTGGTCA ACCCGAACGG CATCTTGTTC GGCCAGGGCG CGTCGGTCAA CGTGGGCGGG
CTGGTGGCGT CCACCCTGGC GATCAGCGAT GCCAATTTCA TGGCGCGCAA CTACCAGTTT
TCCGGCGCCG GCACGGGCAG CGTGGTGAAC CAGGGGGCGA TCAACGCCGC CAGCGGCGGC
TACGTGGCCC TGCTGGGCGC GAACGTCAGC AACCAGGGCG TGATTGCGGC CCAGCTGGGC
ACCGTGGCGC TGGCTGCGGG CAACGCCGTC ACGCTTGACG TTGCAGGCGA CAAGCTCTTG
AGCGTCACGG TCGATCAGGG CGCAGTCAAC GCGCTGGTTG ACAACGGCGG CCTGCTCCGG
GCCGACGGCG GCCAAGTGCT GATGACCACC CAGGCAGCGG GCAGCCTGCT GTCCAACGCG
GTCAACAACA CCGGCGTGGT GCAGGCGCAG ACGCTGCAAA ACGTCAACGG CACCATCAAG
CTGCTGGGCG GCATGGAAAA CGGCACGGTC AGCGCAGGCG GCACACTGGA CGCTTCCGGC
ACAGGCGCGG GTCAGACGGG CGGCAGCGTC ACGGTGACCG GCCATCACGC CGGCCTGTTT
GGCGGGCAGG TCAATGCCTC CGGCGACGCA GGCGGCGGCA CCGTGCTGAT CGGCGGCGAC
TACCAGGGTA AAAACCCCGC CGTCCAGAAC GCCGCAGCGA CCTACATGAG CGCCGACGCC
AGCATCCGCG CCGACGCCAT CACCAGCGGC AACGGCGGCA AGGTCGTGCT GTGGGGCAAT
GACTCGACAC GCGCTTACGG CAGCATCACA GCGCGCGGCG GCGCACAGGG CGGCAACGGC
GGCTTGATCG AAACTTCCGG CCATGCGCTG GACGTGGCGG GCATCCAGAT CAACGCCTCC
GCCGCCAACG GCCTGGCCGG CCTGTGGCTG CTCGACCCAG CCGATGTGAC GATCACGAAT
GCGGCCACCA GCGGCGGAAC CTTGACCGGC AACGTCTTCA CCCCGAACCC CAGCGCCAGC
ACGGCCAATG TGAATGTCGC CGACATCGTG ACCGCACTGA CCGCCGGCAC CAACGTCACC
ATCACCACAG CCAACACCAG TGGAGCCGGC AATGGCGATA TCACCGTTGA CACGCCCATC
ACCTGGACCA AAACGCCGGG CGCCTTACCC GTCTCAACGC TGACCCTGAA TGCCGTTCAG
GATGTGATCG TGAACGAGGC GATTACCGCC ACTTGGGGAA GCCTGGTCAC GACGGCCGGA
CGTGATGTGA CGCTGTTTGC GCCCGTCACG ACGACCAACG GCAGCTTTAC GGCAAACGCC
GGGCGCGACG TCAACCTCGT GAATAACGCT GCGCATCCCT TGACGGCGAT TACGACGACC
GATGGAAATT TCAGGGCGAA CGCCGGAGGC AATGTGAACG TCGAGACGGT CGCCTTGACG
ACAACCCGGG GTAACCTGAT CCTGGTGGCC GGTATAGATG GCAACACCAG CGGCAGTGTC
GTTTTTGCGG CTGGCACCCC CGTGGCTGCC GTCACAGGCC CAAATGCAGC GGTCATGATT
TACAACCCGA TAGGAACGCC ACCGACGGAT TATTCGGGTA ACTTTACGTT GACTTCTGGC
GCCACGCTGA CGTCAGGTGT CTTTTTGCCA GGGATTCCGG GTGCTACGGG TGCTACGGGT
GGCGTTGGCG CCACGGGCGC GACCGGGGCC ACGGGCACGA CCGGTGCGAC CGGCGGCGTT
GGCGCGACCG GTACGACCGG GGCCACGGGA ACGACCGGCG GCGTTGGCGC GACCGGCGCA
ACCGGGGCCA CGGGAACAAC CGGCGGCGTT GGCGCGACCG GCGCAACCGG GGCCACGGGA
ACAACCGGCG GCGTTGGCGC GACCGGTACG ACCGGGGCCA CGGGAACGAC CGGCGGCGTT
GGCGCGACCG GCGCAACCGG GGCCACGGGA ACAACCGGCG GCGTTGGCGC GACCGGTACG
ACCGGGGCCA CGGGAACGAC CGGCGGCGTT GGCGCGACCG GCGCGACCGG GGCCACGGGA
ACGACCGGCG GCGTTGGCGC GACCGGCGCG ACCGGGGCCA CGGGAACGAC CGGCGGCGTT
GGCGCGACCG GCGCGACCGG GGCCACGGGA ACGACCGGCG GCGTTGGCGC GACCGGTGCG
ACCGGGGCCA CGGGAACGAC CGGCGGCGTT GGCGCAACCG GTGCGACCGG GGCCACGGGA
ACGACCGGCG GCGTTGGCGC AACCGGTGCG ACCGGGGCCA CGGGAACGAC CGGCGGCGTT
GGCGCAACCG GCGCGACCGG GGCCACGGGA ACGACCGGCG GCGTTGGCGC AACCGGCGCA
ACCGGGGCCA CGGGAACGAC CGGCGGCGTT GGCGCAACCG GCGCAACCGG GGCGACCGGT
GCCACGGGAA CAACCGGCGG CGTTGGCGCG ACCGGTGCGA CCGGTGCGAC CGGGGCCACG
GGAACAATCG GCGGCGTTGG CGCGACCGGC GCGACCGGCG CGACCGGCGC GACCGGCGCG
ACCGGCGCGA CCGGCGCGAC CGGGGCCACG GGCACGACCG GCGGCGTTGG CGCAACCGGT
GCGACGGGAA CAACTGGCGC AACGGGAACG ACCGGCGCTA CCGGACCAGC GGCCCATATT
CCTCCTATCG TCTTCCCACC GCCTGCATTT CCACAAAAGC CGCCTCTAAA CGTGCTGCCG
CCGGTGTTGC CGGGCACTTG GGTGCCCACG GTAGTGGTCG CCCCGCTGCC CCCAGAGCTT
CTGACCGTCG TACCGCCGAC ACCGCCAGCC GTGCCCCCAT CCGTTATGCC GGAGTCGCCG
CCAGTGGTCT TGCCGGTCGT GATACCGTCC AGGCCCTACA TGCCAACCAA GCGCCTGCCT
AAACAGGAAC GCAACTAG
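
The GC content field reported for this gene is the percentage of G and C bases in the sequence above. A minimal sketch of that calculation follows; `gc_percent` is a hypothetical helper name, and the sketch assumes the sequence is pasted as plain text with the 10-base blocks separated by whitespace, as shown here.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example input: the first row of the gene sequence above.
# Pass the full sequence instead to compare against the reported GC content.
first_row = "ATGAACCACA TTCACCGCTC CATCTGGAAC GACCAGACCG GCACTTTTGT TGCCGTTTCA"
print(round(gc_percent(first_row), 1))
```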
 
Protein sequence
MNHIHRSIWN DQTGTFVAVS ENTRSAGKKI SSCASAAGGG AHFGLKTLAV CLMLACGAGV 
HAQPTGGVVS AGSATIGGTP GAMTITQTTP NVAINWQSFG IRAGESVQFV QPGSNSVALN
RVVGADPSSI LGSLSSNGKV FLVNPNGILF GQGASVNVGG LVASTLAISD ANFMARNYQF
SGAGTGSVVN QGAINAASGG YVALLGANVS NQGVIAAQLG TVALAAGNAV TLDVAGDKLL
SVTVDQGAVN ALVDNGGLLR ADGGQVLMTT QAAGSLLSNA VNNTGVVQAQ TLQNVNGTIK
LLGGMENGTV SAGGTLDASG TGAGQTGGSV TVTGHHAGLF GGQVNASGDA GGGTVLIGGD
YQGKNPAVQN AAATYMSADA SIRADAITSG NGGKVVLWGN DSTRAYGSIT ARGGAQGGNG
GLIETSGHAL DVAGIQINAS AANGLAGLWL LDPADVTITN AATSGGTLTG NVFTPNPSAS
TANVNVADIV TALTAGTNVT ITTANTSGAG NGDITVDTPI TWTKTPGALP VSTLTLNAVQ
DVIVNEAITA TWGSLVTTAG RDVTLFAPVT TTNGSFTANA GRDVNLVNNA AHPLTAITTT
DGNFRANAGG NVNVETVALT TTRGNLILVA GIDGNTSGSV VFAAGTPVAA VTGPNAAVMI
YNPIGTPPTD YSGNFTLTSG ATLTSGVFLP GIPGATGATG GVGATGATGA TGTTGATGGV
GATGTTGATG TTGGVGATGA TGATGTTGGV GATGATGATG TTGGVGATGT TGATGTTGGV
GATGATGATG TTGGVGATGT TGATGTTGGV GATGATGATG TTGGVGATGA TGATGTTGGV
GATGATGATG TTGGVGATGA TGATGTTGGV GATGATGATG TTGGVGATGA TGATGTTGGV
GATGATGATG TTGGVGATGA TGATGTTGGV GATGATGATG ATGTTGGVGA TGATGATGAT
GTIGGVGATG ATGATGATGA TGATGATGAT GTTGGVGATG ATGTTGATGT TGATGPAAHI
PPIVFPPPAF PQKPPLNVLP PVLPGTWVPT VVVAPLPPEL LTVVPPTPPA VPPSVMPESP
PVVLPVVIPS RPYMPTKRLP KQERN
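
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below illustrates this, assuming the codon assignments of NCBI table 11 (which match the standard code for elongation codons); `translate` and `CODON_TABLE` are illustrative names, and the sketch does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAACCACA TTCACCGCTC CATCTGGAAC"))  # MNHIHRSIWN
# Last 18 bases end in the TAG stop codon.
print(translate("AAACAGGAAC GCAACTAG"))  # KQERN
```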