Gene Pnap_4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4014 
Symbol 
ID4689981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4278440 
End bp4281310 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content68% 
IMG OID639837028 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_984227 
Protein GI121606898 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.893958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CCCACGCCAT CATTGCCGGG GCCCTGTTGG CGACCGCCGT CATGCTCGCC 
GGCGGCGCCA GCGCCCAGAC CGTCGTCAAC GGCCAGGCCA GCTTTCAGCA GATTGGCAAT
GTCCGCACCA TCACCAACAC GCCCGGCACC ATCATCCAGT GGCCCGGCTT CTCGATTGGG
GCGGGCGAGG TTACCCGCTT CGTCCAGCAA AACGCTGCGA GCGCGGTGCT CAACCGCATC
ACCGGACAGG AGCCGTCCTT GATCCTGGGC GCCTTGCAGT CGAACGGCCG GGTTTTCCTG
GTCAACCCGA ACGGCGTGCT GTTCGGCGCC GGTTCGCGGG TGGACGTGAA CGGGCTGGTG
GCTTCGAGCC TGACGATTTC CAACAGTGAT TTCCTGGCCG GCAAGATGAA CTTCAGCGCT
GGCGCCGTGG CGGGCCATGT TGTCAACCAG GGGAGCATTT CAACGCCCGG AGGCGGCCAG
GTGATCCTGA TTGCGCCGCA AGTCGGGAAC AGCGGCCTCA TCCATTCGCC CGGCGGCGAG
GTCGTGCTGG CCGCAGGCCG CAGCGTGAAG CTGGCCGACA GCCAGAACCC GGCCCTGCAT
GTGGTGGTCA GCGCGCCGCA GGACCAGGCC GTCAACCTGG GCCAGATCGT GGCGCAAAGC
GGCCGCATCG GCATCTTCGG CAACCTGGTC AACCAGCGCG GCCTGGTCAG CGCCGACCAG
GCTGCAATGG GTGCCAACGG GCAGATCGTG CTCAAGGCCA GCGGCGACCT GCTGCTCGAA
GCCGGCAGCC TGACCAGCGC GAGCGGCGCC GGTGACAGCA CCGGCGGCAC GATCCACCTT
TTGGGCGAGC GCGTGGGCCT GACGGGCAAC GCCCGCGTGG ACGCCAGCGG CCAGGCCGGC
GGCGGCACCG TGCTGATCGG CGGCGACTAC CGCGGCCAGA ACCCGGCCAT CGCCAACGCG
AAGCAGGTCT ACGTCTCGGG CGGTGCCACC ATCCGCGCCG ACGCCCTGGC GTCGGGCAAC
GGCGGCAAGG TGATCGCCTG GTCCGACGGC ACGACACGCG TTTATGGCAG CATCAGCGCC
CGCGGCGGTG CGCAATCGGG CAACGGCGGC TTCGTCGAAA CATCGGGCCA CACGCTGGAC
ATGCGGGGCC GGGTCGATAC GCGCGCGCCG AACGGCCGCA CCGGCACGCT GCTGCTGGAC
CCGACCAACC TCTACATCGC GAACGACCAG GCCAGCGCCG CCAGCGCGGG CATGACGGGC
AGCGACACTT CAGCCGAAAC CTTTGTCGCT TCTGGCCCCG TGAGCGATTC GCTCCTGACC
GTCGCCACCC TGGAGGCTGC GCTTGCAAAC AACATGGTGA CCGTGACCAC CGACAACGCA
TCGGGAACGG GCGCCGGCAT GATTCACGTG GTCAACCCCG TGACGTGGGC CAGCGACACG
GGCCTGACGC TGCATGCCAA CGCGGGCATC GAGATCAACG CGGCCCTCAC TGGCGGGCGC
GGCAGCCGGC TGGAGCTGTA CACGGCTTCG GGCAACATCA ACCAGACGGC GCCCCTCAGC
GTGGTCGCAC TGTCGGCCCG CGCCGACAAC GGCTCGGTCA ATCTCACCCA TCGGGACAAC
CAGGTTGAGC GACTGGCCGG TTTTGCCAAC GGGGCCGGCG GATTCAACTA CCGCGGCAAC
GGCACGCCGC TGGTCATCGG CATGGCCGGC CCTGAAGACG GCATCACTTC GCTGGGCAGC
GGACCCATCA ACGTCGCTGT CACAGGCGAC CTCACACTGC AAAGCCCGGT CAGCTCGCAG
GCGGGCGACA TCGTTCTGGC CGCCGGCAAC TTTGACAACC AGTCCACCGT GGACTCGGTC
AGCGGCCGCG TGACGGTGCA GGGCGCCGTG CTGCGTCCTT CGCTGGCCGA CTGCATCGCC
AACACGGCGC TGGCCGGCTG CACGGCCGTG CTGCCCATCC TGGCGGCCTG CCAGGCAGAT
CCGGCCATTC CCGGCTGCTC GGCTGTGCTG CCACCCCCCA CGCTGGACGC CTGCATCGCC
GCCCCGGCCA CAAGCGGTTG CGCTGCGGTG CTGCCAACGC TGACCCAATG CACGATCGCA
CCTTCGACGG CCGGCTGCTC GGTGGTGTTG CCGACCTTGG CCTCATGCAT TGCCTCACCG
ACGCTTGCGG GCTGCGCTGC GGTGCTGCCT ACGCTGGCTC AATGCGTCGC CTCGCCGGCG
CTTGCGGGTT GCGCTGCGGT GCTGCCCACG CTGAGTCAAT GCATCGCCTC GCCGGCGCTT
GCGGGCTGCG CTGCGGTGCT GCCTACGCTG AGCCAGTGCA CCATTACACC TTTGACAGCG
GGGTGCTCGG TGGTGCTGCC GTCCCTGAAC GCATGCGTCG CCTCGCCGGC GCTTGCGGGC
TGCGCTGCGG TGCTGCCCGC GCTGAGCCAA TGCATCGCCT CGCCGGCGCT CGCGGGCTGC
ACTACGGTGC TGCCCACGCT GGCCCAATGC GTCGCCTCGC CGGCGCTTGC GGGCTGCGCT
GCGGTGCTGC CCAGGCTGAG TCAATGCGTC GCCTCGCCGA CGCTTGCGGG CTGCTCGGTG
GTGCTGCCGA CACTGGCTCA GTGCACGGCC TCTCCTACCT TGCAGGGCTG CTCGGCCGTG
CTGCCGCCAG CGTCTATCTG TGTCAGCAAC CCTGCCAGCC CGGGTTGCGC GGTCGTGGTG
CCGCCTGTCC AGATCGGCGC CAACTCACCT GTGGACCAGG CTTTGAATGC GACGATCAAC
ATCATCAACA CCAGCGGCAA CGGCACTGCG CTGCTGACCC AGCTATTCAC TCAGCCAAAT
GATGCCAGGG GCACAGCAGA TGCGTCAATC AAGAAAACAT TTTGCAACTG A
 
Protein sequence
MKTTHAIIAG ALLATAVMLA GGASAQTVVN GQASFQQIGN VRTITNTPGT IIQWPGFSIG 
AGEVTRFVQQ NAASAVLNRI TGQEPSLILG ALQSNGRVFL VNPNGVLFGA GSRVDVNGLV
ASSLTISNSD FLAGKMNFSA GAVAGHVVNQ GSISTPGGGQ VILIAPQVGN SGLIHSPGGE
VVLAAGRSVK LADSQNPALH VVVSAPQDQA VNLGQIVAQS GRIGIFGNLV NQRGLVSADQ
AAMGANGQIV LKASGDLLLE AGSLTSASGA GDSTGGTIHL LGERVGLTGN ARVDASGQAG
GGTVLIGGDY RGQNPAIANA KQVYVSGGAT IRADALASGN GGKVIAWSDG TTRVYGSISA
RGGAQSGNGG FVETSGHTLD MRGRVDTRAP NGRTGTLLLD PTNLYIANDQ ASAASAGMTG
SDTSAETFVA SGPVSDSLLT VATLEAALAN NMVTVTTDNA SGTGAGMIHV VNPVTWASDT
GLTLHANAGI EINAALTGGR GSRLELYTAS GNINQTAPLS VVALSARADN GSVNLTHRDN
QVERLAGFAN GAGGFNYRGN GTPLVIGMAG PEDGITSLGS GPINVAVTGD LTLQSPVSSQ
AGDIVLAAGN FDNQSTVDSV SGRVTVQGAV LRPSLADCIA NTALAGCTAV LPILAACQAD
PAIPGCSAVL PPPTLDACIA APATSGCAAV LPTLTQCTIA PSTAGCSVVL PTLASCIASP
TLAGCAAVLP TLAQCVASPA LAGCAAVLPT LSQCIASPAL AGCAAVLPTL SQCTITPLTA
GCSVVLPSLN ACVASPALAG CAAVLPALSQ CIASPALAGC TTVLPTLAQC VASPALAGCA
AVLPRLSQCV ASPTLAGCSV VLPTLAQCTA SPTLQGCSAV LPPASICVSN PASPGCAVVV
PPVQIGANSP VDQALNATIN IINTSGNGTA LLTQLFTQPN DARGTADASI KKTFCN