Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_4014 |
Symbol | |
ID | 4689981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | + |
Start bp | 4278440 |
End bp | 4281310 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639837028 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_984227 |
Protein GI | 121606898 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.893958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA CCCACGCCAT CATTGCCGGG GCCCTGTTGG CGACCGCCGT CATGCTCGCC GGCGGCGCCA GCGCCCAGAC CGTCGTCAAC GGCCAGGCCA GCTTTCAGCA GATTGGCAAT GTCCGCACCA TCACCAACAC GCCCGGCACC ATCATCCAGT GGCCCGGCTT CTCGATTGGG GCGGGCGAGG TTACCCGCTT CGTCCAGCAA AACGCTGCGA GCGCGGTGCT CAACCGCATC ACCGGACAGG AGCCGTCCTT GATCCTGGGC GCCTTGCAGT CGAACGGCCG GGTTTTCCTG GTCAACCCGA ACGGCGTGCT GTTCGGCGCC GGTTCGCGGG TGGACGTGAA CGGGCTGGTG GCTTCGAGCC TGACGATTTC CAACAGTGAT TTCCTGGCCG GCAAGATGAA CTTCAGCGCT GGCGCCGTGG CGGGCCATGT TGTCAACCAG GGGAGCATTT CAACGCCCGG AGGCGGCCAG GTGATCCTGA TTGCGCCGCA AGTCGGGAAC AGCGGCCTCA TCCATTCGCC CGGCGGCGAG GTCGTGCTGG CCGCAGGCCG CAGCGTGAAG CTGGCCGACA GCCAGAACCC GGCCCTGCAT GTGGTGGTCA GCGCGCCGCA GGACCAGGCC GTCAACCTGG GCCAGATCGT GGCGCAAAGC GGCCGCATCG GCATCTTCGG CAACCTGGTC AACCAGCGCG GCCTGGTCAG CGCCGACCAG GCTGCAATGG GTGCCAACGG GCAGATCGTG CTCAAGGCCA GCGGCGACCT GCTGCTCGAA GCCGGCAGCC TGACCAGCGC GAGCGGCGCC GGTGACAGCA CCGGCGGCAC GATCCACCTT TTGGGCGAGC GCGTGGGCCT GACGGGCAAC GCCCGCGTGG ACGCCAGCGG CCAGGCCGGC GGCGGCACCG TGCTGATCGG CGGCGACTAC CGCGGCCAGA ACCCGGCCAT CGCCAACGCG AAGCAGGTCT ACGTCTCGGG CGGTGCCACC ATCCGCGCCG ACGCCCTGGC GTCGGGCAAC GGCGGCAAGG TGATCGCCTG GTCCGACGGC ACGACACGCG TTTATGGCAG CATCAGCGCC CGCGGCGGTG CGCAATCGGG CAACGGCGGC TTCGTCGAAA CATCGGGCCA CACGCTGGAC ATGCGGGGCC GGGTCGATAC GCGCGCGCCG AACGGCCGCA CCGGCACGCT GCTGCTGGAC CCGACCAACC TCTACATCGC GAACGACCAG GCCAGCGCCG CCAGCGCGGG CATGACGGGC AGCGACACTT CAGCCGAAAC CTTTGTCGCT TCTGGCCCCG TGAGCGATTC GCTCCTGACC GTCGCCACCC TGGAGGCTGC GCTTGCAAAC AACATGGTGA CCGTGACCAC CGACAACGCA TCGGGAACGG GCGCCGGCAT GATTCACGTG GTCAACCCCG TGACGTGGGC CAGCGACACG GGCCTGACGC TGCATGCCAA CGCGGGCATC GAGATCAACG CGGCCCTCAC TGGCGGGCGC GGCAGCCGGC TGGAGCTGTA CACGGCTTCG GGCAACATCA ACCAGACGGC GCCCCTCAGC GTGGTCGCAC TGTCGGCCCG CGCCGACAAC GGCTCGGTCA ATCTCACCCA TCGGGACAAC CAGGTTGAGC GACTGGCCGG TTTTGCCAAC GGGGCCGGCG GATTCAACTA CCGCGGCAAC GGCACGCCGC TGGTCATCGG CATGGCCGGC CCTGAAGACG GCATCACTTC GCTGGGCAGC GGACCCATCA ACGTCGCTGT CACAGGCGAC CTCACACTGC AAAGCCCGGT CAGCTCGCAG GCGGGCGACA TCGTTCTGGC CGCCGGCAAC TTTGACAACC AGTCCACCGT GGACTCGGTC AGCGGCCGCG TGACGGTGCA GGGCGCCGTG CTGCGTCCTT CGCTGGCCGA CTGCATCGCC AACACGGCGC TGGCCGGCTG CACGGCCGTG CTGCCCATCC TGGCGGCCTG CCAGGCAGAT CCGGCCATTC CCGGCTGCTC GGCTGTGCTG CCACCCCCCA CGCTGGACGC CTGCATCGCC GCCCCGGCCA CAAGCGGTTG CGCTGCGGTG CTGCCAACGC TGACCCAATG CACGATCGCA CCTTCGACGG CCGGCTGCTC GGTGGTGTTG CCGACCTTGG CCTCATGCAT TGCCTCACCG ACGCTTGCGG GCTGCGCTGC GGTGCTGCCT ACGCTGGCTC AATGCGTCGC CTCGCCGGCG CTTGCGGGTT GCGCTGCGGT GCTGCCCACG CTGAGTCAAT GCATCGCCTC GCCGGCGCTT GCGGGCTGCG CTGCGGTGCT GCCTACGCTG AGCCAGTGCA CCATTACACC TTTGACAGCG GGGTGCTCGG TGGTGCTGCC GTCCCTGAAC GCATGCGTCG CCTCGCCGGC GCTTGCGGGC TGCGCTGCGG TGCTGCCCGC GCTGAGCCAA TGCATCGCCT CGCCGGCGCT CGCGGGCTGC ACTACGGTGC TGCCCACGCT GGCCCAATGC GTCGCCTCGC CGGCGCTTGC GGGCTGCGCT GCGGTGCTGC CCAGGCTGAG TCAATGCGTC GCCTCGCCGA CGCTTGCGGG CTGCTCGGTG GTGCTGCCGA CACTGGCTCA GTGCACGGCC TCTCCTACCT TGCAGGGCTG CTCGGCCGTG CTGCCGCCAG CGTCTATCTG TGTCAGCAAC CCTGCCAGCC CGGGTTGCGC GGTCGTGGTG CCGCCTGTCC AGATCGGCGC CAACTCACCT GTGGACCAGG CTTTGAATGC GACGATCAAC ATCATCAACA CCAGCGGCAA CGGCACTGCG CTGCTGACCC AGCTATTCAC TCAGCCAAAT GATGCCAGGG GCACAGCAGA TGCGTCAATC AAGAAAACAT TTTGCAACTG A
|
Protein sequence | MKTTHAIIAG ALLATAVMLA GGASAQTVVN GQASFQQIGN VRTITNTPGT IIQWPGFSIG AGEVTRFVQQ NAASAVLNRI TGQEPSLILG ALQSNGRVFL VNPNGVLFGA GSRVDVNGLV ASSLTISNSD FLAGKMNFSA GAVAGHVVNQ GSISTPGGGQ VILIAPQVGN SGLIHSPGGE VVLAAGRSVK LADSQNPALH VVVSAPQDQA VNLGQIVAQS GRIGIFGNLV NQRGLVSADQ AAMGANGQIV LKASGDLLLE AGSLTSASGA GDSTGGTIHL LGERVGLTGN ARVDASGQAG GGTVLIGGDY RGQNPAIANA KQVYVSGGAT IRADALASGN GGKVIAWSDG TTRVYGSISA RGGAQSGNGG FVETSGHTLD MRGRVDTRAP NGRTGTLLLD PTNLYIANDQ ASAASAGMTG SDTSAETFVA SGPVSDSLLT VATLEAALAN NMVTVTTDNA SGTGAGMIHV VNPVTWASDT GLTLHANAGI EINAALTGGR GSRLELYTAS GNINQTAPLS VVALSARADN GSVNLTHRDN QVERLAGFAN GAGGFNYRGN GTPLVIGMAG PEDGITSLGS GPINVAVTGD LTLQSPVSSQ AGDIVLAAGN FDNQSTVDSV SGRVTVQGAV LRPSLADCIA NTALAGCTAV LPILAACQAD PAIPGCSAVL PPPTLDACIA APATSGCAAV LPTLTQCTIA PSTAGCSVVL PTLASCIASP TLAGCAAVLP TLAQCVASPA LAGCAAVLPT LSQCIASPAL AGCAAVLPTL SQCTITPLTA GCSVVLPSLN ACVASPALAG CAAVLPALSQ CIASPALAGC TTVLPTLAQC VASPALAGCA AVLPRLSQCV ASPTLAGCSV VLPTLAQCTA SPTLQGCSAV LPPASICVSN PASPGCAVVV PPVQIGANSP VDQALNATIN IINTSGNGTA LLTQLFTQPN DARGTADASI KKTFCN
|
| |