Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3798 |
Symbol | |
ID | 8254932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | - |
Start bp | 4557901 |
End bp | 4562481 |
Gene Length | 4581 bp |
Protein Length | 1526 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644937462 |
Product | filamentous hemagglutinin outer membrane protein |
Protein accession | YP_003094051 |
Protein GI | 255533679 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0575475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTATA TTAACTTCTC ATCCCCTCTA CCTACCCTGA GTAGGTACAT CATCATTTCA TTTACCTTTA TTTTGTGTAC GCTTGGTTTT CAGGCACATG CACAATATTG TACCCCAAAT CTTAACTGTA CAGATGGTGA TCTGATCCTG AATGTAAGCC TGTCTACCTT AAACAACTCC AGCACATGCG GTACAAATGG TTATAGCAAT TTTACGGCTT TGGCTGCCCC GTCGCTGGCT ATGGGCACAA GTTATCCTAT TTCTGTAACT GTTGGTGCTG GCTGGTCATC TGAGGCTGTA AGTGTCTGGA TTGATTACAA TGGAAATGGA ATTTTTGAAG CTTCAGAGTT TACCTATATT GGTGCTGGCA GCGGCAGTGT AGTTTCCGGA AATATTGCCA TTCAGGCTAC CACCTTAGGT TCAAAAAGAA TGAGGGTGCG TGTGGCTGCA GTAGGAGCTA TTGCTGCTAC TGATGATATG GCATGTGATG CAATGCAGGA GTATGGTGAA TTTGAAGATT ATACAGTCAA TATTGTTGCG GCAACAGCAA CACCCAATGG CAATGGCATT TTGCATGTAA AGAAAAATGG GGCAGGTAAT TTTAATGGCA ACAGCTGGGC CAATGCCATT CCGGAACTGG CCGATGCGCT CAAGTTTGCA AAAATTCAGA ATGCCATCAT TCCGGGTACA GTCAAAGAAA TCTGGGTGGC AAAGGGTACC TATAAGCCCA TGTACAGTCC TGAGGATGGC CCTGATTTTG GGACAGATAA AGGCAGGGAC AATGCTTTTC TGCTGGTAAA GGATGTAAAA GTTTATGGTG GTTTTGCGGG CACAGAAACC ACCCTTTCCC AACGTGACCT GTCTGTAACA ACAAATAAGA GCACATTAAG CGGAGATTTT AATGATGATG ATGTTGTTAC TGGTACAGGC AGCACCTTGA ACATTTCGGG AAATACAGAG AACGCCTATC ATGTCCTTGT TTCTGCCGGT GGGGCAGGAA ATGCGGTATT AAATGGCTTT ACCATTACAG GTGCATATGG TGCCAGCGGA ACTGCCATCA TAGTCAATAC ATTTAGTGTT TACAGAAATA CGGGTGGGGG TATAATTAAT ACCAATGCTT CAAGCCCAAC CGTAGCCAAC TGTATCTTCA TCAACAATAT GGTGAAAGGA GGCTCTGGTG GTGCTATGTA TAATTCTATA TCGGCTAACC CGCTGGTGAT GAATTGCAGT TTCATCAACA ACTTTGCCAG CAATATTTCC GGTCAAGGCT GGGGCGATGA AGGCGGCGGC GGAATGTATA ACCAGGGTTC GGCCCCTGTT ATTGTCAATT GTACGTTTTA CGGTAATTTA GCGACGGGAA ACCGTAGTGG GGGGGCAATC AGTAACCTTC AGTCCATTCC AAGCATTATA AATACGGTCA TATCCAATAA CTCGGCAAGT GCTTCAAGTG GAATATACAG TCCCAATATG GGTACGCTTT TTATCCGCTA TAGCCTGGTA CAGGACATGC CGGCCGATGT TGCCAATCAT AATTTAGATG GTACAGTAGA TCCTTTGTTT ACCAATCCGG CTACAGGAGA TTACACTTTA AAATCAGGAA GTCCTGCGAT CAATTCAGGA AGTAATGTAT TATACGAAGC AGCCGACGGT AATGTGGGTA ACAACAGTTT AGGTCTGGAC AAGGACCTGG CCGGCAATCC CCGGTTGGCA GGTACAAATA TAGACATAGG TGCTTACGAA TCGCAAATTC AATCTCAAAC CATCACTGCA GGAAATATTG TTAAAACTTA CGGCGATGTA GCCTTTGTGC CGACCGCTAC AGCAAGTTCA GGTCTGGAAG TCAGCTATGC ATCTGCCGAC AATACCATTG CTGAAGCCTT TCAGGATGCT GCAGATGGTA ACAAATGGAA ACTGAAAATC AAAAAAGCAG GTACAGTAAA CATCACTGCA AGTCAGCCAG GTGGCAATGG TTATGACCCG GCCCCTGACG TTGTGTTCAG TTTAACGGTG AATAAAAGAC CGGTAACCTT AAGCATCAAA CCCGCAGCTA CCTTTAGCAA GGTGTATGAT GCCGGCACTG CAGGAACTTT TCTGGCAACA GACCTTATAT TGGCCAGCGG CGATGTCATC AATAGCGATG AAGTGTTGTT GAGTTTAAGC TCCGGAGCTG CCCAATATGA TACCAAAAAC GCGGGAACCG GAAAAACAAT TACCTTGCCA ATTGCCAGTG TATCACTAAG CGGTGCGCAG GCTGGAAATT ACAGCATTGC CAATTTAGCA GATCTGAGCA GCAGCAATGC AGAAATTACT GCTATGCCAT TAACCATAAC GGCCAGCAAT GCCAGCAAAG TTTACGACGG TATTGCCTAT GCCGGTGGTA ACGGTGTAAG TTATGGTGCC TTTGCTGCTG GGGAAAGTAG TGCCGATCTC TCCGGACTGT TGTCCTATGG TGGAACAGCA CAAAATGCTA TAAATGCAGG TAGCTATACC ATCATACCAG GTGGGCTAAG TTCAGGCAAC TATGCCATTA CTTATGTGAA CGCTGAACTG ACCATTTCCC AAAACCCCGT AAACACGCTT ACATTTAACA CACAGACTGC GGGCAGTACC CTTAACAAAA CCTATGGTGA TGCAGGCATC AATGCTGCTG CCAATGCCAG TTCTGGCCTC ACTGTTCTTT ACAGCAGTAG CAATACAGCA GTAGCCAGTG TAAATACTGC TGGCCAGGTT AGTTTCCTGG CTGCTGGTAC TGCTACCATT ACCGCCAGTC AGGCCGGCAA TGCCAATTAC GCTGCAGGAA CAGCCATCAG TTTCCAGGTA CAGGTGGCCA AAAAAATACT TACAGTTACT GCCAAGGATT TCAATAAAAC CTACGATGGT CTGCCTTATA CCGGTGGCAA TGGCGTAAGT TATAGTGGCT TTGAAAATGG CGATGACTTC AGCGCTTTAA CCGGAACAAT TGGTTATATA GGCACTTCAC AAGGGGCGCT AAATGCTGGC ACCTATGCTA TTGTTCCGAC GGGGCTGACA TCGGCCAATT ATGATTTCGA TTATAGGGGA GGAACACTGG GCATCACGCA ATCTGCCAAT AATGCAATAG TCTTTAACAG TCAGACTGCG GGCAGTACGC TTAACAAAAC CTATGGTGAT GCAGGCATCA ACGCTGCCGC CAATGCCAGT TCTGGCCTCA CTGTTCTTTA CAGCAGCAGC AATACAGCAG TGGCCAGTGT TAACACGAGC GGCACTGTGC AGCTCCTTTC TTTCGGTACA GCAATTATAA CAGCCAGTCA GCCAGGCAAC ATCAACTATG TAGCCGCCAC ACCGGTTAGC TTTACCGTCA ATGTACAGAA AAAACAGCTT AGCATTACCG CCAAAAACGC CAGCAAAATT TATGATGGCA ATATCTATAC CGGTGGCGCA GGTGTAATTT ACGACGGTTT CATTACTGGA GAAAATGAAA GCCATCTGCA GGGCGCACTT ACCTATTCAG GTACTGCCCA GGGTGCTAAA AATGCAGGCA GCTATTTCAT CAGCCCTGCA GGTTATACTT CCAGTAACTA CGCCATCAGC TACCAGGATG GCAACCTGAG CATTGCCAAA GCAAGCCTTA ATGTAACTGC AGCAGCAAAA AGCAAAACCT ATGGCGATGA CGATCCTGTA TTCAATTACA ACGCTACAGG TTTAATTGGT ACAGATGGCC TTACCGGAAG CCTTACACGG GCCGCCGGCA ATAACGCCAC AACTTATGCC ATTACCCAGG GAACTTTAAC TGCAGGCAAT AATTACACCA TTGTTTATAC CCCGGCCAAT CTTACCATCG GTAAAGCCCA GTTAATGGTA ACAGCCGAGG ATAAACAGAT GTGTCAGGGC GCAGCTTTAC CGGCATTTAC CGTAAGTTAC AGCGGTTTCA AATACAACGA TGGACCTGCC AGTCTGAACG CCGCCCAGCT AAACAGTACA GGAAACCAGT CTTCAGAAGC AGGTAATTAT GTGATTTCCG CAAGCGGGGC CACAGCAGCC AATTATACTT TCAACTATGT GAACGGGACT TTAAAGATCA ACCCCATGCC GGTGCTTACG GTTAATAGTG ATAAAGGCAG TACTATCAGT AAAGGAGAAA TTGTACAGCT GACTGTAACC GGAGCCATGA ACTATAGCTG GACAGCAAAC AGCAGCATCC TGGACGGACA GCAAACAGGC TTGCTAAGGG TAAGACCTAA AGAAACCACC ACCTATACGG TAACCGGAAC CAATGCCAGC GGTTGTAGCC AGCGCATTAG CTTTACCCTT ACTGTATTGG ATGATCTTGA AAAAATTAAG GCCAATAACA TTCTTACACC CAACAATGAC GGCTATAACG ACAAATGGGT GGTAGATAAT ATTGATTTTT ATCCCGACAA TACGGTGAAA GTATTTGACA GATCGGGCAG GGTGGTTTAT GCCAAAAAAG GATACGACAA CAGCTGGGAA GGTACGCTGA ATGGCACGGC CCTGGCAGAG GGTACCTATT ACTATATTAT TGATTTTGGT ATAAACAAAA GACCATTTAA GGGATATATC ACTTTAATCA GGGAAAACTA A
|
Protein sequence | MSYINFSSPL PTLSRYIIIS FTFILCTLGF QAHAQYCTPN LNCTDGDLIL NVSLSTLNNS STCGTNGYSN FTALAAPSLA MGTSYPISVT VGAGWSSEAV SVWIDYNGNG IFEASEFTYI GAGSGSVVSG NIAIQATTLG SKRMRVRVAA VGAIAATDDM ACDAMQEYGE FEDYTVNIVA ATATPNGNGI LHVKKNGAGN FNGNSWANAI PELADALKFA KIQNAIIPGT VKEIWVAKGT YKPMYSPEDG PDFGTDKGRD NAFLLVKDVK VYGGFAGTET TLSQRDLSVT TNKSTLSGDF NDDDVVTGTG STLNISGNTE NAYHVLVSAG GAGNAVLNGF TITGAYGASG TAIIVNTFSV YRNTGGGIIN TNASSPTVAN CIFINNMVKG GSGGAMYNSI SANPLVMNCS FINNFASNIS GQGWGDEGGG GMYNQGSAPV IVNCTFYGNL ATGNRSGGAI SNLQSIPSII NTVISNNSAS ASSGIYSPNM GTLFIRYSLV QDMPADVANH NLDGTVDPLF TNPATGDYTL KSGSPAINSG SNVLYEAADG NVGNNSLGLD KDLAGNPRLA GTNIDIGAYE SQIQSQTITA GNIVKTYGDV AFVPTATASS GLEVSYASAD NTIAEAFQDA ADGNKWKLKI KKAGTVNITA SQPGGNGYDP APDVVFSLTV NKRPVTLSIK PAATFSKVYD AGTAGTFLAT DLILASGDVI NSDEVLLSLS SGAAQYDTKN AGTGKTITLP IASVSLSGAQ AGNYSIANLA DLSSSNAEIT AMPLTITASN ASKVYDGIAY AGGNGVSYGA FAAGESSADL SGLLSYGGTA QNAINAGSYT IIPGGLSSGN YAITYVNAEL TISQNPVNTL TFNTQTAGST LNKTYGDAGI NAAANASSGL TVLYSSSNTA VASVNTAGQV SFLAAGTATI TASQAGNANY AAGTAISFQV QVAKKILTVT AKDFNKTYDG LPYTGGNGVS YSGFENGDDF SALTGTIGYI GTSQGALNAG TYAIVPTGLT SANYDFDYRG GTLGITQSAN NAIVFNSQTA GSTLNKTYGD AGINAAANAS SGLTVLYSSS NTAVASVNTS GTVQLLSFGT AIITASQPGN INYVAATPVS FTVNVQKKQL SITAKNASKI YDGNIYTGGA GVIYDGFITG ENESHLQGAL TYSGTAQGAK NAGSYFISPA GYTSSNYAIS YQDGNLSIAK ASLNVTAAAK SKTYGDDDPV FNYNATGLIG TDGLTGSLTR AAGNNATTYA ITQGTLTAGN NYTIVYTPAN LTIGKAQLMV TAEDKQMCQG AALPAFTVSY SGFKYNDGPA SLNAAQLNST GNQSSEAGNY VISASGATAA NYTFNYVNGT LKINPMPVLT VNSDKGSTIS KGEIVQLTVT GAMNYSWTAN SSILDGQQTG LLRVRPKETT TYTVTGTNAS GCSQRISFTL TVLDDLEKIK ANNILTPNND GYNDKWVVDN IDFYPDNTVK VFDRSGRVVY AKKGYDNSWE GTLNGTALAE GTYYYIIDFG INKRPFKGYI TLIREN
|
| |