Gene Phep_3798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3798 
Symbol 
ID8254932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4557901 
End bp4562481 
Gene Length4581 bp 
Protein Length1526 aa 
Translation table11 
GC content46% 
IMG OID644937462 
Productfilamentous hemagglutinin outer membrane protein 
Protein accessionYP_003094051 
Protein GI255533679 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0575475 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTATA TTAACTTCTC ATCCCCTCTA CCTACCCTGA GTAGGTACAT CATCATTTCA 
TTTACCTTTA TTTTGTGTAC GCTTGGTTTT CAGGCACATG CACAATATTG TACCCCAAAT
CTTAACTGTA CAGATGGTGA TCTGATCCTG AATGTAAGCC TGTCTACCTT AAACAACTCC
AGCACATGCG GTACAAATGG TTATAGCAAT TTTACGGCTT TGGCTGCCCC GTCGCTGGCT
ATGGGCACAA GTTATCCTAT TTCTGTAACT GTTGGTGCTG GCTGGTCATC TGAGGCTGTA
AGTGTCTGGA TTGATTACAA TGGAAATGGA ATTTTTGAAG CTTCAGAGTT TACCTATATT
GGTGCTGGCA GCGGCAGTGT AGTTTCCGGA AATATTGCCA TTCAGGCTAC CACCTTAGGT
TCAAAAAGAA TGAGGGTGCG TGTGGCTGCA GTAGGAGCTA TTGCTGCTAC TGATGATATG
GCATGTGATG CAATGCAGGA GTATGGTGAA TTTGAAGATT ATACAGTCAA TATTGTTGCG
GCAACAGCAA CACCCAATGG CAATGGCATT TTGCATGTAA AGAAAAATGG GGCAGGTAAT
TTTAATGGCA ACAGCTGGGC CAATGCCATT CCGGAACTGG CCGATGCGCT CAAGTTTGCA
AAAATTCAGA ATGCCATCAT TCCGGGTACA GTCAAAGAAA TCTGGGTGGC AAAGGGTACC
TATAAGCCCA TGTACAGTCC TGAGGATGGC CCTGATTTTG GGACAGATAA AGGCAGGGAC
AATGCTTTTC TGCTGGTAAA GGATGTAAAA GTTTATGGTG GTTTTGCGGG CACAGAAACC
ACCCTTTCCC AACGTGACCT GTCTGTAACA ACAAATAAGA GCACATTAAG CGGAGATTTT
AATGATGATG ATGTTGTTAC TGGTACAGGC AGCACCTTGA ACATTTCGGG AAATACAGAG
AACGCCTATC ATGTCCTTGT TTCTGCCGGT GGGGCAGGAA ATGCGGTATT AAATGGCTTT
ACCATTACAG GTGCATATGG TGCCAGCGGA ACTGCCATCA TAGTCAATAC ATTTAGTGTT
TACAGAAATA CGGGTGGGGG TATAATTAAT ACCAATGCTT CAAGCCCAAC CGTAGCCAAC
TGTATCTTCA TCAACAATAT GGTGAAAGGA GGCTCTGGTG GTGCTATGTA TAATTCTATA
TCGGCTAACC CGCTGGTGAT GAATTGCAGT TTCATCAACA ACTTTGCCAG CAATATTTCC
GGTCAAGGCT GGGGCGATGA AGGCGGCGGC GGAATGTATA ACCAGGGTTC GGCCCCTGTT
ATTGTCAATT GTACGTTTTA CGGTAATTTA GCGACGGGAA ACCGTAGTGG GGGGGCAATC
AGTAACCTTC AGTCCATTCC AAGCATTATA AATACGGTCA TATCCAATAA CTCGGCAAGT
GCTTCAAGTG GAATATACAG TCCCAATATG GGTACGCTTT TTATCCGCTA TAGCCTGGTA
CAGGACATGC CGGCCGATGT TGCCAATCAT AATTTAGATG GTACAGTAGA TCCTTTGTTT
ACCAATCCGG CTACAGGAGA TTACACTTTA AAATCAGGAA GTCCTGCGAT CAATTCAGGA
AGTAATGTAT TATACGAAGC AGCCGACGGT AATGTGGGTA ACAACAGTTT AGGTCTGGAC
AAGGACCTGG CCGGCAATCC CCGGTTGGCA GGTACAAATA TAGACATAGG TGCTTACGAA
TCGCAAATTC AATCTCAAAC CATCACTGCA GGAAATATTG TTAAAACTTA CGGCGATGTA
GCCTTTGTGC CGACCGCTAC AGCAAGTTCA GGTCTGGAAG TCAGCTATGC ATCTGCCGAC
AATACCATTG CTGAAGCCTT TCAGGATGCT GCAGATGGTA ACAAATGGAA ACTGAAAATC
AAAAAAGCAG GTACAGTAAA CATCACTGCA AGTCAGCCAG GTGGCAATGG TTATGACCCG
GCCCCTGACG TTGTGTTCAG TTTAACGGTG AATAAAAGAC CGGTAACCTT AAGCATCAAA
CCCGCAGCTA CCTTTAGCAA GGTGTATGAT GCCGGCACTG CAGGAACTTT TCTGGCAACA
GACCTTATAT TGGCCAGCGG CGATGTCATC AATAGCGATG AAGTGTTGTT GAGTTTAAGC
TCCGGAGCTG CCCAATATGA TACCAAAAAC GCGGGAACCG GAAAAACAAT TACCTTGCCA
ATTGCCAGTG TATCACTAAG CGGTGCGCAG GCTGGAAATT ACAGCATTGC CAATTTAGCA
GATCTGAGCA GCAGCAATGC AGAAATTACT GCTATGCCAT TAACCATAAC GGCCAGCAAT
GCCAGCAAAG TTTACGACGG TATTGCCTAT GCCGGTGGTA ACGGTGTAAG TTATGGTGCC
TTTGCTGCTG GGGAAAGTAG TGCCGATCTC TCCGGACTGT TGTCCTATGG TGGAACAGCA
CAAAATGCTA TAAATGCAGG TAGCTATACC ATCATACCAG GTGGGCTAAG TTCAGGCAAC
TATGCCATTA CTTATGTGAA CGCTGAACTG ACCATTTCCC AAAACCCCGT AAACACGCTT
ACATTTAACA CACAGACTGC GGGCAGTACC CTTAACAAAA CCTATGGTGA TGCAGGCATC
AATGCTGCTG CCAATGCCAG TTCTGGCCTC ACTGTTCTTT ACAGCAGTAG CAATACAGCA
GTAGCCAGTG TAAATACTGC TGGCCAGGTT AGTTTCCTGG CTGCTGGTAC TGCTACCATT
ACCGCCAGTC AGGCCGGCAA TGCCAATTAC GCTGCAGGAA CAGCCATCAG TTTCCAGGTA
CAGGTGGCCA AAAAAATACT TACAGTTACT GCCAAGGATT TCAATAAAAC CTACGATGGT
CTGCCTTATA CCGGTGGCAA TGGCGTAAGT TATAGTGGCT TTGAAAATGG CGATGACTTC
AGCGCTTTAA CCGGAACAAT TGGTTATATA GGCACTTCAC AAGGGGCGCT AAATGCTGGC
ACCTATGCTA TTGTTCCGAC GGGGCTGACA TCGGCCAATT ATGATTTCGA TTATAGGGGA
GGAACACTGG GCATCACGCA ATCTGCCAAT AATGCAATAG TCTTTAACAG TCAGACTGCG
GGCAGTACGC TTAACAAAAC CTATGGTGAT GCAGGCATCA ACGCTGCCGC CAATGCCAGT
TCTGGCCTCA CTGTTCTTTA CAGCAGCAGC AATACAGCAG TGGCCAGTGT TAACACGAGC
GGCACTGTGC AGCTCCTTTC TTTCGGTACA GCAATTATAA CAGCCAGTCA GCCAGGCAAC
ATCAACTATG TAGCCGCCAC ACCGGTTAGC TTTACCGTCA ATGTACAGAA AAAACAGCTT
AGCATTACCG CCAAAAACGC CAGCAAAATT TATGATGGCA ATATCTATAC CGGTGGCGCA
GGTGTAATTT ACGACGGTTT CATTACTGGA GAAAATGAAA GCCATCTGCA GGGCGCACTT
ACCTATTCAG GTACTGCCCA GGGTGCTAAA AATGCAGGCA GCTATTTCAT CAGCCCTGCA
GGTTATACTT CCAGTAACTA CGCCATCAGC TACCAGGATG GCAACCTGAG CATTGCCAAA
GCAAGCCTTA ATGTAACTGC AGCAGCAAAA AGCAAAACCT ATGGCGATGA CGATCCTGTA
TTCAATTACA ACGCTACAGG TTTAATTGGT ACAGATGGCC TTACCGGAAG CCTTACACGG
GCCGCCGGCA ATAACGCCAC AACTTATGCC ATTACCCAGG GAACTTTAAC TGCAGGCAAT
AATTACACCA TTGTTTATAC CCCGGCCAAT CTTACCATCG GTAAAGCCCA GTTAATGGTA
ACAGCCGAGG ATAAACAGAT GTGTCAGGGC GCAGCTTTAC CGGCATTTAC CGTAAGTTAC
AGCGGTTTCA AATACAACGA TGGACCTGCC AGTCTGAACG CCGCCCAGCT AAACAGTACA
GGAAACCAGT CTTCAGAAGC AGGTAATTAT GTGATTTCCG CAAGCGGGGC CACAGCAGCC
AATTATACTT TCAACTATGT GAACGGGACT TTAAAGATCA ACCCCATGCC GGTGCTTACG
GTTAATAGTG ATAAAGGCAG TACTATCAGT AAAGGAGAAA TTGTACAGCT GACTGTAACC
GGAGCCATGA ACTATAGCTG GACAGCAAAC AGCAGCATCC TGGACGGACA GCAAACAGGC
TTGCTAAGGG TAAGACCTAA AGAAACCACC ACCTATACGG TAACCGGAAC CAATGCCAGC
GGTTGTAGCC AGCGCATTAG CTTTACCCTT ACTGTATTGG ATGATCTTGA AAAAATTAAG
GCCAATAACA TTCTTACACC CAACAATGAC GGCTATAACG ACAAATGGGT GGTAGATAAT
ATTGATTTTT ATCCCGACAA TACGGTGAAA GTATTTGACA GATCGGGCAG GGTGGTTTAT
GCCAAAAAAG GATACGACAA CAGCTGGGAA GGTACGCTGA ATGGCACGGC CCTGGCAGAG
GGTACCTATT ACTATATTAT TGATTTTGGT ATAAACAAAA GACCATTTAA GGGATATATC
ACTTTAATCA GGGAAAACTA A
 
Protein sequence
MSYINFSSPL PTLSRYIIIS FTFILCTLGF QAHAQYCTPN LNCTDGDLIL NVSLSTLNNS 
STCGTNGYSN FTALAAPSLA MGTSYPISVT VGAGWSSEAV SVWIDYNGNG IFEASEFTYI
GAGSGSVVSG NIAIQATTLG SKRMRVRVAA VGAIAATDDM ACDAMQEYGE FEDYTVNIVA
ATATPNGNGI LHVKKNGAGN FNGNSWANAI PELADALKFA KIQNAIIPGT VKEIWVAKGT
YKPMYSPEDG PDFGTDKGRD NAFLLVKDVK VYGGFAGTET TLSQRDLSVT TNKSTLSGDF
NDDDVVTGTG STLNISGNTE NAYHVLVSAG GAGNAVLNGF TITGAYGASG TAIIVNTFSV
YRNTGGGIIN TNASSPTVAN CIFINNMVKG GSGGAMYNSI SANPLVMNCS FINNFASNIS
GQGWGDEGGG GMYNQGSAPV IVNCTFYGNL ATGNRSGGAI SNLQSIPSII NTVISNNSAS
ASSGIYSPNM GTLFIRYSLV QDMPADVANH NLDGTVDPLF TNPATGDYTL KSGSPAINSG
SNVLYEAADG NVGNNSLGLD KDLAGNPRLA GTNIDIGAYE SQIQSQTITA GNIVKTYGDV
AFVPTATASS GLEVSYASAD NTIAEAFQDA ADGNKWKLKI KKAGTVNITA SQPGGNGYDP
APDVVFSLTV NKRPVTLSIK PAATFSKVYD AGTAGTFLAT DLILASGDVI NSDEVLLSLS
SGAAQYDTKN AGTGKTITLP IASVSLSGAQ AGNYSIANLA DLSSSNAEIT AMPLTITASN
ASKVYDGIAY AGGNGVSYGA FAAGESSADL SGLLSYGGTA QNAINAGSYT IIPGGLSSGN
YAITYVNAEL TISQNPVNTL TFNTQTAGST LNKTYGDAGI NAAANASSGL TVLYSSSNTA
VASVNTAGQV SFLAAGTATI TASQAGNANY AAGTAISFQV QVAKKILTVT AKDFNKTYDG
LPYTGGNGVS YSGFENGDDF SALTGTIGYI GTSQGALNAG TYAIVPTGLT SANYDFDYRG
GTLGITQSAN NAIVFNSQTA GSTLNKTYGD AGINAAANAS SGLTVLYSSS NTAVASVNTS
GTVQLLSFGT AIITASQPGN INYVAATPVS FTVNVQKKQL SITAKNASKI YDGNIYTGGA
GVIYDGFITG ENESHLQGAL TYSGTAQGAK NAGSYFISPA GYTSSNYAIS YQDGNLSIAK
ASLNVTAAAK SKTYGDDDPV FNYNATGLIG TDGLTGSLTR AAGNNATTYA ITQGTLTAGN
NYTIVYTPAN LTIGKAQLMV TAEDKQMCQG AALPAFTVSY SGFKYNDGPA SLNAAQLNST
GNQSSEAGNY VISASGATAA NYTFNYVNGT LKINPMPVLT VNSDKGSTIS KGEIVQLTVT
GAMNYSWTAN SSILDGQQTG LLRVRPKETT TYTVTGTNAS GCSQRISFTL TVLDDLEKIK
ANNILTPNND GYNDKWVVDN IDFYPDNTVK VFDRSGRVVY AKKGYDNSWE GTLNGTALAE
GTYYYIIDFG INKRPFKGYI TLIREN