Gene Ppha_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_0940 
Symbol 
ID6460902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp974995 
End bp980070 
Gene Length5076 bp 
Protein Length1691 aa 
Translation table11 
GC content51% 
IMG OID642727194 
Productfilamentous haemagglutinin family outer membrane protein 
Protein accessionYP_002017844 
Protein GI194336050 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCAGCA CCCATAGCTT TATCAGGAGC AATGTCAGGA GTACAGTAGT TGTCGCTGTT 
GTTGTACTTT TCTCCCTGCT TGCTGGACAC GAGGTTGTCG CTTTGCCCGC CGGGGGTGAG
TTGGCTGCCG GAAAGGCCAC CATCAGCACT CCATCACTCA CAGCAATGCA AATTGAGCAA
CAGAGCCGCC AAGCCATTAT CAACTGGAGC TCCTTCGGTA TCGGCAGAGG TGAAAGCGTC
AACGTAGTCC AGCCTGACAG CCGCTCCGCC CTCCTCAACC GGGTTTCGGG TGAGGATCCT
TCTGAACTGT TCGGTACGCT TTCCGCAAAC GGACGCATCT TTCTGGTGAA CCCGAACGGC
GTACTGTTTG CTCCCGGCGC CAGCATCAAC GTTGGAGGAT TGGTGACCTC TACACTCGAT
ATCAGAAACA GCGACTTCCT CTCAGAAAAA TATGCATTCT TCCGGCATGG CACGGTGGGT
TCAGTAATAA ATCAGGGCTC TTTGAGTGGT GGATTCATAG CGCTGCTTGG AAAAAATGTC
TCGAACACCG GCACTATTGT CACCACAGGC GGAACAACAG GAATAGCCGC CGGTGAAAAA
ATTACCCTCA ACATTGACCC CTCCGGTCTG GTAGCAATCA GAGTTGAAGA GACGGCATAT
AATGCTCAGA TCAGAAACAG CGGCGTGATC GAAGCCAACG ATGGAAGAGT GGTGATAAAA
GCCTCGGCCG CCGACGCTCT GCTTGCCTCC GTGGTAAATA ACAGTGGTCA GGTTCGTGCA
ACCAGCATGA AGGAACGCAA TGGCACCATC GTGCTCGAAG GCGCATCAAT CATCAACACA
GGCACATTCA CCGCTTCTGC AATCACTGTC AGTAGCAACA ACCTTATCGA TGCTGGCACC
TGGAATGCCG GGGGCTCTGC CAGTGGAGGA ACAATAGCAA TTGATGCAAC AGGTAACATT
GAGCAAACCG CAGCAAGCCG CATGACTGCC GATGGAGAAG AGGGTGGCCA GATTGAGCTT
GATGCAGGAA AGGGACTCTA TCTCTCTGGA ATGTTCAGTG CAAACGGCAG TTCGGGACGA
GGCGGGCAGA TCAGCATCAC TGCACCACAA ACGTTTCTCG CCGGAGCTCA GATAGAAGCT
AATGGCCAGA ATGGAGGAGG AAGCATTCTC ATCGGGGGTG GCTGGCAAGG AAAAAGCAAC
TATCTGGCCA ACGCCGAATC CACCACAGTA ACCGGGAGCT GCAACATCAG GGCCAACGCT
CTTGAAAATG GCAATGGCGG TACGGTAGTA CTCTGGTCAG ATCACTCAAC CGCTTTTTCC
GGCACAATTG AAGCCATGGG AGGCCATATC AGTGGGAACG GAGGAGAGGT CGAGCTCTCC
AGCCACGAAA CACTCACGAT CTCCGGCCAG GTCAGCACAG GCGCGCCTCA TGGCAACAAC
GGCTTTCTGC TGCTTGACCC GCAAAATATC ACCATTGACG CCAGTGTCAC AATTCCCCTC
TTTTCCCTTA TTCCCTTGCC TGATGCAAAC CCGGCAGAGG GTGATCAGCA TGGCTCCGGA
GATATCCTTG AGCTGAGCAA CGGCAACATT CTTGTTGCCA GCCCCTTCGA TGATTTCGTT
GCAACCGATG CCGGAGCTGT CAGACTCTAC AGGCCAGATG GAACCTTGCT CTCCATGCTC
TGCGGCTCAA CAGCCAATGA CCAGGTAGGT GAAACGGTCA CTGCACTGAC TGGCAGGAAG
AGTGCTGTAA CCTCAACATC TGAATGGTCG AATGGAGATC AAGCTGCTGC TGGCGCAGTC
ACATGGATTA ATGAGAGTAT CGGCGTAAGC GGAAGCGTAT CTGGCACCAA CAGCTTGGTG
GGCTCTACAG TCAATGATGG TACTTCAATC CGTGTAATCA CTCTGAACAA CGGAAACTAC
GTGGTCAGCT CACCAGACTG GGACAATGGT ACAGCCAGCG ACGCCGGGGC TGTCACCTGG
GGGAATGGGC TCAGCGGAAC GATTGGGGTA ATAAGTGCTG CCAACAGCCT CGTCGGTTCC
AAAAAAAATG ATCAGGTTGG TACGGTAACA GCATTGACGA ATGGCCATTA CGTCGTTTCA
TCTCCATTAT GGGACAAAGG AACTGTTACC AATGCCGGTG CCGTAACCTG GTCGAACGGT
CTCGGCGGAA CAGTTGGAAC AATAAGCGCG ACCAACAGTC TGGTCGGCTC CAAAACCGGC
GATCAGGTTG GCAGCGTCAC TGCCCTGACA AACGGTAATT ACGTTGTAAG CGCACCAAAT
TGGGATAATG GCTCAATCAC CAATGCAGGT ATGGCAACCC TGGTGAATGG ACTCAACGGC
GCTGTCGGAA CAATAATCGC AACAAACAGC CTCGTCGGTT CAAAAAAAGA GGATAAAGTT
GGAACCGGGG TAACGGCACT GACCAATGGC AATTACGTGG TCACATCACC CTCATGGGAC
AATGGTACAA CTACTGACGC AGGAGCGGTG ACCTGGGGTA ATGGAATCAA TGGTGCCATC
GGAGTAATAA GCACGACGAA CAGTCTGGTG GGATCAACAG CAAATGACCT TGCTTCAGCT
TATGTTACAG CTCTTGCCAA TGGCAACTAC GTGGTTGGAT CCCCCGAATG GGATAATGGT
CCAACGACTA ATGCTGGTGC AGTGACCTGG GGTAATGGAA TCGGTGGAAC TGCCGGAATG
ATAAGTAGTT CCAGCAGTCT GACCGGTTCA GCAGCCAGTG ATGGGCTTGG TTCAACAGTG
ACGGCACTGA CCAACGGAAA CTACGTAGCG GCATGGCCGC ATTGGGACAA CGGCACGGCC
ACTGACGCAG GAGCAGTAAG CTGGGGAAAT GGCAATGGAG GCTCGGTCGG TACCATAAGC
CCTGGCAACA GTCTGGTTGG TTCAACTGTC AATGACGGAA CACGTTACAA TATAATACCC
CTCACAAACG GGAACTACGT CGTAGGATCC CCATACTGGG ACAATGGTCC GGCTACCGAC
GCAGGTGCAG TGACCTGGGG AAATGGTTTC GAGGGAACTA CAGGAGTGAT TGGTGAAGAG
AACAGTCTGA TCGGTTCAAG CAAAAATGAC TATGCCGGAT CAGACAGTTC AGGGAAAAAT
AACATAACCG CTTTGAGTAA CGGTAATTAT GTTGCCTCTT CCATACTATG GGATCAGGGA
ACAACGGCCA ATACCGGCGC GGTGACCTGG GGAGATGGTC TTGGAGGGAG TGCCGGAGTA
ATCAACGCTT CCAACAGTCT GGTCGGCTCG AAAACAGGCG ATCAGGTCGG AACAGTGACT
GCTCTGCCCA ACGGTAATTA TGTCGTCTCT TCTCCATTAT GGAACAATGG GACATTAACC
AATGCTGGTG CCTTGACCTT GCTGGATGGA GTCAACAACA ACACGACCGG AGCAATAAGC
TCCTTGAACA GCCTTGTTGG ATCCAGCAAA GAGGATCAAC TCGGTATTGG CGGCATTACA
GCGCTCACCA TCGGCAGCAT GAACGGGAGC TTCGTTGTCA GCTCTTTTAA CCAAGCAAAC
GAAACGGGTT GGGTAGAGAT TCTTACCCCG AACCCCGAAC AGGAATCTGT TGAGCAGGAA
TACCGTTTCA ACCCGGATGC TGCCAACACC TTCCATCCAT CACAAATAAC CGCGCTGCTC
AATAAAGGGA GCCAGGTCGT TCTGAAGGCA AACAATGACA TCACCATCAA TTCATCCATC
ATCGCCGACA ATCCCCTTGG AAACGGGGGA AATCTCTACC TGAATGCAGG AAAAAGCATA
CTGCTCAATG CGACTATCTC AACCGATAAC GGCAATCTGA CAATGGCTTC GAACGATACT
CAGGCAAATG GAGTCGTCGA TCAATGGCGT TCGGCAGGAA ACGCAACAAT CACCATGGGT
GAAGGCACAT CGATCATCGC AGGTTCAGGA AATGTAGCCA TCGAACTGCG CGATGGATCG
GGAAATACCA ACAGGGAGAG CGGTGACATC ACCTTGCGCA ACATCACCGC CAGCAACATC
ATCGCCCTGA ACTTCGGGCC GACAGAGGGC TCCGGTATCA CTCTCGCCTC AGGAACTCTT
GCAAGCGAAG CGACAGAGGG GAGCACGATT ATTCTGGCAG GTAACGATTT CGACAATAGC
GCAAATGCAA GACTCTCCAC CAACGGAACA GCCCGATGGA TCGTCTATTC TGACAATCCA
GAAGCAACCA TCAAAGGCGG ACTGAAATCA GACTTTCGTC ACTACAATGC CAGTTATACA
AGCTACCCAC CCTTCAACAT CAACGAATCC GGCAATGGGT TTATCTATGC CTCTCTTCCG
AACCTGCTTT ATGTCACCAC CACCTTGACC AGCGGCAGCG CCAGCAGCAC TTTCGGAACA
TGGCCTGTTG CTACTTTCGG CTATACACTC ACGGGTTTTG CCGATAGCGA AGATAATGCT
GAAAATATCG GACTTCATGG CACCATGATG GTCAGCGAAT TTCCCAATGC AACATCGAAA
ACAGGCAACT ACTTGATAGA ATATGCTGGC GGCCTGTCAA GCAGTGTTGG CTTCACCTTT
ACAGCAGGCA CCGGGCTCAT TTATACGGTT GAAAAACAGC CGATCAACTT TAATGAGATC
AATCCGATAC TGAGCTGGCA ATCTGAAACA GAACATCGAG CAAAAAGAGT TCTTTCTGGC
GAAACCCTTT TCAATAAACC TGATCAGACA ATCAACTCGC CACAGAAAAG TTTTATCGTA
AGGCGAAGAG CCATTAAGAA TACAAAATAC AGCTCCCAGA GTGGTGACCT TATTGCTCCC
GAGACGATAA CCGTGCCAGC AAGAGCAGAA GCATTGTTCT TTTTTTCTCT CCCTGACAGC
ATCTTCAGCA ACAGCAATCC CGATGCAATT GTCACTCTCG AAGCACACTC AATCAATGGC
ACAGCAGTAC CGTCATGGAT CTCATTTGAC CCCAAACGAA AAATAATAAG CGGGAAAGCG
CCAAAAGAGG CAATAGGAAA ATACAGGATT GAGCTGGTCG CCAAAGACAA GTTTGGTGGC
GAGAGCCGGT CAATTGTACT GATAACAATC GGATAA
 
Protein sequence
MCSTHSFIRS NVRSTVVVAV VVLFSLLAGH EVVALPAGGE LAAGKATIST PSLTAMQIEQ 
QSRQAIINWS SFGIGRGESV NVVQPDSRSA LLNRVSGEDP SELFGTLSAN GRIFLVNPNG
VLFAPGASIN VGGLVTSTLD IRNSDFLSEK YAFFRHGTVG SVINQGSLSG GFIALLGKNV
SNTGTIVTTG GTTGIAAGEK ITLNIDPSGL VAIRVEETAY NAQIRNSGVI EANDGRVVIK
ASAADALLAS VVNNSGQVRA TSMKERNGTI VLEGASIINT GTFTASAITV SSNNLIDAGT
WNAGGSASGG TIAIDATGNI EQTAASRMTA DGEEGGQIEL DAGKGLYLSG MFSANGSSGR
GGQISITAPQ TFLAGAQIEA NGQNGGGSIL IGGGWQGKSN YLANAESTTV TGSCNIRANA
LENGNGGTVV LWSDHSTAFS GTIEAMGGHI SGNGGEVELS SHETLTISGQ VSTGAPHGNN
GFLLLDPQNI TIDASVTIPL FSLIPLPDAN PAEGDQHGSG DILELSNGNI LVASPFDDFV
ATDAGAVRLY RPDGTLLSML CGSTANDQVG ETVTALTGRK SAVTSTSEWS NGDQAAAGAV
TWINESIGVS GSVSGTNSLV GSTVNDGTSI RVITLNNGNY VVSSPDWDNG TASDAGAVTW
GNGLSGTIGV ISAANSLVGS KKNDQVGTVT ALTNGHYVVS SPLWDKGTVT NAGAVTWSNG
LGGTVGTISA TNSLVGSKTG DQVGSVTALT NGNYVVSAPN WDNGSITNAG MATLVNGLNG
AVGTIIATNS LVGSKKEDKV GTGVTALTNG NYVVTSPSWD NGTTTDAGAV TWGNGINGAI
GVISTTNSLV GSTANDLASA YVTALANGNY VVGSPEWDNG PTTNAGAVTW GNGIGGTAGM
ISSSSSLTGS AASDGLGSTV TALTNGNYVA AWPHWDNGTA TDAGAVSWGN GNGGSVGTIS
PGNSLVGSTV NDGTRYNIIP LTNGNYVVGS PYWDNGPATD AGAVTWGNGF EGTTGVIGEE
NSLIGSSKND YAGSDSSGKN NITALSNGNY VASSILWDQG TTANTGAVTW GDGLGGSAGV
INASNSLVGS KTGDQVGTVT ALPNGNYVVS SPLWNNGTLT NAGALTLLDG VNNNTTGAIS
SLNSLVGSSK EDQLGIGGIT ALTIGSMNGS FVVSSFNQAN ETGWVEILTP NPEQESVEQE
YRFNPDAANT FHPSQITALL NKGSQVVLKA NNDITINSSI IADNPLGNGG NLYLNAGKSI
LLNATISTDN GNLTMASNDT QANGVVDQWR SAGNATITMG EGTSIIAGSG NVAIELRDGS
GNTNRESGDI TLRNITASNI IALNFGPTEG SGITLASGTL ASEATEGSTI ILAGNDFDNS
ANARLSTNGT ARWIVYSDNP EATIKGGLKS DFRHYNASYT SYPPFNINES GNGFIYASLP
NLLYVTTTLT SGSASSTFGT WPVATFGYTL TGFADSEDNA ENIGLHGTMM VSEFPNATSK
TGNYLIEYAG GLSSSVGFTF TAGTGLIYTV EKQPINFNEI NPILSWQSET EHRAKRVLSG
ETLFNKPDQT INSPQKSFIV RRRAIKNTKY SSQSGDLIAP ETITVPARAE ALFFFSLPDS
IFSNSNPDAI VTLEAHSING TAVPSWISFD PKRKIISGKA PKEAIGKYRI ELVAKDKFGG
ESRSIVLITI G