Gene Information |
Locus tag | Ppha_0940 |
Symbol | |
ID | 6460902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | - |
Start bp | 974995 |
End bp | 980070 |
Gene Length | 5076 bp |
Protein Length | 1691 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642727194 |
Product | filamentous haemagglutinin family outer membrane protein |
Protein accession | YP_002017844 |
Protein GI | 194336050 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
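The coordinate and length fields in the table above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The record does not state either assumption, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 974995
END_BP = 980070
GENE_LENGTH_BP = 5076
PROTEIN_LENGTH_AA = 1691


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 5076 1691
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields reported in the record.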
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCAGCA CCCATAGCTT TATCAGGAGC AATGTCAGGA GTACAGTAGT TGTCGCTGTT GTTGTACTTT TCTCCCTGCT TGCTGGACAC GAGGTTGTCG CTTTGCCCGC CGGGGGTGAG TTGGCTGCCG GAAAGGCCAC CATCAGCACT CCATCACTCA CAGCAATGCA AATTGAGCAA CAGAGCCGCC AAGCCATTAT CAACTGGAGC TCCTTCGGTA TCGGCAGAGG TGAAAGCGTC AACGTAGTCC AGCCTGACAG CCGCTCCGCC CTCCTCAACC GGGTTTCGGG TGAGGATCCT TCTGAACTGT TCGGTACGCT TTCCGCAAAC GGACGCATCT TTCTGGTGAA CCCGAACGGC GTACTGTTTG CTCCCGGCGC CAGCATCAAC GTTGGAGGAT TGGTGACCTC TACACTCGAT ATCAGAAACA GCGACTTCCT CTCAGAAAAA TATGCATTCT TCCGGCATGG CACGGTGGGT TCAGTAATAA ATCAGGGCTC TTTGAGTGGT GGATTCATAG CGCTGCTTGG AAAAAATGTC TCGAACACCG GCACTATTGT CACCACAGGC GGAACAACAG GAATAGCCGC CGGTGAAAAA ATTACCCTCA ACATTGACCC CTCCGGTCTG GTAGCAATCA GAGTTGAAGA GACGGCATAT AATGCTCAGA TCAGAAACAG CGGCGTGATC GAAGCCAACG ATGGAAGAGT GGTGATAAAA GCCTCGGCCG CCGACGCTCT GCTTGCCTCC GTGGTAAATA ACAGTGGTCA GGTTCGTGCA ACCAGCATGA AGGAACGCAA TGGCACCATC GTGCTCGAAG GCGCATCAAT CATCAACACA GGCACATTCA CCGCTTCTGC AATCACTGTC AGTAGCAACA ACCTTATCGA TGCTGGCACC TGGAATGCCG GGGGCTCTGC CAGTGGAGGA ACAATAGCAA TTGATGCAAC AGGTAACATT GAGCAAACCG CAGCAAGCCG CATGACTGCC GATGGAGAAG AGGGTGGCCA GATTGAGCTT GATGCAGGAA AGGGACTCTA TCTCTCTGGA ATGTTCAGTG CAAACGGCAG TTCGGGACGA GGCGGGCAGA TCAGCATCAC TGCACCACAA ACGTTTCTCG CCGGAGCTCA GATAGAAGCT AATGGCCAGA ATGGAGGAGG AAGCATTCTC ATCGGGGGTG GCTGGCAAGG AAAAAGCAAC TATCTGGCCA ACGCCGAATC CACCACAGTA ACCGGGAGCT GCAACATCAG GGCCAACGCT CTTGAAAATG GCAATGGCGG TACGGTAGTA CTCTGGTCAG ATCACTCAAC CGCTTTTTCC GGCACAATTG AAGCCATGGG AGGCCATATC AGTGGGAACG GAGGAGAGGT CGAGCTCTCC AGCCACGAAA CACTCACGAT CTCCGGCCAG GTCAGCACAG GCGCGCCTCA TGGCAACAAC GGCTTTCTGC TGCTTGACCC GCAAAATATC ACCATTGACG CCAGTGTCAC AATTCCCCTC TTTTCCCTTA TTCCCTTGCC TGATGCAAAC CCGGCAGAGG GTGATCAGCA TGGCTCCGGA GATATCCTTG AGCTGAGCAA CGGCAACATT CTTGTTGCCA GCCCCTTCGA TGATTTCGTT GCAACCGATG CCGGAGCTGT CAGACTCTAC AGGCCAGATG GAACCTTGCT CTCCATGCTC TGCGGCTCAA CAGCCAATGA CCAGGTAGGT GAAACGGTCA CTGCACTGAC TGGCAGGAAG AGTGCTGTAA CCTCAACATC TGAATGGTCG AATGGAGATC AAGCTGCTGC TGGCGCAGTC 
ACATGGATTA ATGAGAGTAT CGGCGTAAGC GGAAGCGTAT CTGGCACCAA CAGCTTGGTG GGCTCTACAG TCAATGATGG TACTTCAATC CGTGTAATCA CTCTGAACAA CGGAAACTAC GTGGTCAGCT CACCAGACTG GGACAATGGT ACAGCCAGCG ACGCCGGGGC TGTCACCTGG GGGAATGGGC TCAGCGGAAC GATTGGGGTA ATAAGTGCTG CCAACAGCCT CGTCGGTTCC AAAAAAAATG ATCAGGTTGG TACGGTAACA GCATTGACGA ATGGCCATTA CGTCGTTTCA TCTCCATTAT GGGACAAAGG AACTGTTACC AATGCCGGTG CCGTAACCTG GTCGAACGGT CTCGGCGGAA CAGTTGGAAC AATAAGCGCG ACCAACAGTC TGGTCGGCTC CAAAACCGGC GATCAGGTTG GCAGCGTCAC TGCCCTGACA AACGGTAATT ACGTTGTAAG CGCACCAAAT TGGGATAATG GCTCAATCAC CAATGCAGGT ATGGCAACCC TGGTGAATGG ACTCAACGGC GCTGTCGGAA CAATAATCGC AACAAACAGC CTCGTCGGTT CAAAAAAAGA GGATAAAGTT GGAACCGGGG TAACGGCACT GACCAATGGC AATTACGTGG TCACATCACC CTCATGGGAC AATGGTACAA CTACTGACGC AGGAGCGGTG ACCTGGGGTA ATGGAATCAA TGGTGCCATC GGAGTAATAA GCACGACGAA CAGTCTGGTG GGATCAACAG CAAATGACCT TGCTTCAGCT TATGTTACAG CTCTTGCCAA TGGCAACTAC GTGGTTGGAT CCCCCGAATG GGATAATGGT CCAACGACTA ATGCTGGTGC AGTGACCTGG GGTAATGGAA TCGGTGGAAC TGCCGGAATG ATAAGTAGTT CCAGCAGTCT GACCGGTTCA GCAGCCAGTG ATGGGCTTGG TTCAACAGTG ACGGCACTGA CCAACGGAAA CTACGTAGCG GCATGGCCGC ATTGGGACAA CGGCACGGCC ACTGACGCAG GAGCAGTAAG CTGGGGAAAT GGCAATGGAG GCTCGGTCGG TACCATAAGC CCTGGCAACA GTCTGGTTGG TTCAACTGTC AATGACGGAA CACGTTACAA TATAATACCC CTCACAAACG GGAACTACGT CGTAGGATCC CCATACTGGG ACAATGGTCC GGCTACCGAC GCAGGTGCAG TGACCTGGGG AAATGGTTTC GAGGGAACTA CAGGAGTGAT TGGTGAAGAG AACAGTCTGA TCGGTTCAAG CAAAAATGAC TATGCCGGAT CAGACAGTTC AGGGAAAAAT AACATAACCG CTTTGAGTAA CGGTAATTAT GTTGCCTCTT CCATACTATG GGATCAGGGA ACAACGGCCA ATACCGGCGC GGTGACCTGG GGAGATGGTC TTGGAGGGAG TGCCGGAGTA ATCAACGCTT CCAACAGTCT GGTCGGCTCG AAAACAGGCG ATCAGGTCGG AACAGTGACT GCTCTGCCCA ACGGTAATTA TGTCGTCTCT TCTCCATTAT GGAACAATGG GACATTAACC AATGCTGGTG CCTTGACCTT GCTGGATGGA GTCAACAACA ACACGACCGG AGCAATAAGC TCCTTGAACA GCCTTGTTGG ATCCAGCAAA GAGGATCAAC TCGGTATTGG CGGCATTACA GCGCTCACCA TCGGCAGCAT GAACGGGAGC TTCGTTGTCA GCTCTTTTAA CCAAGCAAAC GAAACGGGTT GGGTAGAGAT TCTTACCCCG AACCCCGAAC AGGAATCTGT TGAGCAGGAA TACCGTTTCA 
ACCCGGATGC TGCCAACACC TTCCATCCAT CACAAATAAC CGCGCTGCTC AATAAAGGGA GCCAGGTCGT TCTGAAGGCA AACAATGACA TCACCATCAA TTCATCCATC ATCGCCGACA ATCCCCTTGG AAACGGGGGA AATCTCTACC TGAATGCAGG AAAAAGCATA CTGCTCAATG CGACTATCTC AACCGATAAC GGCAATCTGA CAATGGCTTC GAACGATACT CAGGCAAATG GAGTCGTCGA TCAATGGCGT TCGGCAGGAA ACGCAACAAT CACCATGGGT GAAGGCACAT CGATCATCGC AGGTTCAGGA AATGTAGCCA TCGAACTGCG CGATGGATCG GGAAATACCA ACAGGGAGAG CGGTGACATC ACCTTGCGCA ACATCACCGC CAGCAACATC ATCGCCCTGA ACTTCGGGCC GACAGAGGGC TCCGGTATCA CTCTCGCCTC AGGAACTCTT GCAAGCGAAG CGACAGAGGG GAGCACGATT ATTCTGGCAG GTAACGATTT CGACAATAGC GCAAATGCAA GACTCTCCAC CAACGGAACA GCCCGATGGA TCGTCTATTC TGACAATCCA GAAGCAACCA TCAAAGGCGG ACTGAAATCA GACTTTCGTC ACTACAATGC CAGTTATACA AGCTACCCAC CCTTCAACAT CAACGAATCC GGCAATGGGT TTATCTATGC CTCTCTTCCG AACCTGCTTT ATGTCACCAC CACCTTGACC AGCGGCAGCG CCAGCAGCAC TTTCGGAACA TGGCCTGTTG CTACTTTCGG CTATACACTC ACGGGTTTTG CCGATAGCGA AGATAATGCT GAAAATATCG GACTTCATGG CACCATGATG GTCAGCGAAT TTCCCAATGC AACATCGAAA ACAGGCAACT ACTTGATAGA ATATGCTGGC GGCCTGTCAA GCAGTGTTGG CTTCACCTTT ACAGCAGGCA CCGGGCTCAT TTATACGGTT GAAAAACAGC CGATCAACTT TAATGAGATC AATCCGATAC TGAGCTGGCA ATCTGAAACA GAACATCGAG CAAAAAGAGT TCTTTCTGGC GAAACCCTTT TCAATAAACC TGATCAGACA ATCAACTCGC CACAGAAAAG TTTTATCGTA AGGCGAAGAG CCATTAAGAA TACAAAATAC AGCTCCCAGA GTGGTGACCT TATTGCTCCC GAGACGATAA CCGTGCCAGC AAGAGCAGAA GCATTGTTCT TTTTTTCTCT CCCTGACAGC ATCTTCAGCA ACAGCAATCC CGATGCAATT GTCACTCTCG AAGCACACTC AATCAATGGC ACAGCAGTAC CGTCATGGAT CTCATTTGAC CCCAAACGAA AAATAATAAG CGGGAAAGCG CCAAAAGAGG CAATAGGAAA ATACAGGATT GAGCTGGTCG CCAAAGACAA GTTTGGTGGC GAGAGCCGGT CAATTGTACT GATAACAATC GGATAA
|
Protein sequence | MCSTHSFIRS NVRSTVVVAV VVLFSLLAGH EVVALPAGGE LAAGKATIST PSLTAMQIEQ QSRQAIINWS SFGIGRGESV NVVQPDSRSA LLNRVSGEDP SELFGTLSAN GRIFLVNPNG VLFAPGASIN VGGLVTSTLD IRNSDFLSEK YAFFRHGTVG SVINQGSLSG GFIALLGKNV SNTGTIVTTG GTTGIAAGEK ITLNIDPSGL VAIRVEETAY NAQIRNSGVI EANDGRVVIK ASAADALLAS VVNNSGQVRA TSMKERNGTI VLEGASIINT GTFTASAITV SSNNLIDAGT WNAGGSASGG TIAIDATGNI EQTAASRMTA DGEEGGQIEL DAGKGLYLSG MFSANGSSGR GGQISITAPQ TFLAGAQIEA NGQNGGGSIL IGGGWQGKSN YLANAESTTV TGSCNIRANA LENGNGGTVV LWSDHSTAFS GTIEAMGGHI SGNGGEVELS SHETLTISGQ VSTGAPHGNN GFLLLDPQNI TIDASVTIPL FSLIPLPDAN PAEGDQHGSG DILELSNGNI LVASPFDDFV ATDAGAVRLY RPDGTLLSML CGSTANDQVG ETVTALTGRK SAVTSTSEWS NGDQAAAGAV TWINESIGVS GSVSGTNSLV GSTVNDGTSI RVITLNNGNY VVSSPDWDNG TASDAGAVTW GNGLSGTIGV ISAANSLVGS KKNDQVGTVT ALTNGHYVVS SPLWDKGTVT NAGAVTWSNG LGGTVGTISA TNSLVGSKTG DQVGSVTALT NGNYVVSAPN WDNGSITNAG MATLVNGLNG AVGTIIATNS LVGSKKEDKV GTGVTALTNG NYVVTSPSWD NGTTTDAGAV TWGNGINGAI GVISTTNSLV GSTANDLASA YVTALANGNY VVGSPEWDNG PTTNAGAVTW GNGIGGTAGM ISSSSSLTGS AASDGLGSTV TALTNGNYVA AWPHWDNGTA TDAGAVSWGN GNGGSVGTIS PGNSLVGSTV NDGTRYNIIP LTNGNYVVGS PYWDNGPATD AGAVTWGNGF EGTTGVIGEE NSLIGSSKND YAGSDSSGKN NITALSNGNY VASSILWDQG TTANTGAVTW GDGLGGSAGV INASNSLVGS KTGDQVGTVT ALPNGNYVVS SPLWNNGTLT NAGALTLLDG VNNNTTGAIS SLNSLVGSSK EDQLGIGGIT ALTIGSMNGS FVVSSFNQAN ETGWVEILTP NPEQESVEQE YRFNPDAANT FHPSQITALL NKGSQVVLKA NNDITINSSI IADNPLGNGG NLYLNAGKSI LLNATISTDN GNLTMASNDT QANGVVDQWR SAGNATITMG EGTSIIAGSG NVAIELRDGS GNTNRESGDI TLRNITASNI IALNFGPTEG SGITLASGTL ASEATEGSTI ILAGNDFDNS ANARLSTNGT ARWIVYSDNP EATIKGGLKS DFRHYNASYT SYPPFNINES GNGFIYASLP NLLYVTTTLT SGSASSTFGT WPVATFGYTL TGFADSEDNA ENIGLHGTMM VSEFPNATSK TGNYLIEYAG GLSSSVGFTF TAGTGLIYTV EKQPINFNEI NPILSWQSET EHRAKRVLSG ETLFNKPDQT INSPQKSFIV RRRAIKNTKY SSQSGDLIAP ETITVPARAE ALFFFSLPDS IFSNSNPDAI VTLEAHSING TAVPSWISFD PKRKIISGKA PKEAIGKYRI ELVAKDKFGG ESRSIVLITI G
|
| |
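The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own pipeline. It assumes the listed gene sequence is already in coding orientation (it begins with ATG even though the strand is "-"), and it uses the codon assignments shared by the standard genetic code and translation table 11. The `translate` helper and the two short fragments are illustrative and are copied from the first 30 and last 15 bases of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated over T, C, A, G,
# with the third codon position varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record)
    is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGTGCAGCA CCCATAGCTT TATCAGGAGC"  # first 30 bases
gene_end = "ATAACAATC GGATAA"                    # last 15 bases

print(translate(gene_start))  # prints: MCSTHSFIRS
print(translate(gene_end))    # prints: ITIG
```

The two outputs match the first ten and last four residues of the protein sequence listed in the record.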