Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3442 |
Symbol | |
ID | 7103132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3587579 |
End bp | 3590437 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643476457 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_002373566 |
Protein GI | 218248195 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGAGATA TTACCCTCAC GACTAAAATA CTTGAGACAA TACCCAACTA TCCTGTGATG AGTGCCAGTT TTTATTTAAC TCCCCATGAC CTCCTTGCAA CTTTTCTAAG TATCAGTGGT TTAATGGTTT CTCCCATCGC TTCAGCACAA GTTGCTAGTG ATGGAACGGT ACAAACGCAA GTTAACACCC TAGGGAATCA ATTAGAGATT ACAGAGGGAA CGCAAGCAGG AAGCAATCTA TTTCATAGCT TCAGCCAATT TTCTGTTCCG AGTGGTTTTG AGGCTTATTT TAACAATTCT TCGACCATTA GCAATATTAT CAGCCGTGTT ACAGGGGGTT CAATTTCCAA TATAGAGGGA CTCATTCGAG CTAACGGGAC GGCTAATTTA TTCCTGATTA ACCCCAATGG TATTGTCTTT GGTCCGAATG CGGCGGTTGA TATTGGGGGG TCTTTTTTAG CCACTACCGC CGAATCCATC CAATTTGCTG ATGGCTCTCA ATTTAGTGCG ACAAACTCCC AATCTTCACC CATTTTAACG ATAGCGGTTC CAGTGGGGTT GCAATTTGGT TCTAACCCTG GAACCATTGT TAATCGAGCT AATCGAACAG TACCTGATCC CACTCCTGAT AACCCTAATA ATACTAGGCA AGTAGGATTT GAGGTTAAAT CGGGTAATAC TATAGCTCTA GTAGGGGGAG ATTTGGATTT TCAGGGGGGA AGGGTTAATG GTTCAGGAGG AAGGGTTGAA CTGGGCAGTG TGGGGGGTAA CAGTAGGGTT CAATTAAGCG TGACTAACCC AGGATTGGAG ATTGACTATC AAGGAGTTAC CAACTTTCAA GATATTTCCT TATCTCAACA ATCAATAGTA GATGTGAATA ATTCAACAAT TCAGGTACAG GGCAGCAATA TTCGGCTGAC TAATGGCTCT CAAATTAGCA GCACAACAAG AACAACAGAA GATGCTGGAG ATTTAACCGT TAATGCCACT GAATCTGTTG AACTGATTGG CGCTGCCCCT GAGGGGTTTC CTTTTCCTAG TGCTTTTATT GCTCAAGTCA ATCCAGGGGG TACAGGTAGA GGGGGTAATC TGATCATCAA TACGAAACAG TTAAGTATCC GTGAGGGTGC TGGTATATCA GTAGCGAGTC GAGGTAAAGG AATAGGCGGT CGGCTAGAAG TTAATGTCTC AGAGGCGATT GAGATTACGG GAACTGGTCC ACAATCCCTT AGTGTTCTCA CAAGCTCCAC AGATGGGACA GGGGATGGCG GCGAAATAGT CATTAATACG AGACAATTAA CGCTGAGCAA TGGCGGACAA ATACAGGCGT TTACGATTAA TCAAGGACGC GGGGGAACTA TCACCATTAA CGCTTCTGAT TCAATCGAAG TGAGTGGTCG AGGAGCGTTA CCAGAGTTTA ATACAGAAAG TTTTAGTTCT ATTACAGCAG AATCAGGTTT TCAACTCCTT GGATTTACAG GAATTGCCCC AGGAGGCAAT GTTAACATTA ATACTAATCA ACTTATTGTC ACCGACGGCG GAACGATTTC TGCTGGCAGT TTTGGACAAG GGAATGCGGG AAGCGTAGAT ATTACTTCAA ATTCGATCTT TTTAGACAAT CAAGGGGTAA TTACGGCTTC TAGTGAAGGA AGCGGGGATG CAGGGAACGT GAGTATCTAC ACAGACCAAC TCTCCGTTAA TAATCAATCA GAAATATCCG TCAGAAACAT TGGTTTTGGT CAGGGGGGGA ATTTAACGAT TAGTGCTGAT GCTATTTCGT TAAACCGAAA CAGTCAATTG AGTGCCGTTA GTCTTCCCTT AGAGGATCTA ACGCTCCAAA AATTAAGTAT TAACGCCGAA GAATTTGCCA ACCGTCCCAA TATTGGCAAC GCAGGGGATT TAATTCTTAA CACTTCTTCA TTAAATCTTA ATAATGATTC CCAAGTGACC GTCAGCAGTT TTGGAACGGG AAATGCAGGA AGTATGGGGA TTACAGCCCA AAATATCGCC CTAGATAACA GTAGCGAACT CGCTGCCGAA ACAGCTTCAG GGGAAGGGGG TAACATCAGT CTTTATGTCT CAGACTTCCT GAACCTTCGT CGCGCTAGTA CCATCTCCAC CACTGCAGGA ACCCTAGGTG GTGGTGGTAA TGGAGGTAAT ATCTTCATTG ATGCTGAATT TATGGTTACT GTACCCACCG AAAACAGCGA CATTATTGCC AATGCTTTTC TGGGTAATGG AGGAAACATT CGTATTAACG CCTCTGGAGT GTTTGGGATC GAAGAACGGG AACGTCTGAC CCCGTTGAAT GACATTACCG CGAGTTCCCA ATTTGGACAG GTAGGGAGTA TTGGTATTAA CCGACCCGAT GTTGATCCCC AACGCAGCTT AGTTAAATTG CCGGGTGAAG TGGTAGACGC AAAAAACCTT GTGGTTCAAG CGTGTAGTCC TGGGGGAGCG TATACGCGAG GCGAATTTAG CATAACGGGC TCAGGGGGTT TACCCGTTAA CCCTGATGAA GGAATTCAAA CTGCCCCAGG CTTGACTGAA TTAGGCTATC CTGAGATAGA AATGTTTAAT CAATCAGATA ATCAAGAATC TCTGAAAATC CCAAATGAGT CTTTAACCCC AGATTATAAG CGTCAGAGTT CTCCTACAAC CATTGTGGAA GCTCAAGGCT GGATCATGGA TAAGAATGGT AAAGTAGTGC TGACGGCTCA ATCACCTAAC GTTACTCCTC ACGGTTCTGG GTTTATGCCA TCCAACTGTT ATGACCTTTC TAACCGTTCC CTTTCCTCGG CTACGTCTCC ATCCCCATCC CTTTCTGATG TTCCTCAGCG TAATAGTCTG GCTGAGTAG
|
Protein sequence | MGDITLTTKI LETIPNYPVM SASFYLTPHD LLATFLSISG LMVSPIASAQ VASDGTVQTQ VNTLGNQLEI TEGTQAGSNL FHSFSQFSVP SGFEAYFNNS STISNIISRV TGGSISNIEG LIRANGTANL FLINPNGIVF GPNAAVDIGG SFLATTAESI QFADGSQFSA TNSQSSPILT IAVPVGLQFG SNPGTIVNRA NRTVPDPTPD NPNNTRQVGF EVKSGNTIAL VGGDLDFQGG RVNGSGGRVE LGSVGGNSRV QLSVTNPGLE IDYQGVTNFQ DISLSQQSIV DVNNSTIQVQ GSNIRLTNGS QISSTTRTTE DAGDLTVNAT ESVELIGAAP EGFPFPSAFI AQVNPGGTGR GGNLIINTKQ LSIREGAGIS VASRGKGIGG RLEVNVSEAI EITGTGPQSL SVLTSSTDGT GDGGEIVINT RQLTLSNGGQ IQAFTINQGR GGTITINASD SIEVSGRGAL PEFNTESFSS ITAESGFQLL GFTGIAPGGN VNINTNQLIV TDGGTISAGS FGQGNAGSVD ITSNSIFLDN QGVITASSEG SGDAGNVSIY TDQLSVNNQS EISVRNIGFG QGGNLTISAD AISLNRNSQL SAVSLPLEDL TLQKLSINAE EFANRPNIGN AGDLILNTSS LNLNNDSQVT VSSFGTGNAG SMGITAQNIA LDNSSELAAE TASGEGGNIS LYVSDFLNLR RASTISTTAG TLGGGGNGGN IFIDAEFMVT VPTENSDIIA NAFLGNGGNI RINASGVFGI EERERLTPLN DITASSQFGQ VGSIGINRPD VDPQRSLVKL PGEVVDAKNL VVQACSPGGA YTRGEFSITG SGGLPVNPDE GIQTAPGLTE LGYPEIEMFN QSDNQESLKI PNESLTPDYK RQSSPTTIVE AQGWIMDKNG KVVLTAQSPN VTPHGSGFMP SNCYDLSNRS LSSATSPSPS LSDVPQRNSL AE
|
| |