Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_2674 |
Symbol | |
ID | 8392000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 2702848 |
End bp | 2705706 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644980633 |
Product | filamentous haemagglutinin family outer membrane protein |
Protein accession | YP_003138369 |
Protein GI | 257060481 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGAGATA TTACCCTCAC GACTAAAATA CTTGAGACAA TACCCAACTA TCCTGTGATG AGTGCCAGTT TTTATTTAAC TCCCCATGAC CTCCTTGCAA CTTTTCTAAG TATCAGTGGT TTAATGGTTT CTCCCATCGC TTCAGCACAA GTTGCTAGTG ATGGAACGGT ACAAACGCAA GTTAACACCC TAGGGAATCA ATTAGAGATT ACAGAGGGAA CGCAAGCAGG AAGCAATCTA TTTCATAGCT TCAGCCAATT TTCTGTTCCG AGTGGTTTTG AGGCTTATTT TAACAATTCT TCGACCATTA GCAATATTAT CAGCCGTGTT ACAGGGGGTT CAATTTCCAA TATAGAGGGA CTCATTCGGG CTAACGGGAC GGCTAATTTA TTTCTCATCA ACCCCAATGG TATTGTCTTT GGTCCGAATG CGGTGGTTGA TATTGGGGGG TCTTTTTTAG CCACTACCGC CGAATCCATC CAATTTGCTG ATGGCTCTCA ATTTAGTGCG ACAAACTCCC AATCTTCACC CATTTTAACG ATAGCGGTTC CAGTGGGGTT GCAATTTGGT TCTAACCCTG GAACCATTGT TAATCGAGCT CATCGAACAG TACCTGATCC CACTCCTGAT AACCCTAATA ATACTAGGCA AGTAGGATTT GAGGTTAAAT CGGGTAATAC TATAGCTCTA GTAGGGGGAG ATTTGGATTT TCAGGGGGGA AGGGTTAATG GTTCAGGAGG AAGGGTTGAA CTGGGCAGTG TGGGGGGTAA CAGTAGGGTT CAATTAAGCG TGACTAACCC AGGATTGGAG ATTGACTATC AAGGAGTTAC CAACTTTCAA GATATTTCCT TATCTCAACA ATCAATAGTA GATGTGAATA ATTCAACAAT TCAGGTACAG GGCAGCAATA TTCGGCTGAC TAATGGCTCT CAAATTAGCA GCACAACAAG AACAACAGAA GATGCTGGAG ATTTAACCGT TAATGCCACT GAATCTGTTG AACTGATTGG CGCTGCCCCT GAGGGGTTTC CTTTTCCTAG TGCTTTTATT GCTCAAGTCA ATCCAGGGGG TACAGGTAGA GGGGGTAATC TGATCATCAA TACGAAACAG TTAAGTATCC GTGAGGGTGC TGGTATATCA GTAGCGAGTC AAGGTGAAGG AATAGGCGGT CGGCTAGAAG TTAATGTCTC AGAGGCGATT GAGATTACGG GAACTGGTCG ACTATTTCCC AGTGTTCTCA CAAGCTCCAC AGATGGGACA GGGGATGGGG GCGAAATAGT CATTAATACC AGACAATTAA CGCTGAGCAA TGGCGGACAA ATACAGGCGT TTACGATTAG TCAAGGACGC GGGGGAACCA TCACTATTAA CGCTTCTGAT TCCATCGAAG TGAGTGGTCG AGGAGCGTTA CCAGAGTTTA ATACAGAAAG TTTTAGTTCT ATTACAGCAG AATCAGGTTT TCAACTCCTT GGATTTACAG GAATTGCCCC AGGAGGTAAC GTTAACATTA ATACTAATCA ACTTATTGTC ACCGACGGCG GAACGATTTC TGCTGGCAGT TTTGGACAAG GGAATGCGGG AAGCGTAGAT ATTACTTCAA ATTCGATCTT TTTAGACAAT CAAGGGGTAA TTACGGCTTC TAGTGAAGGA AGCGGGGATG CAGGGAACAT CACCATCCTC ACTGACCAAC TCTCCGTTAA TAATCAATCA GAAATATCCG TCAGAAACAT TGGTTTTGGT CAGGGGGGGA ATTTAACCAT TAGTGCCGAT GCTATCTCCT TAAACCAAGA CAGTCAACTA ACTGCTGTTA GTTTTCCCCT AGAGGATCTA ACGCTCCAAG AATTAGGTAT TAACGCCGAA GAATTTGTTA ACCGTCCCAA TATTGGCAAC GCAGGGGATT TAATTCTTAA CACTTCTTCA TTAAATCTTA ATAATGATTC CCAAGTGACC GTCAGCAGTT TTGGAACGGG AAATGCAGGA AGTATGGGGA TTACAGCCCA AAATATCGCC CTAGACAACA GTAGCGAACT CGCTGCCGAA ACAGCTTCAG GGGAAGGGGG TAACATCAGT CTTTATGTCT CAGACTTCCT GAACCTTCGT CGCGCTAGTA CCATCTCCAC CACTGCAGGA ACCCTAGGTG GTGGTGGTAA TGGAGGTAAT ATCTTCATTG ATGCTGAATT TATGGTTACT GTACCCACCG AAAACAGCGA CATTATTGCC AATGCTTTTC TGGGTAATGG AGGAAACATT CGTATTAACG CCTCTGGAGT GTTTGGGATC GAAGAACGGG AACGTCTGAC CCCGTTGAAT GACATTACCG CGAGTTCCCA ATTTGGACAG GTAGGGAGTA TTGGTATTAA CCGACCCGAT GTTGATCCCC AACGCAGCTT AGTTAAATTG CCGGGTGAAG TGGTAGACGC AAAAAACCTT GTGGTTCAAG CGTGTAGTCC TGGGGGAGCG TATACGCGAG GCGAATTTAG CATAACGGGC TCAGGGGGTT TACCCGTTAA CCCTGATGAA GGAATTCAAA CTTCCCCAGG CTTGACTGAA TTAGGCTATC CTGAGATAGA AATGTTTAAT CAATCAGATA ATCAAGAATC TCTGAAAATC CCAAATGAGT CTTTAACCCC AGATTATAAG CGTCAGAGTT CTCCTACAAC CATTGTGGAA GCTCAAGGCT GGATCATGGA TAAGAATGGT AAAGTAGTGC TGACGGCTCA ATCACCTAAC GTTACTCCTC ACGGTTCTGG GTTTATGCCA TCCAACTGTT ATGACCTTTC TAACCGTTCC CTTTCCTCGG CTACGTCTCC ATCCCCATCC CTTTCTGATG TTCCTCAGCG TAATAGTCTG GCTGAGTAG
|
Protein sequence | MGDITLTTKI LETIPNYPVM SASFYLTPHD LLATFLSISG LMVSPIASAQ VASDGTVQTQ VNTLGNQLEI TEGTQAGSNL FHSFSQFSVP SGFEAYFNNS STISNIISRV TGGSISNIEG LIRANGTANL FLINPNGIVF GPNAVVDIGG SFLATTAESI QFADGSQFSA TNSQSSPILT IAVPVGLQFG SNPGTIVNRA HRTVPDPTPD NPNNTRQVGF EVKSGNTIAL VGGDLDFQGG RVNGSGGRVE LGSVGGNSRV QLSVTNPGLE IDYQGVTNFQ DISLSQQSIV DVNNSTIQVQ GSNIRLTNGS QISSTTRTTE DAGDLTVNAT ESVELIGAAP EGFPFPSAFI AQVNPGGTGR GGNLIINTKQ LSIREGAGIS VASQGEGIGG RLEVNVSEAI EITGTGRLFP SVLTSSTDGT GDGGEIVINT RQLTLSNGGQ IQAFTISQGR GGTITINASD SIEVSGRGAL PEFNTESFSS ITAESGFQLL GFTGIAPGGN VNINTNQLIV TDGGTISAGS FGQGNAGSVD ITSNSIFLDN QGVITASSEG SGDAGNITIL TDQLSVNNQS EISVRNIGFG QGGNLTISAD AISLNQDSQL TAVSFPLEDL TLQELGINAE EFVNRPNIGN AGDLILNTSS LNLNNDSQVT VSSFGTGNAG SMGITAQNIA LDNSSELAAE TASGEGGNIS LYVSDFLNLR RASTISTTAG TLGGGGNGGN IFIDAEFMVT VPTENSDIIA NAFLGNGGNI RINASGVFGI EERERLTPLN DITASSQFGQ VGSIGINRPD VDPQRSLVKL PGEVVDAKNL VVQACSPGGA YTRGEFSITG SGGLPVNPDE GIQTSPGLTE LGYPEIEMFN QSDNQESLKI PNESLTPDYK RQSSPTTIVE AQGWIMDKNG KVVLTAQSPN VTPHGSGFMP SNCYDLSNRS LSSATSPSPS LSDVPQRNSL AE
|
| |