Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_0285 |
Symbol | |
ID | 7104017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 269512 |
End bp | 273120 |
Gene Length | 3609 bp |
Protein Length | 1202 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643473395 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_002370541 |
Protein GI | 218245170 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3210] Large exoproteins involved in heme utilization or adhesion |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAAT TGACTTTTTT GAATTCAAAT AATCTTAAAT TTAGATTAAC TTTAATCAGT GGCATTATTC TATTATCGAC GGGATTAACT ATCGATTCTC TTGAAGCACA AACGTCTCCT ATTATTCCTG ATAATACCTT ACCGATTAAT TCCCAAGTAA CGCCTAATGG CAATCAATTG ATTATCAACC AAGGAACGTC AGCAGGAAAC AATCTTTTTC ATAGTTTTGA AGCCTTTTCA CTGCCTACAG GTTACGAAGC TTTTTTTAAC AATTCTTTGG CGATTCAGAA CATTTTTAGT CGAGTAACAG GGACTTCAAT TTCTAATATT GATGGAATCA TTCGAGCCAA TGGGATAGCC AATTTATTTT TCTTAAATCC TAATGGAATT ATCTTTGGAT CTAATGCACA ATTAGACATT GGCGGTTCTT TTTTTGGCAG TACAGCTAAT AGTATTCTCT TTGAGGATAA CAGTGAGTTT AGTGCAACTA ACCCAAATAA TTCCTCATTA CTCTCTATAT CCATGCCTAT TGGCTTACAA TTTAGAGACA ATTCCGCTCA AATTCAAGTC CAAGGAGCGG GACATCAATT ACAATTAGTT GATCCCCTAA TTTTCTTTTC TGTTCAACAA CCTAATAACC CCAACCTAGG ACTTCAAGTT CAAGATGGAA AAACCTTAGC CTTAGTTGGA GGATTGGTTA ATCTAGAAGG AGGGATTATT ACAGCACCAG AGGGAAATAT TGAATTAGGT GGAGTAAAAT CTGGTTTAGT CCGACTTAAT TCAAGTGGCA TAGGTTGGAC GTTAGACTAT CAACAAGTAG ACAATTTTCA AAACATAGCA CTCACCCAAA AAGCCTTAGT AGATACCAGT GGCAACAGTG GGGGATCTAT CCAAGTTCAA GGATCTCAAA TTGCCCTCCT AGATGGCTCC GTTATTTGGA TAGAAAATAG GGGATCTCAA CCCAGTGGTA ATATAGCAAT TCATAGTTCC CAATCAGTTG AATTAAAAGG ATTAAATGGA GATAGAAGCA CGATTTTTCC TAGCACAATT TTTACGCAAG CAATAGGACA AGGAAAAGGC GGTGATATTC TTATTTCTAC CGAGCAATTG AATATGGATG AAATATCCCG TGTAGACGCT TTAACCTTAT TAGGAGGAGC AAGAGGGGGT AATCTGGTAC TCAATGCTCG TGATTCTATC AATATGAATT CAGGTTCCCC GTTGATTGAT ACCAGTATCC TGTCTCGAAC CCAATTTTCA GGAGACGCAG GAAACATTAC CGTATCAACC AATAAGCTAA CCATCACCAA TGGAGGTACG CTGATCTCTC CGTCCAATAT TGGAACTGGG AATGCAGGGG ATGTCACCGT CAAGGCTGTG GAAATTGAGA TAATAGGAGT CAACCCTGGA AACTTAGTGT CTAGCCAAAT TTCTACCCCT ACCCGTGATG GGAATGCTGG CAACGTAGAA ATTAATACCG CAAGGCTCAT TCTTCGGGAT GGAGGCTTAG TTAATTCCGC TACCTTAGCC ACTGGCAATG CTGGAAGTGT TAGGATTAAT GCCTCCGAAT CCGTCGAGAT TAGTGGTACA TTTCCTCAGT CTATTCTGGC CTCTCAAATC AGTTCATCGG CTCTAATTAT TGATGAAGCA TTACAGCAAC TGTTCGGAAT TGACCCTATC CCCACTGGTG CATCAGGGAA TGTAACCGTT AATACCCCTC AATTAACCAT TACGAATCAA GGATTAATCA GTGTTCGTAA TGATGGAACA GGTGATGCAG GGAATTTAGA GATCAATGCT GATTCGATTG TTCTCGATAA TCAAGGTGGT ATTGCTGCAT CGACTGCCTC TGGAGAAGGT GGCAATATTT TCCTTAATCT AGGGAATAAT CTACGAATGA ACAATAGCAG TTTTATCACC GCTACCGCAG GGGGAGGGGG TAATAGTGGC AACATAACCA TTGATACCCC CATCTTAATC CTTCAGAACG GTGCAACTAT TTCTGCTTTA AGCATCTTAA GACAAGGGGG AGATATTGAG TTAGAAGGTC TACAAACCCT AGAAATTAGA GATAACAGTG AAATTTCTGT TTCTACTGAA ACCGGAGAAG CTGGCAGTAT TGGGATTAAT CAAAATCAAA CCCCAGTGAA AACAGTTGAC ATCACCAACG GTAGCCGCTT AGCTGCCCAA GCCACTCAAC CCCAAGGGGA AGCGGGAAGT ATCAGCGTTA ACACTACAAA TCTAGCGGTT AATCAAGGTT CCTCCATCTC TGCAGCTAAT ATCTCAGGAA GAACTGGAGG CGATATTAAC CTCCTGAATT TAGGTCGTTT AGAAGTCAAT GGGGCAGAAA TTTCAGCAAC TACTCGCGAT GGCAAGGCAG GTAACTTAGC GATTAATCAA GGTCAAACTC CCGTCAAGAT AATCGAACTG AATGCAGGTA GTCTCACCGT TGAAGCTACC GGAATGGGAG AATCTGGCAA CCTTACTGTT AACGCACTAA GCTTAAATTT ACAAAATAAC GCTGAAATTT CAGCCTCAAC GAATTTCGGA CAAGGGGGGG ACATAACCCT AGAAGGGTTA GATACCCTTG AGGTCAACAA TAGTCAGATT TCGGCTTCAA CTCAGGCTGG ACGCGGTGGC AACTTAACCA TTAAGACGAC TAATTCTGTT CAGCTAAGTG GGCAAGGAAA GCTATCAGTG GAAGCAACAC AAGGCGGTAG AGCGGGGAAT TTAAGCCTAG AAACCGGGCA AATGAGCATT AGCGAGAGGG CAAACGTGAC CGTTAGCAGT CCCCAAGGAC AAGCCGGGAA TCTGACCATT AAGGCGAATA GCTTGTCATT AAATAACGGT TTTATAACGG CTGAAACCGG GCAAAGTGAG GGCGAAGAAG GCGCGAATAT TACTTTGAGG ATCTCAGACT TCATCACCCT GGAAAATGAA AGTCTCATCT CTGCTACGGC TAATGGGTCT GCCGATGGGG GTAACATTGA TATCGAAACC CCATTTTTAA TCGTTTTCCC AAGCAGTCCT AATGGTAGCG ATATCATTGC CAAAGCAGAA CAGGGTAGCG GGGGTCGTAT TGCCATTAAC AGCCAAGGAA TCTTCGGGAT TGAAGAAGGT ATAGCCACGC CAGGGAATCA ACGCAATGAC CTTGATGCTA GTTCAGAATC GGGTTCAACG GGAGAAATTC TGCTCAACCG CGAACTTGAC CCCAACCGAG GCTTAGTGGA ACTCCCTGAA ACCATTGTTG ATCCCAATAG TCTCATTGCT CAAAATGCTT GTCAACGGGG GACACAAAGC GAATTTTCTG TAACCGGACG CGGGGGATTA CCCCCTAGTC TCAATGAAGA CTTGAGCAGC GAGGCAACTC AAGTCGATTT AGTGCAACCT GCTCCGTTTA TGAAGTTAGA AGGCAGGAGG CAGGAGGCAG GAGGCAGGAG GCAAAATATT TCCCCACCCT CTACTTCTTC AGGACAGACC CCCGTAATTC CGGCTCAAGG CTGGATATTT AATGAGAAAG GAGAAATCGT CCTTGTTGCC TATGATCCTA CGATGACGGA TTCCCAACGT TTGCGAAAAC AGGGTAATGG CTGTCATCAG CAATCATGA
|
Protein sequence | MKQLTFLNSN NLKFRLTLIS GIILLSTGLT IDSLEAQTSP IIPDNTLPIN SQVTPNGNQL IINQGTSAGN NLFHSFEAFS LPTGYEAFFN NSLAIQNIFS RVTGTSISNI DGIIRANGIA NLFFLNPNGI IFGSNAQLDI GGSFFGSTAN SILFEDNSEF SATNPNNSSL LSISMPIGLQ FRDNSAQIQV QGAGHQLQLV DPLIFFSVQQ PNNPNLGLQV QDGKTLALVG GLVNLEGGII TAPEGNIELG GVKSGLVRLN SSGIGWTLDY QQVDNFQNIA LTQKALVDTS GNSGGSIQVQ GSQIALLDGS VIWIENRGSQ PSGNIAIHSS QSVELKGLNG DRSTIFPSTI FTQAIGQGKG GDILISTEQL NMDEISRVDA LTLLGGARGG NLVLNARDSI NMNSGSPLID TSILSRTQFS GDAGNITVST NKLTITNGGT LISPSNIGTG NAGDVTVKAV EIEIIGVNPG NLVSSQISTP TRDGNAGNVE INTARLILRD GGLVNSATLA TGNAGSVRIN ASESVEISGT FPQSILASQI SSSALIIDEA LQQLFGIDPI PTGASGNVTV NTPQLTITNQ GLISVRNDGT GDAGNLEINA DSIVLDNQGG IAASTASGEG GNIFLNLGNN LRMNNSSFIT ATAGGGGNSG NITIDTPILI LQNGATISAL SILRQGGDIE LEGLQTLEIR DNSEISVSTE TGEAGSIGIN QNQTPVKTVD ITNGSRLAAQ ATQPQGEAGS ISVNTTNLAV NQGSSISAAN ISGRTGGDIN LLNLGRLEVN GAEISATTRD GKAGNLAINQ GQTPVKIIEL NAGSLTVEAT GMGESGNLTV NALSLNLQNN AEISASTNFG QGGDITLEGL DTLEVNNSQI SASTQAGRGG NLTIKTTNSV QLSGQGKLSV EATQGGRAGN LSLETGQMSI SERANVTVSS PQGQAGNLTI KANSLSLNNG FITAETGQSE GEEGANITLR ISDFITLENE SLISATANGS ADGGNIDIET PFLIVFPSSP NGSDIIAKAE QGSGGRIAIN SQGIFGIEEG IATPGNQRND LDASSESGST GEILLNRELD PNRGLVELPE TIVDPNSLIA QNACQRGTQS EFSVTGRGGL PPSLNEDLSS EATQVDLVQP APFMKLEGRR QEAGGRRQNI SPPSTSSGQT PVIPAQGWIF NEKGEIVLVA YDPTMTDSQR LRKQGNGCHQ QS
|
| |