Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_1065 |
Symbol | |
ID | 7111653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | + |
Start bp | 1164434 |
End bp | 1169386 |
Gene Length | 4953 bp |
Protein Length | 1650 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643479335 |
Product | filamentous hemagglutinin family outer membrane protein |
Protein accession | YP_002376387 |
Protein GI | 218438058 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01901] filamentous haemagglutinin family N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.195318 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATTA CTTTAACTGG GTTACTTGCT GTTGGTGTTG GATTAATGCT AAATCAACCG ATGCAAGGAC AATCTATTAC TCCCGCTAAT GATGGCACAG GAACAATCAT TAATCGACAA GGGAATCAAT ATAATATTGA AGGGGGTACA TTATCAGAAG ATGGAAGAAA TTTATTTCAT AGTCTTGAGA AATTTGGGTT AACTCAAGCA GAAATTGCTA ACTTTCTCTC TAATCCTCAA ATCCGCAATA TTTTAACCCG AATAGTCGGG GGAAATCCCT CAGTGATTAA TGGGTTAATT CAGGTTCTTA ATGGGAATTC TAATCTATAC ATTATGAATC CCGCCGGTAT AATATTCGGC CCGAATGCGC GAATCAATGT ACCGGCTGAT TTTGTAGTAA CAACAGCGAC TCGAATCGGG TTTGATAATA ATTTATGGTT TAATGCTTTC GAGAGTAATG ACTATGCTCA ATTAATGGGA AATCCTTCTC AATTTGCTTT TGATTTAGCT AAATCCGGCA CGATTACTAA TTTGGGTAAT TTAACGGTGA CAGAAGGACA TCATTTAATG TTGTTGGGAA AAACTGTCAA TAATAGTGGC ACATTAAACG CGCCAGAAGG AAATATTACT ATTGCTGCTG TTGCCGGTAC TAACCGAATT AAAATATCTC AAGAAAATAA TTTAGTGAGT TTAGAAATTG AACGACCTGA CACCAATGTC AACTCAAATC TTTCTGCTTT AGATTTACCC ACTTTATTGA CAGAAGGAAA AGTTAGTAAT TCTGCCTCTA TTAGTCCTTT ACAACCGGGG ACGGTGATTA TTTCTGGGAA TTTATCAGTC TCTAATGCTG CCAATTCCCC TCTTCAACAA GGAGGACAAG TCAATATAAT AGGGGATCAA ATTAGCTTAA TTAATGCTAA TATTAACGCG ACTGGAACTC GTGGGGGAGG AATTGTTCAA ATAGGAGCAA ATAGTCAAGG TCAGGATATA ACAACCACTG CTTCCCGTAT TGCTATTAAT GGAAATTCAT CTATTGACGT GAGTGCTATT GATACAGGAG AAGGGGGTCA AGTTATTATT TCTGGGAACA CTTACACAAA TGTTGTTGGT CAAATTCGTT CTCGTGGTGG TCAAGAAGAG GGGAATGGAG GATATATTAA AATTGTTGGT GTAAATGGGT TAGATTTTCA AGGAACTGTA GATACTTCTG CTTTGCAAGG GGATTTTGGG GTTTTAGTTC TCAATTCAAA TAATTTAACT ATAGAAAATT CTCAGGGATT TTCCTCTCCT AAAATTAGAC CGTCTTCTCT TAATACGTTG TCTAGTAATC TGATTTTAGA AGCTTCTAAT AAAATCACGT TTAATGATCC GATTAATCTG CAAAATAGTC CATCAGGCTT GACAGTAATT GCTGATAAAA CTATTCAAGT TAATGATAAT ATTAATACTC AAAAGGGTTC AATTATTTTA CAAGCAAATG AAACTATTAA TATTAATAAT AGTCAGGTAA CAACTCAAGA TGGCTCAATT ATTTTAGAAC CTATTTTACC CGATTTAAAT TCTCTACTTA TTTCTCTTGA TAATAGTTTT ATTGAAACGG TTAATCCCAC AGAAAATATT CGTTTAACGG CTCAAGAAAT CGATCTTAAT GGGTCAACCG AAATTCTAGG ACAAAATCAG CTTGAAATCA AACCCCCAAC AGATGATAAA AATATTATTC TGGGAGGAGA TAATAATAAT CCATCAGCTT TAAATTTGAC GGGATCAGAA CTAGAAAAAA TTAATGGTTT TGAGTCGGTT ATCATTGGCA ATAACAACGG ACTTGGAAGG ATTGAGATAG GCAATGATAC TACTGAAGTC AATTTAAATA ATCAAGATTA TAATTTAATT TTGCAAGGTA AAGATATTAA ATTTAATAAT TCTTTGATAT TAGCAGATGA TAAAACTTTA ACCTTAAATA CTCAAACAGT CACTAGCGCG AATTCCCCCC CCTTCAGAGA TATCAAAATT GAGGGAGAAC AAGGTCAATT AATCCTCAAT ACCTCTGGTT CGGTAGGAAC TTTAGATAAT CCTTTAGATA CAGAGGTAAG TTACCTGACA ATTTCCTCTA CTTTAGGGGA TAATTTTATT TATAATCATT CAGATATAAG CTTAGGAAAA ATTTCGATTG CTGATAATTT AGATCTTAAT GTTGATGGGA ATATTAGAGA TAATGATAGG GTAGTGATAG GAGGAATTCT TACTCTCAGG GGACAAGATA TTATTTTAGA TAATAATAAT GACTTGAATA CAGTAGCAGT AACAGAAGCA GAAAATCTGA CCCTGAATGA TACTAATGAA TTAGACCTAG CTAATCTGCA AATTAACAAT GATTTAAACC TAACTACCAA TGGAGATATT ACTCATAGTG GGAGTATTCA AGTCTTAGGA ATAACTAATT TAAATACCAA TGGAAACGAT ATAATTTTAA ATAGTTCCAA TACTGATTTA AATATTGTTA ATATTTTTAA CAGTAAAAAT GTTTTTTTAA AAGATATTAA TGATATTATT TTAAATAACC CTCTAACAGC GAACAGTCTG ACTATTGAGG CCACGGGAAA TATTACAACG AATGATATTA ATACAAGTTC TACTCTAATT TCTGGAGGGA ATGTTAATTT AATCTCTGAT AATGGAAAGA TAACTACAGG CGGAATTAAT ACCAGTAGTC CTATAAATAA TGCAGGTAAT GTGACGTTAA AAGCTCAGGG AGATATAGAA ACTGCCTATA TTAATGCTCA AGGAATAAGC AATACTATAG GAGGTGACGT TGCAATTATT ACCAACAATT TTATCAGAAT AACCGACTTT TTCATTGACA GTAATAATAC ATTAGCTAGT ATCTCAACTG TGGGGGAAAA TGGAGGAGGA AATATTATTA TCCAACATGG AGGACAAGGA ATTACCCAGT TTAAAATAGG TGATCCTACG GTTAATGGAA CATCACAAGC AATTACGACG GGTAGTAGCA CAATTAATAA TGAATCTTTG TCATACACAA CAACCAGAGG AAATATTCAA CTGATTTCTG TTAATGAACT CTCTCCAGAA CCTTCCCCAG AACCTTCTCC AAGTCCGAAT CCTAACCCGA ATCCGATTAT TAATATCAAT CCTGAACCTA ATCCTAACCC GAATCCGATT ACTAATATTA ATCCTGAACC TAATCCTACC CCCAATCCGA ATACTAATAT TAATCCTGAA CCTAATCCTA ACCCCAATCC GAATACTAAT ATTAATCCTG AACCTAATCC TAACCCCAAT CCGATTACTA ATATCAATCC TGACCCGAAT CCGAATCCGA TTACTCATAT TAATCCTGAA CCTAATCCTA ACCCGAATCC GATTACTAAT ATCAATTTCA ACCCAAATCC GAATCCCATT ACTAATACTA ATTCCGACAT TCCTGTAGAT ACAAAAATTG ACCCTATTTT TACCAATAGC ATCGAAAATT TATCAACCAT TGAGGAAATT GATCCCATAA GACAAAGCCT ATTTGTAGAT ACAAATCAAC CCAGTTTGAG CTTGCAATCC AACAGGGACA TTGTAGGAAT AGTTCCTACA GGGTTTCTTT CTATTTTTGG AGGGACAGCA ACGGCTGAAG TCGGAAATGT AGAAAGTAAT TTTACTGATG CTTTTGAAAA TTATTTAGGA ATTAACAAAG CGTCTACCTT AAGCTTAAAA GAAAAACAAA TTATTCTCAA TAATATTGAA GAAAAGTCAG GTTATAAATC AGCATTTGTT TATATATATT TTAAACCCAA CTCTACAACA ACAGCCCAGG AAAAACAGGA ACAACAAGAA ATCTTATGGA GGTATCGCGG AGGTAAACCA TCTTCTCAAA AAAATCTAGC TTCTGAAGGT CAAGGAACAG ATCAATTAGA AATAATTGTT ATCACCTCTA CCGGAGATGT AATTCAAGAA AAAATAGAAG GAATCACACG AGATCAGGTC ATGTCGGTCG TGAAAAAATT TCAACAGACA GTGACAAATC CTCGAAGACC TACTGCTTAT CTCCAGCCAT CTCAACAACT TTATCAATGG TTTATTGCTT CTATTGAACA GGATTTACAA GCGCAAAATA TTGATACTCT TGTCTTTATT CTTGATGCTG GTTTACGGTC AGTTCCAATG GCGGCTCTTA ATGATGGGCA ACAATTTTTG ATTGAAAAAT ATAGTATGGG TTTGATGCCA AGTGTTGCCT TAACAGATAC CCGTTATGTT AATGTTCAAA ATTCCCAAGT TTTAGCAATG GGTGCATCTA CATTTAGGGA TCAAAATCCC TTACCATCAG TTCCTATTGA ATTATCACTC ATTACCGATC AACTTTGGCA AGGACGATCT TATATTAATG AAAGTTTTAC CCTTAATAAT CTTAAAAAAG CACATCAATC TGAGCAATTT AGAATCGTCC ATTTAGCGAC TCATGCCAAT TTTGAGGGAG GGACTCTTGC TAATTCTTAT ATTCAATTAT GGGATAATAA ATTAACAATG GAGCAGTTAA AAGGATTAGG TTTAGATCGG CCTCCCATAG ATTTATTAGT GTTAAGTGCT TGTCGAACTG CTCTAGGTAA TAGAGAGTCA GAATTAGGCT TTGCCGGAGC GGCTGTATTA GCGAATGTCA AGACGGTTTT AGGGAGTCTT TGGGAAGTGA GTGATGAAGG AACTTTAGCA TTAATGACCA GTTTTTATCA GCAACTTAGA GGTGTTCCTC TCAAAGCTGA AGCACTGCGA AAAGCTCAGT TAAGTCTTCT TCGAGGTGAT GTAAAAATGA TTCAGGAAAA CTCTAAACAA GCGGAACTTT TAATCGAAAA TCAACGTTTA CCCTTACCTC CTCAATTAGC AGAATTAGGC GCTAAAGAAT TTACTCATCC TTATTATTGG AGTAGTTTTA CAGTGATTGG TAATCCTTGG TAA
|
Protein sequence | MRITLTGLLA VGVGLMLNQP MQGQSITPAN DGTGTIINRQ GNQYNIEGGT LSEDGRNLFH SLEKFGLTQA EIANFLSNPQ IRNILTRIVG GNPSVINGLI QVLNGNSNLY IMNPAGIIFG PNARINVPAD FVVTTATRIG FDNNLWFNAF ESNDYAQLMG NPSQFAFDLA KSGTITNLGN LTVTEGHHLM LLGKTVNNSG TLNAPEGNIT IAAVAGTNRI KISQENNLVS LEIERPDTNV NSNLSALDLP TLLTEGKVSN SASISPLQPG TVIISGNLSV SNAANSPLQQ GGQVNIIGDQ ISLINANINA TGTRGGGIVQ IGANSQGQDI TTTASRIAIN GNSSIDVSAI DTGEGGQVII SGNTYTNVVG QIRSRGGQEE GNGGYIKIVG VNGLDFQGTV DTSALQGDFG VLVLNSNNLT IENSQGFSSP KIRPSSLNTL SSNLILEASN KITFNDPINL QNSPSGLTVI ADKTIQVNDN INTQKGSIIL QANETININN SQVTTQDGSI ILEPILPDLN SLLISLDNSF IETVNPTENI RLTAQEIDLN GSTEILGQNQ LEIKPPTDDK NIILGGDNNN PSALNLTGSE LEKINGFESV IIGNNNGLGR IEIGNDTTEV NLNNQDYNLI LQGKDIKFNN SLILADDKTL TLNTQTVTSA NSPPFRDIKI EGEQGQLILN TSGSVGTLDN PLDTEVSYLT ISSTLGDNFI YNHSDISLGK ISIADNLDLN VDGNIRDNDR VVIGGILTLR GQDIILDNNN DLNTVAVTEA ENLTLNDTNE LDLANLQINN DLNLTTNGDI THSGSIQVLG ITNLNTNGND IILNSSNTDL NIVNIFNSKN VFLKDINDII LNNPLTANSL TIEATGNITT NDINTSSTLI SGGNVNLISD NGKITTGGIN TSSPINNAGN VTLKAQGDIE TAYINAQGIS NTIGGDVAII TNNFIRITDF FIDSNNTLAS ISTVGENGGG NIIIQHGGQG ITQFKIGDPT VNGTSQAITT GSSTINNESL SYTTTRGNIQ LISVNELSPE PSPEPSPSPN PNPNPIININ PEPNPNPNPI TNINPEPNPT PNPNTNINPE PNPNPNPNTN INPEPNPNPN PITNINPDPN PNPITHINPE PNPNPNPITN INFNPNPNPI TNTNSDIPVD TKIDPIFTNS IENLSTIEEI DPIRQSLFVD TNQPSLSLQS NRDIVGIVPT GFLSIFGGTA TAEVGNVESN FTDAFENYLG INKASTLSLK EKQIILNNIE EKSGYKSAFV YIYFKPNSTT TAQEKQEQQE ILWRYRGGKP SSQKNLASEG QGTDQLEIIV ITSTGDVIQE KIEGITRDQV MSVVKKFQQT VTNPRRPTAY LQPSQQLYQW FIASIEQDLQ AQNIDTLVFI LDAGLRSVPM AALNDGQQFL IEKYSMGLMP SVALTDTRYV NVQNSQVLAM GASTFRDQNP LPSVPIELSL ITDQLWQGRS YINESFTLNN LKKAHQSEQF RIVHLATHAN FEGGTLANSY IQLWDNKLTM EQLKGLGLDR PPIDLLVLSA CRTALGNRES ELGFAGAAVL ANVKTVLGSL WEVSDEGTLA LMTSFYQQLR GVPLKAEALR KAQLSLLRGD VKMIQENSKQ AELLIENQRL PLPPQLAELG AKEFTHPYYW SSFTVIGNPW
|
| |