Gene PCC7424_1065 details

Gene Information

Locus tag: PCC7424_1065
Symbol:
ID: 7111653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 1164434
End bp: 1169386
Gene Length: 4953 bp
Protein Length: 1650 aa
Translation table: 11
GC content: 35%
IMG OID: 643479335
Product: filamentous hemagglutinin family outer membrane protein
Protein accession: YP_002376387
Protein GI: 218438058
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
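
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed numbers) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS of this length, excluding the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1164434, 1169386)
print(length, protein_length_aa(length))  # 4953 1650
```

The result agrees with the Gene Length (4953 bp) and Protein Length (1650 aa) fields.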


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.195318
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGATTA CTTTAACTGG GTTACTTGCT GTTGGTGTTG GATTAATGCT AAATCAACCG 
ATGCAAGGAC AATCTATTAC TCCCGCTAAT GATGGCACAG GAACAATCAT TAATCGACAA
GGGAATCAAT ATAATATTGA AGGGGGTACA TTATCAGAAG ATGGAAGAAA TTTATTTCAT
AGTCTTGAGA AATTTGGGTT AACTCAAGCA GAAATTGCTA ACTTTCTCTC TAATCCTCAA
ATCCGCAATA TTTTAACCCG AATAGTCGGG GGAAATCCCT CAGTGATTAA TGGGTTAATT
CAGGTTCTTA ATGGGAATTC TAATCTATAC ATTATGAATC CCGCCGGTAT AATATTCGGC
CCGAATGCGC GAATCAATGT ACCGGCTGAT TTTGTAGTAA CAACAGCGAC TCGAATCGGG
TTTGATAATA ATTTATGGTT TAATGCTTTC GAGAGTAATG ACTATGCTCA ATTAATGGGA
AATCCTTCTC AATTTGCTTT TGATTTAGCT AAATCCGGCA CGATTACTAA TTTGGGTAAT
TTAACGGTGA CAGAAGGACA TCATTTAATG TTGTTGGGAA AAACTGTCAA TAATAGTGGC
ACATTAAACG CGCCAGAAGG AAATATTACT ATTGCTGCTG TTGCCGGTAC TAACCGAATT
AAAATATCTC AAGAAAATAA TTTAGTGAGT TTAGAAATTG AACGACCTGA CACCAATGTC
AACTCAAATC TTTCTGCTTT AGATTTACCC ACTTTATTGA CAGAAGGAAA AGTTAGTAAT
TCTGCCTCTA TTAGTCCTTT ACAACCGGGG ACGGTGATTA TTTCTGGGAA TTTATCAGTC
TCTAATGCTG CCAATTCCCC TCTTCAACAA GGAGGACAAG TCAATATAAT AGGGGATCAA
ATTAGCTTAA TTAATGCTAA TATTAACGCG ACTGGAACTC GTGGGGGAGG AATTGTTCAA
ATAGGAGCAA ATAGTCAAGG TCAGGATATA ACAACCACTG CTTCCCGTAT TGCTATTAAT
GGAAATTCAT CTATTGACGT GAGTGCTATT GATACAGGAG AAGGGGGTCA AGTTATTATT
TCTGGGAACA CTTACACAAA TGTTGTTGGT CAAATTCGTT CTCGTGGTGG TCAAGAAGAG
GGGAATGGAG GATATATTAA AATTGTTGGT GTAAATGGGT TAGATTTTCA AGGAACTGTA
GATACTTCTG CTTTGCAAGG GGATTTTGGG GTTTTAGTTC TCAATTCAAA TAATTTAACT
ATAGAAAATT CTCAGGGATT TTCCTCTCCT AAAATTAGAC CGTCTTCTCT TAATACGTTG
TCTAGTAATC TGATTTTAGA AGCTTCTAAT AAAATCACGT TTAATGATCC GATTAATCTG
CAAAATAGTC CATCAGGCTT GACAGTAATT GCTGATAAAA CTATTCAAGT TAATGATAAT
ATTAATACTC AAAAGGGTTC AATTATTTTA CAAGCAAATG AAACTATTAA TATTAATAAT
AGTCAGGTAA CAACTCAAGA TGGCTCAATT ATTTTAGAAC CTATTTTACC CGATTTAAAT
TCTCTACTTA TTTCTCTTGA TAATAGTTTT ATTGAAACGG TTAATCCCAC AGAAAATATT
CGTTTAACGG CTCAAGAAAT CGATCTTAAT GGGTCAACCG AAATTCTAGG ACAAAATCAG
CTTGAAATCA AACCCCCAAC AGATGATAAA AATATTATTC TGGGAGGAGA TAATAATAAT
CCATCAGCTT TAAATTTGAC GGGATCAGAA CTAGAAAAAA TTAATGGTTT TGAGTCGGTT
ATCATTGGCA ATAACAACGG ACTTGGAAGG ATTGAGATAG GCAATGATAC TACTGAAGTC
AATTTAAATA ATCAAGATTA TAATTTAATT TTGCAAGGTA AAGATATTAA ATTTAATAAT
TCTTTGATAT TAGCAGATGA TAAAACTTTA ACCTTAAATA CTCAAACAGT CACTAGCGCG
AATTCCCCCC CCTTCAGAGA TATCAAAATT GAGGGAGAAC AAGGTCAATT AATCCTCAAT
ACCTCTGGTT CGGTAGGAAC TTTAGATAAT CCTTTAGATA CAGAGGTAAG TTACCTGACA
ATTTCCTCTA CTTTAGGGGA TAATTTTATT TATAATCATT CAGATATAAG CTTAGGAAAA
ATTTCGATTG CTGATAATTT AGATCTTAAT GTTGATGGGA ATATTAGAGA TAATGATAGG
GTAGTGATAG GAGGAATTCT TACTCTCAGG GGACAAGATA TTATTTTAGA TAATAATAAT
GACTTGAATA CAGTAGCAGT AACAGAAGCA GAAAATCTGA CCCTGAATGA TACTAATGAA
TTAGACCTAG CTAATCTGCA AATTAACAAT GATTTAAACC TAACTACCAA TGGAGATATT
ACTCATAGTG GGAGTATTCA AGTCTTAGGA ATAACTAATT TAAATACCAA TGGAAACGAT
ATAATTTTAA ATAGTTCCAA TACTGATTTA AATATTGTTA ATATTTTTAA CAGTAAAAAT
GTTTTTTTAA AAGATATTAA TGATATTATT TTAAATAACC CTCTAACAGC GAACAGTCTG
ACTATTGAGG CCACGGGAAA TATTACAACG AATGATATTA ATACAAGTTC TACTCTAATT
TCTGGAGGGA ATGTTAATTT AATCTCTGAT AATGGAAAGA TAACTACAGG CGGAATTAAT
ACCAGTAGTC CTATAAATAA TGCAGGTAAT GTGACGTTAA AAGCTCAGGG AGATATAGAA
ACTGCCTATA TTAATGCTCA AGGAATAAGC AATACTATAG GAGGTGACGT TGCAATTATT
ACCAACAATT TTATCAGAAT AACCGACTTT TTCATTGACA GTAATAATAC ATTAGCTAGT
ATCTCAACTG TGGGGGAAAA TGGAGGAGGA AATATTATTA TCCAACATGG AGGACAAGGA
ATTACCCAGT TTAAAATAGG TGATCCTACG GTTAATGGAA CATCACAAGC AATTACGACG
GGTAGTAGCA CAATTAATAA TGAATCTTTG TCATACACAA CAACCAGAGG AAATATTCAA
CTGATTTCTG TTAATGAACT CTCTCCAGAA CCTTCCCCAG AACCTTCTCC AAGTCCGAAT
CCTAACCCGA ATCCGATTAT TAATATCAAT CCTGAACCTA ATCCTAACCC GAATCCGATT
ACTAATATTA ATCCTGAACC TAATCCTACC CCCAATCCGA ATACTAATAT TAATCCTGAA
CCTAATCCTA ACCCCAATCC GAATACTAAT ATTAATCCTG AACCTAATCC TAACCCCAAT
CCGATTACTA ATATCAATCC TGACCCGAAT CCGAATCCGA TTACTCATAT TAATCCTGAA
CCTAATCCTA ACCCGAATCC GATTACTAAT ATCAATTTCA ACCCAAATCC GAATCCCATT
ACTAATACTA ATTCCGACAT TCCTGTAGAT ACAAAAATTG ACCCTATTTT TACCAATAGC
ATCGAAAATT TATCAACCAT TGAGGAAATT GATCCCATAA GACAAAGCCT ATTTGTAGAT
ACAAATCAAC CCAGTTTGAG CTTGCAATCC AACAGGGACA TTGTAGGAAT AGTTCCTACA
GGGTTTCTTT CTATTTTTGG AGGGACAGCA ACGGCTGAAG TCGGAAATGT AGAAAGTAAT
TTTACTGATG CTTTTGAAAA TTATTTAGGA ATTAACAAAG CGTCTACCTT AAGCTTAAAA
GAAAAACAAA TTATTCTCAA TAATATTGAA GAAAAGTCAG GTTATAAATC AGCATTTGTT
TATATATATT TTAAACCCAA CTCTACAACA ACAGCCCAGG AAAAACAGGA ACAACAAGAA
ATCTTATGGA GGTATCGCGG AGGTAAACCA TCTTCTCAAA AAAATCTAGC TTCTGAAGGT
CAAGGAACAG ATCAATTAGA AATAATTGTT ATCACCTCTA CCGGAGATGT AATTCAAGAA
AAAATAGAAG GAATCACACG AGATCAGGTC ATGTCGGTCG TGAAAAAATT TCAACAGACA
GTGACAAATC CTCGAAGACC TACTGCTTAT CTCCAGCCAT CTCAACAACT TTATCAATGG
TTTATTGCTT CTATTGAACA GGATTTACAA GCGCAAAATA TTGATACTCT TGTCTTTATT
CTTGATGCTG GTTTACGGTC AGTTCCAATG GCGGCTCTTA ATGATGGGCA ACAATTTTTG
ATTGAAAAAT ATAGTATGGG TTTGATGCCA AGTGTTGCCT TAACAGATAC CCGTTATGTT
AATGTTCAAA ATTCCCAAGT TTTAGCAATG GGTGCATCTA CATTTAGGGA TCAAAATCCC
TTACCATCAG TTCCTATTGA ATTATCACTC ATTACCGATC AACTTTGGCA AGGACGATCT
TATATTAATG AAAGTTTTAC CCTTAATAAT CTTAAAAAAG CACATCAATC TGAGCAATTT
AGAATCGTCC ATTTAGCGAC TCATGCCAAT TTTGAGGGAG GGACTCTTGC TAATTCTTAT
ATTCAATTAT GGGATAATAA ATTAACAATG GAGCAGTTAA AAGGATTAGG TTTAGATCGG
CCTCCCATAG ATTTATTAGT GTTAAGTGCT TGTCGAACTG CTCTAGGTAA TAGAGAGTCA
GAATTAGGCT TTGCCGGAGC GGCTGTATTA GCGAATGTCA AGACGGTTTT AGGGAGTCTT
TGGGAAGTGA GTGATGAAGG AACTTTAGCA TTAATGACCA GTTTTTATCA GCAACTTAGA
GGTGTTCCTC TCAAAGCTGA AGCACTGCGA AAAGCTCAGT TAAGTCTTCT TCGAGGTGAT
GTAAAAATGA TTCAGGAAAA CTCTAAACAA GCGGAACTTT TAATCGAAAA TCAACGTTTA
CCCTTACCTC CTCAATTAGC AGAATTAGGC GCTAAAGAAT TTACTCATCC TTATTATTGG
AGTAGTTTTA CAGTGATTGG TAATCCTTGG TAA
 
Protein sequence
MRITLTGLLA VGVGLMLNQP MQGQSITPAN DGTGTIINRQ GNQYNIEGGT LSEDGRNLFH 
SLEKFGLTQA EIANFLSNPQ IRNILTRIVG GNPSVINGLI QVLNGNSNLY IMNPAGIIFG
PNARINVPAD FVVTTATRIG FDNNLWFNAF ESNDYAQLMG NPSQFAFDLA KSGTITNLGN
LTVTEGHHLM LLGKTVNNSG TLNAPEGNIT IAAVAGTNRI KISQENNLVS LEIERPDTNV
NSNLSALDLP TLLTEGKVSN SASISPLQPG TVIISGNLSV SNAANSPLQQ GGQVNIIGDQ
ISLINANINA TGTRGGGIVQ IGANSQGQDI TTTASRIAIN GNSSIDVSAI DTGEGGQVII
SGNTYTNVVG QIRSRGGQEE GNGGYIKIVG VNGLDFQGTV DTSALQGDFG VLVLNSNNLT
IENSQGFSSP KIRPSSLNTL SSNLILEASN KITFNDPINL QNSPSGLTVI ADKTIQVNDN
INTQKGSIIL QANETININN SQVTTQDGSI ILEPILPDLN SLLISLDNSF IETVNPTENI
RLTAQEIDLN GSTEILGQNQ LEIKPPTDDK NIILGGDNNN PSALNLTGSE LEKINGFESV
IIGNNNGLGR IEIGNDTTEV NLNNQDYNLI LQGKDIKFNN SLILADDKTL TLNTQTVTSA
NSPPFRDIKI EGEQGQLILN TSGSVGTLDN PLDTEVSYLT ISSTLGDNFI YNHSDISLGK
ISIADNLDLN VDGNIRDNDR VVIGGILTLR GQDIILDNNN DLNTVAVTEA ENLTLNDTNE
LDLANLQINN DLNLTTNGDI THSGSIQVLG ITNLNTNGND IILNSSNTDL NIVNIFNSKN
VFLKDINDII LNNPLTANSL TIEATGNITT NDINTSSTLI SGGNVNLISD NGKITTGGIN
TSSPINNAGN VTLKAQGDIE TAYINAQGIS NTIGGDVAII TNNFIRITDF FIDSNNTLAS
ISTVGENGGG NIIIQHGGQG ITQFKIGDPT VNGTSQAITT GSSTINNESL SYTTTRGNIQ
LISVNELSPE PSPEPSPSPN PNPNPIININ PEPNPNPNPI TNINPEPNPT PNPNTNINPE
PNPNPNPNTN INPEPNPNPN PITNINPDPN PNPITHINPE PNPNPNPITN INFNPNPNPI
TNTNSDIPVD TKIDPIFTNS IENLSTIEEI DPIRQSLFVD TNQPSLSLQS NRDIVGIVPT
GFLSIFGGTA TAEVGNVESN FTDAFENYLG INKASTLSLK EKQIILNNIE EKSGYKSAFV
YIYFKPNSTT TAQEKQEQQE ILWRYRGGKP SSQKNLASEG QGTDQLEIIV ITSTGDVIQE
KIEGITRDQV MSVVKKFQQT VTNPRRPTAY LQPSQQLYQW FIASIEQDLQ AQNIDTLVFI
LDAGLRSVPM AALNDGQQFL IEKYSMGLMP SVALTDTRYV NVQNSQVLAM GASTFRDQNP
LPSVPIELSL ITDQLWQGRS YINESFTLNN LKKAHQSEQF RIVHLATHAN FEGGTLANSY
IQLWDNKLTM EQLKGLGLDR PPIDLLVLSA CRTALGNRES ELGFAGAAVL ANVKTVLGSL
WEVSDEGTLA LMTSFYQQLR GVPLKAEALR KAQLSLLRGD VKMIQENSKQ AELLIENQRL
PLPPQLAELG AKEFTHPYYW SSFTVIGNPW
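
Both sequences above are printed in space-separated blocks of ten characters, so the blocks must be joined before any analysis. The following is a minimal sketch of joining the blocks, computing a GC fraction, and translating the coding sequence. All helper names are hypothetical. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons are permitted as starts, so the sketch does not model start-codon handling.

```python
# Hypothetical helpers for illustration; not part of the source record.
BASES = "TCAG"
# Amino acids in NCBI codon order: first base T, C, A, G, then second, then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an unambiguous DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed in this record.
first_row = "ATGAGGATTA CTTTAACTGG GTTACTTGCT GTTGGTGTTG GATTAATGCT AAATCAACCG"
dna = join_blocks(first_row)
print(translate(dna))  # MRITLTGLLAVGVGLMLNQP
```

The translated first row matches the first two blocks of the protein sequence (MRITLTGLLA VGVGLMLNQP). Applying `gc_fraction` to the full joined gene sequence gives a value that can be compared with the GC content field.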