Gene PCC8801_0285 details


Gene Information

Locus tag: PCC8801_0285
Symbol:
ID: 7104017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 269512
End bp: 273120
Gene Length: 3609 bp
Protein Length: 1202 aa
Translation table: 11
GC content: 43%
IMG OID: 643473395
Product: filamentous hemagglutinin family outer membrane protein
Protein accession: YP_002370541
Protein GI: 218245170
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
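
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below, in Python, assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 3609 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a complete CDS: codons minus the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length_bp(269512, 273120)   # 3609
length_aa = protein_length_aa(length_bp)     # 1202
print(length_bp, length_aa)
```

Both results match the Gene Length and Protein Length fields listed above.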


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAT TGACTTTTTT GAATTCAAAT AATCTTAAAT TTAGATTAAC TTTAATCAGT 
GGCATTATTC TATTATCGAC GGGATTAACT ATCGATTCTC TTGAAGCACA AACGTCTCCT
ATTATTCCTG ATAATACCTT ACCGATTAAT TCCCAAGTAA CGCCTAATGG CAATCAATTG
ATTATCAACC AAGGAACGTC AGCAGGAAAC AATCTTTTTC ATAGTTTTGA AGCCTTTTCA
CTGCCTACAG GTTACGAAGC TTTTTTTAAC AATTCTTTGG CGATTCAGAA CATTTTTAGT
CGAGTAACAG GGACTTCAAT TTCTAATATT GATGGAATCA TTCGAGCCAA TGGGATAGCC
AATTTATTTT TCTTAAATCC TAATGGAATT ATCTTTGGAT CTAATGCACA ATTAGACATT
GGCGGTTCTT TTTTTGGCAG TACAGCTAAT AGTATTCTCT TTGAGGATAA CAGTGAGTTT
AGTGCAACTA ACCCAAATAA TTCCTCATTA CTCTCTATAT CCATGCCTAT TGGCTTACAA
TTTAGAGACA ATTCCGCTCA AATTCAAGTC CAAGGAGCGG GACATCAATT ACAATTAGTT
GATCCCCTAA TTTTCTTTTC TGTTCAACAA CCTAATAACC CCAACCTAGG ACTTCAAGTT
CAAGATGGAA AAACCTTAGC CTTAGTTGGA GGATTGGTTA ATCTAGAAGG AGGGATTATT
ACAGCACCAG AGGGAAATAT TGAATTAGGT GGAGTAAAAT CTGGTTTAGT CCGACTTAAT
TCAAGTGGCA TAGGTTGGAC GTTAGACTAT CAACAAGTAG ACAATTTTCA AAACATAGCA
CTCACCCAAA AAGCCTTAGT AGATACCAGT GGCAACAGTG GGGGATCTAT CCAAGTTCAA
GGATCTCAAA TTGCCCTCCT AGATGGCTCC GTTATTTGGA TAGAAAATAG GGGATCTCAA
CCCAGTGGTA ATATAGCAAT TCATAGTTCC CAATCAGTTG AATTAAAAGG ATTAAATGGA
GATAGAAGCA CGATTTTTCC TAGCACAATT TTTACGCAAG CAATAGGACA AGGAAAAGGC
GGTGATATTC TTATTTCTAC CGAGCAATTG AATATGGATG AAATATCCCG TGTAGACGCT
TTAACCTTAT TAGGAGGAGC AAGAGGGGGT AATCTGGTAC TCAATGCTCG TGATTCTATC
AATATGAATT CAGGTTCCCC GTTGATTGAT ACCAGTATCC TGTCTCGAAC CCAATTTTCA
GGAGACGCAG GAAACATTAC CGTATCAACC AATAAGCTAA CCATCACCAA TGGAGGTACG
CTGATCTCTC CGTCCAATAT TGGAACTGGG AATGCAGGGG ATGTCACCGT CAAGGCTGTG
GAAATTGAGA TAATAGGAGT CAACCCTGGA AACTTAGTGT CTAGCCAAAT TTCTACCCCT
ACCCGTGATG GGAATGCTGG CAACGTAGAA ATTAATACCG CAAGGCTCAT TCTTCGGGAT
GGAGGCTTAG TTAATTCCGC TACCTTAGCC ACTGGCAATG CTGGAAGTGT TAGGATTAAT
GCCTCCGAAT CCGTCGAGAT TAGTGGTACA TTTCCTCAGT CTATTCTGGC CTCTCAAATC
AGTTCATCGG CTCTAATTAT TGATGAAGCA TTACAGCAAC TGTTCGGAAT TGACCCTATC
CCCACTGGTG CATCAGGGAA TGTAACCGTT AATACCCCTC AATTAACCAT TACGAATCAA
GGATTAATCA GTGTTCGTAA TGATGGAACA GGTGATGCAG GGAATTTAGA GATCAATGCT
GATTCGATTG TTCTCGATAA TCAAGGTGGT ATTGCTGCAT CGACTGCCTC TGGAGAAGGT
GGCAATATTT TCCTTAATCT AGGGAATAAT CTACGAATGA ACAATAGCAG TTTTATCACC
GCTACCGCAG GGGGAGGGGG TAATAGTGGC AACATAACCA TTGATACCCC CATCTTAATC
CTTCAGAACG GTGCAACTAT TTCTGCTTTA AGCATCTTAA GACAAGGGGG AGATATTGAG
TTAGAAGGTC TACAAACCCT AGAAATTAGA GATAACAGTG AAATTTCTGT TTCTACTGAA
ACCGGAGAAG CTGGCAGTAT TGGGATTAAT CAAAATCAAA CCCCAGTGAA AACAGTTGAC
ATCACCAACG GTAGCCGCTT AGCTGCCCAA GCCACTCAAC CCCAAGGGGA AGCGGGAAGT
ATCAGCGTTA ACACTACAAA TCTAGCGGTT AATCAAGGTT CCTCCATCTC TGCAGCTAAT
ATCTCAGGAA GAACTGGAGG CGATATTAAC CTCCTGAATT TAGGTCGTTT AGAAGTCAAT
GGGGCAGAAA TTTCAGCAAC TACTCGCGAT GGCAAGGCAG GTAACTTAGC GATTAATCAA
GGTCAAACTC CCGTCAAGAT AATCGAACTG AATGCAGGTA GTCTCACCGT TGAAGCTACC
GGAATGGGAG AATCTGGCAA CCTTACTGTT AACGCACTAA GCTTAAATTT ACAAAATAAC
GCTGAAATTT CAGCCTCAAC GAATTTCGGA CAAGGGGGGG ACATAACCCT AGAAGGGTTA
GATACCCTTG AGGTCAACAA TAGTCAGATT TCGGCTTCAA CTCAGGCTGG ACGCGGTGGC
AACTTAACCA TTAAGACGAC TAATTCTGTT CAGCTAAGTG GGCAAGGAAA GCTATCAGTG
GAAGCAACAC AAGGCGGTAG AGCGGGGAAT TTAAGCCTAG AAACCGGGCA AATGAGCATT
AGCGAGAGGG CAAACGTGAC CGTTAGCAGT CCCCAAGGAC AAGCCGGGAA TCTGACCATT
AAGGCGAATA GCTTGTCATT AAATAACGGT TTTATAACGG CTGAAACCGG GCAAAGTGAG
GGCGAAGAAG GCGCGAATAT TACTTTGAGG ATCTCAGACT TCATCACCCT GGAAAATGAA
AGTCTCATCT CTGCTACGGC TAATGGGTCT GCCGATGGGG GTAACATTGA TATCGAAACC
CCATTTTTAA TCGTTTTCCC AAGCAGTCCT AATGGTAGCG ATATCATTGC CAAAGCAGAA
CAGGGTAGCG GGGGTCGTAT TGCCATTAAC AGCCAAGGAA TCTTCGGGAT TGAAGAAGGT
ATAGCCACGC CAGGGAATCA ACGCAATGAC CTTGATGCTA GTTCAGAATC GGGTTCAACG
GGAGAAATTC TGCTCAACCG CGAACTTGAC CCCAACCGAG GCTTAGTGGA ACTCCCTGAA
ACCATTGTTG ATCCCAATAG TCTCATTGCT CAAAATGCTT GTCAACGGGG GACACAAAGC
GAATTTTCTG TAACCGGACG CGGGGGATTA CCCCCTAGTC TCAATGAAGA CTTGAGCAGC
GAGGCAACTC AAGTCGATTT AGTGCAACCT GCTCCGTTTA TGAAGTTAGA AGGCAGGAGG
CAGGAGGCAG GAGGCAGGAG GCAAAATATT TCCCCACCCT CTACTTCTTC AGGACAGACC
CCCGTAATTC CGGCTCAAGG CTGGATATTT AATGAGAAAG GAGAAATCGT CCTTGTTGCC
TATGATCCTA CGATGACGGA TTCCCAACGT TTGCGAAAAC AGGGTAATGG CTGTCATCAG
CAATCATGA
 
Protein sequence
MKQLTFLNSN NLKFRLTLIS GIILLSTGLT IDSLEAQTSP IIPDNTLPIN SQVTPNGNQL 
IINQGTSAGN NLFHSFEAFS LPTGYEAFFN NSLAIQNIFS RVTGTSISNI DGIIRANGIA
NLFFLNPNGI IFGSNAQLDI GGSFFGSTAN SILFEDNSEF SATNPNNSSL LSISMPIGLQ
FRDNSAQIQV QGAGHQLQLV DPLIFFSVQQ PNNPNLGLQV QDGKTLALVG GLVNLEGGII
TAPEGNIELG GVKSGLVRLN SSGIGWTLDY QQVDNFQNIA LTQKALVDTS GNSGGSIQVQ
GSQIALLDGS VIWIENRGSQ PSGNIAIHSS QSVELKGLNG DRSTIFPSTI FTQAIGQGKG
GDILISTEQL NMDEISRVDA LTLLGGARGG NLVLNARDSI NMNSGSPLID TSILSRTQFS
GDAGNITVST NKLTITNGGT LISPSNIGTG NAGDVTVKAV EIEIIGVNPG NLVSSQISTP
TRDGNAGNVE INTARLILRD GGLVNSATLA TGNAGSVRIN ASESVEISGT FPQSILASQI
SSSALIIDEA LQQLFGIDPI PTGASGNVTV NTPQLTITNQ GLISVRNDGT GDAGNLEINA
DSIVLDNQGG IAASTASGEG GNIFLNLGNN LRMNNSSFIT ATAGGGGNSG NITIDTPILI
LQNGATISAL SILRQGGDIE LEGLQTLEIR DNSEISVSTE TGEAGSIGIN QNQTPVKTVD
ITNGSRLAAQ ATQPQGEAGS ISVNTTNLAV NQGSSISAAN ISGRTGGDIN LLNLGRLEVN
GAEISATTRD GKAGNLAINQ GQTPVKIIEL NAGSLTVEAT GMGESGNLTV NALSLNLQNN
AEISASTNFG QGGDITLEGL DTLEVNNSQI SASTQAGRGG NLTIKTTNSV QLSGQGKLSV
EATQGGRAGN LSLETGQMSI SERANVTVSS PQGQAGNLTI KANSLSLNNG FITAETGQSE
GEEGANITLR ISDFITLENE SLISATANGS ADGGNIDIET PFLIVFPSSP NGSDIIAKAE
QGSGGRIAIN SQGIFGIEEG IATPGNQRND LDASSESGST GEILLNRELD PNRGLVELPE
TIVDPNSLIA QNACQRGTQS EFSVTGRGGL PPSLNEDLSS EATQVDLVQP APFMKLEGRR
QEAGGRRQNI SPPSTSSGQT PVIPAQGWIF NEKGEIVLVA YDPTMTDSQR LRKQGNGCHQ
QS
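
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch in Python, the snippet below strips the 10-character grouping used in the sequence blocks and translates the first and last rows of the gene sequence. It assumes that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons). The helper names `clean_sequence` and `translate` are hypothetical and chosen for illustration.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, usable, 3))


# First row of the gene sequence (60 nt).
first_row = "ATGAAACAAT TGACTTTTTT GAATTCAAAT AATCTTAAAT TTAGATTAAC TTTAATCAGT"
print(translate(clean_sequence(first_row)))  # MKQLTFLNSNNLKFRLTLIS

# Last row of the gene sequence; it stays in frame because the 60 full
# rows before it hold 3600 nt, a multiple of 3.
last_row = "CAATCATGA"
print(translate(clean_sequence(last_row)))   # QS*
```

The first 20 residues and the final `QS` agree with the protein sequence above. The trailing `*` is the TGA stop codon, which is not written in the protein sequence.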