Gene Cyan8802_0285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_0285 
Symbol 
ID8389589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp269826 
End bp273413 
Gene Length3588 bp 
Protein Length1195 aa 
Translation table11 
GC content42% 
IMG OID644978326 
Productfilamentous haemagglutinin family outer membrane protein 
Protein accessionYP_003136084 
Protein GI257058196 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3210] Large exoproteins involved in heme utilization or adhesion 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.579969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAAT TGACTTTTTT TAATTCAAAT AATCTTAAAT TTAGATTAAC TTTAATCAGT 
GGCATTATTC TATTATCGAC GGGATTAACT ATCGATTCTC TTGAAGCACA AACGTCTCCT
ATTATTCCTG ATAATACCTT ACCGATTAAT TCCCAAGTAA CGCCTAATGG CAATCAATTG
ATTATCAACC AAGGAACGTC AGCAGGAAAC AATCTTTTTC ATAGTTTTGA AGCCTTTTCA
CTGCCTACAG GTTACGAAGC TTTTTTTAAC AATTCTTTGG CGATTCAGAA CATTTTTAGT
CGAGTAACAG GGACTTCAAT TTCTAATATT GATGGAATCA TTCGAGCCAA TGGGATAGCC
AATTTATTTT TCTTAAATCC TAATGGAATT ATCTTTGGAT CTAATGCACA ATTAGACATT
GGCGGTTCTT TTTTTGGCAG TACAGCTAAT AGTATTCTCT TTGAGGATAA CAGTGAGTTT
AGTGCAACTA ACCCAAATAA TTCCTCATTA CTCTCTATAT CCATGCCTAT TGGCTTACAA
TTTAGAGACA ATTCCGCTCA AATTCAAGTC CAAGGAGCGG GACATCAATT ACAATTAGTT
GACCCCCTAA TTTTCTTTTC TGTTCAACAA CCTAATAACC CCAACCTAGG ACTCCAAGTT
CAAAACGGAA AAACCTTAGC CTTAGTTGGA GGATTGGTTA ATCTAGAAGG AGGGATTATT
ACAGCACCAG AGGGAAATAT TGAATTAGGT GGAGTAAAAT CTGGTTTAGT CCGACTTAAT
TCAAGTGGCA TAGGTTGGAC GTTAGACTAT CAACAAGTAG ACAATTTTCA AAACATAGCA
CTCACCCAAA AAGCCTTAGT AGATACCAGT GGCAACAGTG GGGGATCTAT CCAAGTTCAA
GGATCTCAAA TTGCCCTCCT AGATGGCTCC GTTATTTGGA TAGAAAATAG GGGATCTCAA
CCCAGTGGTA ATATAGCAAT TCATAGTTCC CAATCAGTTG AATTAAAAGG ATTAAATGGA
GATAGAAGCA CGATTTTTCC TAGCACAATT TTTACGCAAG CAATAGGACA AGGAAAAGGC
GGTGATATTC TTATTTCTAC CGAGCAATTA AATATGGATG AATTATCCCG TGTAGACGCT
TTAACCTTAT TAGGAGGAGC AAGAGGGGGT AATCTGGTAC TCAATGCCCG TGATTCTATC
AATATGAATT CAGGTTCCCC GTTGATTGAT ACCAGTATCC TGTCTCGAAC CCAATTTTCA
GGAGACGCAG GAAACATTAC CGTATCAACC AATAAGCTAA CCATCACCAA TGGAGGTACG
CTGATCTCTC CGTCCAATAT TGGAACTGGG AATGCAGGGG ATGTCACCGT CAAGGCTGTG
GAAATTGAGA TAATAGGAGT CAACCCTGGA AACTTAGTGT CTAGCCAAAT TTCTACCCCT
ACCCGTGATG GGAATGCTGG CAACGTAGAA ATTAATACCG CAAGGCTCAT TCTTCGGGAT
GGAGGCTTAG TTAATTCCGC TACCTTAGCC ACTGGCAATG CTGGAAGTGT TAGGATTAAT
GCCTCTGAAT CCGTCGAGAT TAGTGGTACA TTTCCTCAGT CTATTCTGGC CTCTCAAATC
AGTTCATCGG CTCTAATTAT TGATGAAGCA TTACAGCAAC TGTTCGGAAT TGACCCTATC
CCCACTGGTG CATCAGGGAA TGTAACCGTT AATACCCCTC AATTAACCAT TACGAATCAA
GGATTAATCA GTGTTCGTAA TGATGGAACA GGTGATGCAG GGAATTTAGA GATCAATGCT
GATTCGATTG TTCTCGATAA TCAAGGTGGT ATTGCTGCAT CGACTGCCTC TGGAGAAGGT
GGCAATATTT TCCTTAATCT AGGGAATAAT CTACGAATGA ACAATAGCAG TTTTATCACC
GCTACCGCAG GGGGAGGGGG TAATAGTGGC AACATAACCA TTGATACCCC CATCTTAATC
CTTCAGAACG GTGCAACTAT TTCTGCTTTA AGCATCTTAA GACAAGGGGG AGATATTGAG
TTAGAAGGTC TACAAACCCT AGAAATTAGA GATAACAGTG AAATTTCTGT TTCTACTGAA
ACGGGAGAAG CTGGCAGTAT TGGGATTAAT CAAAATCAAA CCCCAGTGAA AACAGTTGAC
ATCACCAACG GTAGCCGCTT AGCTGCCCAA GCCACTCAAC CCCAAGGGGA AGCGGGAAGT
ATCAGCGTTA ACACTACAAA TCTAGCGGTT AATCAAGGTT CCTCCATCTC TGCAGCTAAT
ATCTCAGGAA GAACTGGAGG CGATATTAAC CTCCTGAATT TAGGTCGTTT AGAAGTCAAT
GGGGCAGAAA TTTCAGCAAC TACTCGCGAT GGCAAGGCAG GTAACTTAGC GATTAATCAA
GGTCAAACTC CCGTCAAGAT AATCGAACTG AATGCAGGTA GTCTCACCGT TGAAGCTACC
GGAATGGGAG AATCTGGCAA CCTTACTGTT AACGCACTAA GCTTAAATTT ACAAAATAAC
GCTGAAATTT CAGCCTCAAC GAATTTCGGA CAAGGGGGGG ACATAACCCT AGAAGGGTTA
GATACCCTTG AGGTCAACAA TAGTCAGATT TCGGCTTCAA CTCAGGCTGG ACGCGGTGGC
AACTTAACCA TTAAGACGAC TAATTCTGTT CAGCTAAGTG GGCAAGGAAA GCTATCAGTG
GAAGCAACAC AAGGCGGTAG AGCGGGGAAT TTAAGCCTAG AAACCGGGCA AATGAGCATT
AGCGAGAGGG CAAACGTGAC CGTTAGCAGT CCCCAAGGAC AAGCCGGGAA TCTGACCATT
AAGGCGAATA GCTTGTCATT AAATAACGGT TTTATAACGG CTGAAACCGG GCAAAGTGAG
GGCGAAGAAG GCGCGAATAT TACTTTGAGG ATCTCAGACT TCATCACTCT CGAAAATGAA
AGTCTCATCT CTGCTACGGC TAATGGGTCT GCGGATGGGG GTAACATTGA TATCGAAACC
CCATTTTTAA TCGTTTTCCC AAGCAGTCCT AATGGTAGCG ATATCATTGC CAAAGCAGAA
CAAGGTAGTG GGGGTCGTAT TGCTATTAAC AGCCAAGGAA TCTTCGGGAT TGAAGAAGGT
AGAGCAACGC CAGGGAATCA ACGCAATGAC CTTGATGCGA GTTCAGAATC CGGCTCAACG
GGAGAAATTC TACTCAACCG CGAACTTGAC CCCAACCGAG GCTTAGTGGA ACTCCCTGAA
ACCATTGTTG ATCCCAATAG TCTTATTGCT CAAAATGCTT GTCAACGAGG GACACAAAGC
GAATTTTCTG TAACCGGACG CGGGGGATTA CCCCCTAGTC TCAATGAAGA CTTGAGCAGC
GAGGCAACTC AAGTCGATTT AGTACAACCT GCTCCGTTCA AGAAATCAGA AGTTAGAAGT
CAGAAGTTAG AAGTTGTTTC CAATTCTTCA GGACAGACCC CCGTAATTCC GGCTCAAGGC
TGGATATTTA ATGAGAAAGG AGAAATCGTC CTTGTTGCCT ATGATCCTAC GATGACGGAT
TCCCAACGTT TGCGAAAACA GGGTAATGGC TGTCATCAGC AATCATGA
 
Protein sequence
MKQLTFFNSN NLKFRLTLIS GIILLSTGLT IDSLEAQTSP IIPDNTLPIN SQVTPNGNQL 
IINQGTSAGN NLFHSFEAFS LPTGYEAFFN NSLAIQNIFS RVTGTSISNI DGIIRANGIA
NLFFLNPNGI IFGSNAQLDI GGSFFGSTAN SILFEDNSEF SATNPNNSSL LSISMPIGLQ
FRDNSAQIQV QGAGHQLQLV DPLIFFSVQQ PNNPNLGLQV QNGKTLALVG GLVNLEGGII
TAPEGNIELG GVKSGLVRLN SSGIGWTLDY QQVDNFQNIA LTQKALVDTS GNSGGSIQVQ
GSQIALLDGS VIWIENRGSQ PSGNIAIHSS QSVELKGLNG DRSTIFPSTI FTQAIGQGKG
GDILISTEQL NMDELSRVDA LTLLGGARGG NLVLNARDSI NMNSGSPLID TSILSRTQFS
GDAGNITVST NKLTITNGGT LISPSNIGTG NAGDVTVKAV EIEIIGVNPG NLVSSQISTP
TRDGNAGNVE INTARLILRD GGLVNSATLA TGNAGSVRIN ASESVEISGT FPQSILASQI
SSSALIIDEA LQQLFGIDPI PTGASGNVTV NTPQLTITNQ GLISVRNDGT GDAGNLEINA
DSIVLDNQGG IAASTASGEG GNIFLNLGNN LRMNNSSFIT ATAGGGGNSG NITIDTPILI
LQNGATISAL SILRQGGDIE LEGLQTLEIR DNSEISVSTE TGEAGSIGIN QNQTPVKTVD
ITNGSRLAAQ ATQPQGEAGS ISVNTTNLAV NQGSSISAAN ISGRTGGDIN LLNLGRLEVN
GAEISATTRD GKAGNLAINQ GQTPVKIIEL NAGSLTVEAT GMGESGNLTV NALSLNLQNN
AEISASTNFG QGGDITLEGL DTLEVNNSQI SASTQAGRGG NLTIKTTNSV QLSGQGKLSV
EATQGGRAGN LSLETGQMSI SERANVTVSS PQGQAGNLTI KANSLSLNNG FITAETGQSE
GEEGANITLR ISDFITLENE SLISATANGS ADGGNIDIET PFLIVFPSSP NGSDIIAKAE
QGSGGRIAIN SQGIFGIEEG RATPGNQRND LDASSESGST GEILLNRELD PNRGLVELPE
TIVDPNSLIA QNACQRGTQS EFSVTGRGGL PPSLNEDLSS EATQVDLVQP APFKKSEVRS
QKLEVVSNSS GQTPVIPAQG WIFNEKGEIV LVAYDPTMTD SQRLRKQGNG CHQQS