Gene Cyan8802_2674 details


Gene Information

Locus tag: Cyan8802_2674
Symbol:
ID: 8392000
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 2702848
End bp: 2705706
Gene Length: 2859 bp
Protein Length: 952 aa
Translation table: 11
GC content: 45%
IMG OID: 644980633
Product: filamentous haemagglutinin family outer membrane protein
Protein accession: YP_003138369
Protein GI: 257060481
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
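
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2702848
end_bp = 2705706

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop
# codon, which contributes no amino acid to the protein.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (2859 bp) and Protein Length (952 aa) fields.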


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGAGATA TTACCCTCAC GACTAAAATA CTTGAGACAA TACCCAACTA TCCTGTGATG 
AGTGCCAGTT TTTATTTAAC TCCCCATGAC CTCCTTGCAA CTTTTCTAAG TATCAGTGGT
TTAATGGTTT CTCCCATCGC TTCAGCACAA GTTGCTAGTG ATGGAACGGT ACAAACGCAA
GTTAACACCC TAGGGAATCA ATTAGAGATT ACAGAGGGAA CGCAAGCAGG AAGCAATCTA
TTTCATAGCT TCAGCCAATT TTCTGTTCCG AGTGGTTTTG AGGCTTATTT TAACAATTCT
TCGACCATTA GCAATATTAT CAGCCGTGTT ACAGGGGGTT CAATTTCCAA TATAGAGGGA
CTCATTCGGG CTAACGGGAC GGCTAATTTA TTTCTCATCA ACCCCAATGG TATTGTCTTT
GGTCCGAATG CGGTGGTTGA TATTGGGGGG TCTTTTTTAG CCACTACCGC CGAATCCATC
CAATTTGCTG ATGGCTCTCA ATTTAGTGCG ACAAACTCCC AATCTTCACC CATTTTAACG
ATAGCGGTTC CAGTGGGGTT GCAATTTGGT TCTAACCCTG GAACCATTGT TAATCGAGCT
CATCGAACAG TACCTGATCC CACTCCTGAT AACCCTAATA ATACTAGGCA AGTAGGATTT
GAGGTTAAAT CGGGTAATAC TATAGCTCTA GTAGGGGGAG ATTTGGATTT TCAGGGGGGA
AGGGTTAATG GTTCAGGAGG AAGGGTTGAA CTGGGCAGTG TGGGGGGTAA CAGTAGGGTT
CAATTAAGCG TGACTAACCC AGGATTGGAG ATTGACTATC AAGGAGTTAC CAACTTTCAA
GATATTTCCT TATCTCAACA ATCAATAGTA GATGTGAATA ATTCAACAAT TCAGGTACAG
GGCAGCAATA TTCGGCTGAC TAATGGCTCT CAAATTAGCA GCACAACAAG AACAACAGAA
GATGCTGGAG ATTTAACCGT TAATGCCACT GAATCTGTTG AACTGATTGG CGCTGCCCCT
GAGGGGTTTC CTTTTCCTAG TGCTTTTATT GCTCAAGTCA ATCCAGGGGG TACAGGTAGA
GGGGGTAATC TGATCATCAA TACGAAACAG TTAAGTATCC GTGAGGGTGC TGGTATATCA
GTAGCGAGTC AAGGTGAAGG AATAGGCGGT CGGCTAGAAG TTAATGTCTC AGAGGCGATT
GAGATTACGG GAACTGGTCG ACTATTTCCC AGTGTTCTCA CAAGCTCCAC AGATGGGACA
GGGGATGGGG GCGAAATAGT CATTAATACC AGACAATTAA CGCTGAGCAA TGGCGGACAA
ATACAGGCGT TTACGATTAG TCAAGGACGC GGGGGAACCA TCACTATTAA CGCTTCTGAT
TCCATCGAAG TGAGTGGTCG AGGAGCGTTA CCAGAGTTTA ATACAGAAAG TTTTAGTTCT
ATTACAGCAG AATCAGGTTT TCAACTCCTT GGATTTACAG GAATTGCCCC AGGAGGTAAC
GTTAACATTA ATACTAATCA ACTTATTGTC ACCGACGGCG GAACGATTTC TGCTGGCAGT
TTTGGACAAG GGAATGCGGG AAGCGTAGAT ATTACTTCAA ATTCGATCTT TTTAGACAAT
CAAGGGGTAA TTACGGCTTC TAGTGAAGGA AGCGGGGATG CAGGGAACAT CACCATCCTC
ACTGACCAAC TCTCCGTTAA TAATCAATCA GAAATATCCG TCAGAAACAT TGGTTTTGGT
CAGGGGGGGA ATTTAACCAT TAGTGCCGAT GCTATCTCCT TAAACCAAGA CAGTCAACTA
ACTGCTGTTA GTTTTCCCCT AGAGGATCTA ACGCTCCAAG AATTAGGTAT TAACGCCGAA
GAATTTGTTA ACCGTCCCAA TATTGGCAAC GCAGGGGATT TAATTCTTAA CACTTCTTCA
TTAAATCTTA ATAATGATTC CCAAGTGACC GTCAGCAGTT TTGGAACGGG AAATGCAGGA
AGTATGGGGA TTACAGCCCA AAATATCGCC CTAGACAACA GTAGCGAACT CGCTGCCGAA
ACAGCTTCAG GGGAAGGGGG TAACATCAGT CTTTATGTCT CAGACTTCCT GAACCTTCGT
CGCGCTAGTA CCATCTCCAC CACTGCAGGA ACCCTAGGTG GTGGTGGTAA TGGAGGTAAT
ATCTTCATTG ATGCTGAATT TATGGTTACT GTACCCACCG AAAACAGCGA CATTATTGCC
AATGCTTTTC TGGGTAATGG AGGAAACATT CGTATTAACG CCTCTGGAGT GTTTGGGATC
GAAGAACGGG AACGTCTGAC CCCGTTGAAT GACATTACCG CGAGTTCCCA ATTTGGACAG
GTAGGGAGTA TTGGTATTAA CCGACCCGAT GTTGATCCCC AACGCAGCTT AGTTAAATTG
CCGGGTGAAG TGGTAGACGC AAAAAACCTT GTGGTTCAAG CGTGTAGTCC TGGGGGAGCG
TATACGCGAG GCGAATTTAG CATAACGGGC TCAGGGGGTT TACCCGTTAA CCCTGATGAA
GGAATTCAAA CTTCCCCAGG CTTGACTGAA TTAGGCTATC CTGAGATAGA AATGTTTAAT
CAATCAGATA ATCAAGAATC TCTGAAAATC CCAAATGAGT CTTTAACCCC AGATTATAAG
CGTCAGAGTT CTCCTACAAC CATTGTGGAA GCTCAAGGCT GGATCATGGA TAAGAATGGT
AAAGTAGTGC TGACGGCTCA ATCACCTAAC GTTACTCCTC ACGGTTCTGG GTTTATGCCA
TCCAACTGTT ATGACCTTTC TAACCGTTCC CTTTCCTCGG CTACGTCTCC ATCCCCATCC
CTTTCTGATG TTCCTCAGCG TAATAGTCTG GCTGAGTAG
 
Protein sequence
MGDITLTTKI LETIPNYPVM SASFYLTPHD LLATFLSISG LMVSPIASAQ VASDGTVQTQ 
VNTLGNQLEI TEGTQAGSNL FHSFSQFSVP SGFEAYFNNS STISNIISRV TGGSISNIEG
LIRANGTANL FLINPNGIVF GPNAVVDIGG SFLATTAESI QFADGSQFSA TNSQSSPILT
IAVPVGLQFG SNPGTIVNRA HRTVPDPTPD NPNNTRQVGF EVKSGNTIAL VGGDLDFQGG
RVNGSGGRVE LGSVGGNSRV QLSVTNPGLE IDYQGVTNFQ DISLSQQSIV DVNNSTIQVQ
GSNIRLTNGS QISSTTRTTE DAGDLTVNAT ESVELIGAAP EGFPFPSAFI AQVNPGGTGR
GGNLIINTKQ LSIREGAGIS VASQGEGIGG RLEVNVSEAI EITGTGRLFP SVLTSSTDGT
GDGGEIVINT RQLTLSNGGQ IQAFTISQGR GGTITINASD SIEVSGRGAL PEFNTESFSS
ITAESGFQLL GFTGIAPGGN VNINTNQLIV TDGGTISAGS FGQGNAGSVD ITSNSIFLDN
QGVITASSEG SGDAGNITIL TDQLSVNNQS EISVRNIGFG QGGNLTISAD AISLNQDSQL
TAVSFPLEDL TLQELGINAE EFVNRPNIGN AGDLILNTSS LNLNNDSQVT VSSFGTGNAG
SMGITAQNIA LDNSSELAAE TASGEGGNIS LYVSDFLNLR RASTISTTAG TLGGGGNGGN
IFIDAEFMVT VPTENSDIIA NAFLGNGGNI RINASGVFGI EERERLTPLN DITASSQFGQ
VGSIGINRPD VDPQRSLVKL PGEVVDAKNL VVQACSPGGA YTRGEFSITG SGGLPVNPDE
GIQTSPGLTE LGYPEIEMFN QSDNQESLKI PNESLTPDYK RQSSPTTIVE AQGWIMDKNG
KVVLTAQSPN VTPHGSGFMP SNCYDLSNRS LSSATSPSPS LSDVPQRNSL AE
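
The gene and protein sequences above are related by translation table 11, the bacterial, archaeal and plant plastid code. The following is a minimal sketch of that translation, not the tool that produced this record. The helper names `clean` and `translate` and the `has_start` flag are hypothetical. The codon table is the standard one in NCBI's TCAG ordering, and the start-codon set is the one NCBI lists for table 11. A start codon at the first position is read as methionine, which is why the leading GTG corresponds to M in the protein.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI's TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(block.split()).upper()


def translate(dna: str, has_start: bool = True) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    If has_start is True, a recognised start codon at position 0 is read as M.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and has_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "GTGGGAGATA TTACCCTCAC GACTAAAATA CTTGAGACAA TACCCAACTA TCCTGTGATG"
last_line = "CTTTCTGATG TTCCTCAGCG TAATAGTCTG GCTGAGTAG"

print(translate(first_line))                  # start of the protein
print(translate(last_line, has_start=False))  # end of the protein, up to the stop
```

The last line can be translated on its own only because the 47 lines before it hold 60 bases each, so it begins on a codon boundary. For real work, an established library such as Biopython is a safer choice than a hand-rolled table.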