Gene PCC8801_3442 details


Gene Information

Locus tag: PCC8801_3442
Symbol:
ID: 7103132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3587579
End bp: 3590437
Gene Length: 2859 bp
Protein Length: 952 aa
Translation table: 11
GC content: 44%
IMG OID: 643476457
Product: filamentous hemagglutinin family outer membrane protein
Protein accession: YP_002373566
Protein GI: 218248195
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3210] Large exoproteins involved in heme utilization or adhesion
TIGRFAM ID: [TIGR01901] filamentous haemagglutinin family N-terminal domain
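
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon; neither assumption is stated in the record, but both are consistent with the listed values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3587579, 3590437)
print(length)                     # 2859
print(protein_length_aa(length))  # 952
```

Both results agree with the Gene Length and Protein Length fields of the record.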


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGAGATA TTACCCTCAC GACTAAAATA CTTGAGACAA TACCCAACTA TCCTGTGATG 
AGTGCCAGTT TTTATTTAAC TCCCCATGAC CTCCTTGCAA CTTTTCTAAG TATCAGTGGT
TTAATGGTTT CTCCCATCGC TTCAGCACAA GTTGCTAGTG ATGGAACGGT ACAAACGCAA
GTTAACACCC TAGGGAATCA ATTAGAGATT ACAGAGGGAA CGCAAGCAGG AAGCAATCTA
TTTCATAGCT TCAGCCAATT TTCTGTTCCG AGTGGTTTTG AGGCTTATTT TAACAATTCT
TCGACCATTA GCAATATTAT CAGCCGTGTT ACAGGGGGTT CAATTTCCAA TATAGAGGGA
CTCATTCGAG CTAACGGGAC GGCTAATTTA TTCCTGATTA ACCCCAATGG TATTGTCTTT
GGTCCGAATG CGGCGGTTGA TATTGGGGGG TCTTTTTTAG CCACTACCGC CGAATCCATC
CAATTTGCTG ATGGCTCTCA ATTTAGTGCG ACAAACTCCC AATCTTCACC CATTTTAACG
ATAGCGGTTC CAGTGGGGTT GCAATTTGGT TCTAACCCTG GAACCATTGT TAATCGAGCT
AATCGAACAG TACCTGATCC CACTCCTGAT AACCCTAATA ATACTAGGCA AGTAGGATTT
GAGGTTAAAT CGGGTAATAC TATAGCTCTA GTAGGGGGAG ATTTGGATTT TCAGGGGGGA
AGGGTTAATG GTTCAGGAGG AAGGGTTGAA CTGGGCAGTG TGGGGGGTAA CAGTAGGGTT
CAATTAAGCG TGACTAACCC AGGATTGGAG ATTGACTATC AAGGAGTTAC CAACTTTCAA
GATATTTCCT TATCTCAACA ATCAATAGTA GATGTGAATA ATTCAACAAT TCAGGTACAG
GGCAGCAATA TTCGGCTGAC TAATGGCTCT CAAATTAGCA GCACAACAAG AACAACAGAA
GATGCTGGAG ATTTAACCGT TAATGCCACT GAATCTGTTG AACTGATTGG CGCTGCCCCT
GAGGGGTTTC CTTTTCCTAG TGCTTTTATT GCTCAAGTCA ATCCAGGGGG TACAGGTAGA
GGGGGTAATC TGATCATCAA TACGAAACAG TTAAGTATCC GTGAGGGTGC TGGTATATCA
GTAGCGAGTC GAGGTAAAGG AATAGGCGGT CGGCTAGAAG TTAATGTCTC AGAGGCGATT
GAGATTACGG GAACTGGTCC ACAATCCCTT AGTGTTCTCA CAAGCTCCAC AGATGGGACA
GGGGATGGCG GCGAAATAGT CATTAATACG AGACAATTAA CGCTGAGCAA TGGCGGACAA
ATACAGGCGT TTACGATTAA TCAAGGACGC GGGGGAACTA TCACCATTAA CGCTTCTGAT
TCAATCGAAG TGAGTGGTCG AGGAGCGTTA CCAGAGTTTA ATACAGAAAG TTTTAGTTCT
ATTACAGCAG AATCAGGTTT TCAACTCCTT GGATTTACAG GAATTGCCCC AGGAGGCAAT
GTTAACATTA ATACTAATCA ACTTATTGTC ACCGACGGCG GAACGATTTC TGCTGGCAGT
TTTGGACAAG GGAATGCGGG AAGCGTAGAT ATTACTTCAA ATTCGATCTT TTTAGACAAT
CAAGGGGTAA TTACGGCTTC TAGTGAAGGA AGCGGGGATG CAGGGAACGT GAGTATCTAC
ACAGACCAAC TCTCCGTTAA TAATCAATCA GAAATATCCG TCAGAAACAT TGGTTTTGGT
CAGGGGGGGA ATTTAACGAT TAGTGCTGAT GCTATTTCGT TAAACCGAAA CAGTCAATTG
AGTGCCGTTA GTCTTCCCTT AGAGGATCTA ACGCTCCAAA AATTAAGTAT TAACGCCGAA
GAATTTGCCA ACCGTCCCAA TATTGGCAAC GCAGGGGATT TAATTCTTAA CACTTCTTCA
TTAAATCTTA ATAATGATTC CCAAGTGACC GTCAGCAGTT TTGGAACGGG AAATGCAGGA
AGTATGGGGA TTACAGCCCA AAATATCGCC CTAGATAACA GTAGCGAACT CGCTGCCGAA
ACAGCTTCAG GGGAAGGGGG TAACATCAGT CTTTATGTCT CAGACTTCCT GAACCTTCGT
CGCGCTAGTA CCATCTCCAC CACTGCAGGA ACCCTAGGTG GTGGTGGTAA TGGAGGTAAT
ATCTTCATTG ATGCTGAATT TATGGTTACT GTACCCACCG AAAACAGCGA CATTATTGCC
AATGCTTTTC TGGGTAATGG AGGAAACATT CGTATTAACG CCTCTGGAGT GTTTGGGATC
GAAGAACGGG AACGTCTGAC CCCGTTGAAT GACATTACCG CGAGTTCCCA ATTTGGACAG
GTAGGGAGTA TTGGTATTAA CCGACCCGAT GTTGATCCCC AACGCAGCTT AGTTAAATTG
CCGGGTGAAG TGGTAGACGC AAAAAACCTT GTGGTTCAAG CGTGTAGTCC TGGGGGAGCG
TATACGCGAG GCGAATTTAG CATAACGGGC TCAGGGGGTT TACCCGTTAA CCCTGATGAA
GGAATTCAAA CTGCCCCAGG CTTGACTGAA TTAGGCTATC CTGAGATAGA AATGTTTAAT
CAATCAGATA ATCAAGAATC TCTGAAAATC CCAAATGAGT CTTTAACCCC AGATTATAAG
CGTCAGAGTT CTCCTACAAC CATTGTGGAA GCTCAAGGCT GGATCATGGA TAAGAATGGT
AAAGTAGTGC TGACGGCTCA ATCACCTAAC GTTACTCCTC ACGGTTCTGG GTTTATGCCA
TCCAACTGTT ATGACCTTTC TAACCGTTCC CTTTCCTCGG CTACGTCTCC ATCCCCATCC
CTTTCTGATG TTCCTCAGCG TAATAGTCTG GCTGAGTAG
 
Protein sequence
MGDITLTTKI LETIPNYPVM SASFYLTPHD LLATFLSISG LMVSPIASAQ VASDGTVQTQ 
VNTLGNQLEI TEGTQAGSNL FHSFSQFSVP SGFEAYFNNS STISNIISRV TGGSISNIEG
LIRANGTANL FLINPNGIVF GPNAAVDIGG SFLATTAESI QFADGSQFSA TNSQSSPILT
IAVPVGLQFG SNPGTIVNRA NRTVPDPTPD NPNNTRQVGF EVKSGNTIAL VGGDLDFQGG
RVNGSGGRVE LGSVGGNSRV QLSVTNPGLE IDYQGVTNFQ DISLSQQSIV DVNNSTIQVQ
GSNIRLTNGS QISSTTRTTE DAGDLTVNAT ESVELIGAAP EGFPFPSAFI AQVNPGGTGR
GGNLIINTKQ LSIREGAGIS VASRGKGIGG RLEVNVSEAI EITGTGPQSL SVLTSSTDGT
GDGGEIVINT RQLTLSNGGQ IQAFTINQGR GGTITINASD SIEVSGRGAL PEFNTESFSS
ITAESGFQLL GFTGIAPGGN VNINTNQLIV TDGGTISAGS FGQGNAGSVD ITSNSIFLDN
QGVITASSEG SGDAGNVSIY TDQLSVNNQS EISVRNIGFG QGGNLTISAD AISLNRNSQL
SAVSLPLEDL TLQKLSINAE EFANRPNIGN AGDLILNTSS LNLNNDSQVT VSSFGTGNAG
SMGITAQNIA LDNSSELAAE TASGEGGNIS LYVSDFLNLR RASTISTTAG TLGGGGNGGN
IFIDAEFMVT VPTENSDIIA NAFLGNGGNI RINASGVFGI EERERLTPLN DITASSQFGQ
VGSIGINRPD VDPQRSLVKL PGEVVDAKNL VVQACSPGGA YTRGEFSITG SGGLPVNPDE
GIQTAPGLTE LGYPEIEMFN QSDNQESLKI PNESLTPDYK RQSSPTTIVE AQGWIMDKNG
KVVLTAQSPN VTPHGSGFMP SNCYDLSNRS LSSATSPSPS LSDVPQRNSL AE
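
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative start codons and is read as methionine when it initiates a CDS. The sketch below illustrates this on the first 30 bases of the gene sequence. It is an illustration only, not the database's own method: the function name `translate_cds` is hypothetical, and the start-codon set is deliberately reduced to three codons as a simplification.

```python
from itertools import product

# Codon table in T, C, A, G order; the amino acid assignments of
# translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification (assumption): only a subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiating start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGGAGATA TTACCCTCAC GACTAAAATA"))  # MGDITLTTKI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole 952-residue protein, provided the sequence text is copied without line markers.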