Gene PCC8801_1300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1300 
Symbol 
ID7102194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1350045 
End bp1352681 
Gene Length2637 bp 
Protein Length878 aa 
Translation table11 
GC content45% 
IMG OID643474384 
ProductTPR repeat-containing protein 
Protein accessionYP_002371521 
Protein GI218246150 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID[TIGR02521] type IV pilus biogenesis/stability protein PilW 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGC GATCGCTGTT ACCTTTGTTG TTAACTCTGT CCCTGGTAGG GGTTTTTACC 
CCTTCAGTGG TCTTGTCACA AAGTATCGAT CAATTATTTC AACAAGGCAG AACAGCCGGA
AAGATGGGAA AATATACCGA AGCAGAAGCT ATCTTCCGTC GAGTAATTGA ATTAGACCCC
AACTTAGCTG ATGCTTACAA CAATCTCGGT AATGCGCTGT ATTACCAAGG AAAACTAGAC
GAGGCGATCG CAGCTTATCA AAAAGCCATC CAACTCAACC CCAACGATGC TGATGCTTAC
AACAATCTCG GTAATGCGCT GTCTGACCAA GGAAAACTAG AGGAGGCGAT CGCAGCTTAT
CAAAAAGCCA TCCAACTCAA CCCCAACTAT GCTGATGCTT ACTACAATCT AGGTATTGCG
CTGTCTGACC AAGGAAAACT AGAGGAGGCG ATCGCAGCTT ATCAAAAAGC CATCCAACTC
AACCCCAACT TTACTCAAGC TTACTACAAT CTAGGTATTG CGCTGTCTGA CCAAGGAAAA
CTAGAGGAGG CGATCGCGGC TTATCAAAAA GCGATCCAAC TCAACCCCAA CTATGCTGAT
GCTTACTACA ATCTAGGTAA TGCGCTGTTT GACCAAGGAA AACTAGACGA GGCGATCGCA
GCTTATCAAA AAGCCATCCA ACTTGACCCC AACGATGCTA ATGCTTACAA CAATCTAGGT
GCTGCACTGT ACAAGCAAGG AAAACTAGAA GAGGCGATCG CAGCTTATCA AAAAGCGATC
CAACTCAACC CCAACTTAGC TGAAGCTTAC AACAATCTCG GTGTTGCGCT GTCTGACCAA
GGAAAACGAG ACGAGGCGAT CGCAGCTTAT CAAAAAGCGA TCCAACTCAA CCCCAACTTA
GCTGAAGCTT ACAACAATCT CGGTGTTGCG CTGTCTGACC AAGGAAAACG AGACGAGGCG
ATCGCAGCTT ATCAAAAAGC GATCCAACTC AACCCCAACT TTGCTTTAGC TTACAACAAT
CTAGGTGTTG CGCTGTCTGA CCAAGGAAAA CGAGACGAGG CGATCGCAGC TTATCAAAAA
GCGATCCAAC TCAACCCCAA CTTTGCTTTA GCTTACAACA ATCTAGGTGT TGCGCTGTCT
GACCAAGGAA AACGAGACGA GGCGATCGCA GCTTATCAAA AAGCGATCCA ACTCAACCCC
AACTTTGCTT TAGCTTACAA CAATCTAGGT GTTGCGCTGA GAAACCAAGG AAAACGAGAC
GAGGCGATCG CAGCTTATCA AAAAGCCATC CAACTTGACC CCAACGATGC TAATGCTTAC
AACAATCTAG GTCTTGCGCT GAGAAACCAA GGAAAACGAG ACGAGGCGAT CACAGCTTAT
CAAAAAGCGA TCCAACTCAA CCCCAACTTT GCTTTAGCTT ACAACAATCT AGGTAATGCG
CTGTATTCCC AAGGAAAACG AGAGGAGGCG ATCGCAGCTT ATCAAAAAGC GATCCAACTT
AACCCCAACT TTGCTTTAGC TTACAACAAT CTAGGTAATG CGCTGTCTGA CCAAGGAAAA
CGAGACGAGG CGATCGCAGC TTATCAAAAA GCGATCCAAC TCAACCCCAA CTTTGCTTTA
GCTTACAACA ATCTAGGTAA TGCGCTGTCT GACCAAGGAA AACTAAACGA GGCGATCGCA
ACTTATCAAA AAGCGATCCA ACTTAACCCC AACTTTGCTT TAGCTTACAA CAATCTAGGT
AATGCGCTGA AAGACCAAGG AAAACTAAAC GAGGCGATCG CAGCTTATCA AAAAGCCCTA
AGTTTGCCCG AAGATACTTC AGTAACTCCA ACCACTGCTC ATACCTTGGC ACATAATAAT
TTAGGCTTAG TTTATCAACC ACAGGGAAAG TTAGAGGAAG CTTTGCGGGA ATATGAGGCA
GCCTTGAAAA TTGATCCCAA GTTCGAGTAT GCCATAAAGA ACCGAGATGC GGTGCTTGCA
CTCCTCAAAC AACCTACGGA ACTAGCCTAT ACTACTAACA ATTATCTGCC CTCAGATGAC
CCTTTCATTG CCCCTAAACG GTCTGTTGTG GTGCTAACTC CTATTTTCCC CCAAGGAAGA
AATAAGGACG GTACAGGCTT TATTATTAAG CGCAGTGGGG ACAAACTCTG GATAGTTACT
AACCGTCATA TCGTCGTTGA TGCCTACTAC AGTACACCAC CGCGCCTCTG TGACTATGTG
GAAGTGCAAA TCTATCTAGG AACTAAACCG AGTAATGCGC GGACTCAGGT GATAAAAGGT
CGTGTTATTT CTTCCCTAGA AGATCCCGAC CTCGCTATTA TTGAAATGGA AGCTCCTAAC
CTCCCTTCGG ATATTCAACC CTTGCCTATT TCTAGCCCTA GCGATAACAT GAAAGTGACA
ACCATTGGTC ATCCAGAAGG TAATACTTGG AAGCGAGATG AGGGAAAAGT TGTTAATGTC
TCTGATCAAC GATTACTATT AGAAATAAGC CTAGCTGTGG GCAGTTCAGG CAGTCCTATT
CTAGCAGAAA ATAATGGGGT TATTGGTATT ATTTATCGAA TCGATCAAGA GGGAACTGGC
TATGCAATTC CTATCAGTCG AGTAATGAGA CAGATAGAAA CTTGGGGAAT TAAATAA
 
Protein sequence
MKLRSLLPLL LTLSLVGVFT PSVVLSQSID QLFQQGRTAG KMGKYTEAEA IFRRVIELDP 
NLADAYNNLG NALYYQGKLD EAIAAYQKAI QLNPNDADAY NNLGNALSDQ GKLEEAIAAY
QKAIQLNPNY ADAYYNLGIA LSDQGKLEEA IAAYQKAIQL NPNFTQAYYN LGIALSDQGK
LEEAIAAYQK AIQLNPNYAD AYYNLGNALF DQGKLDEAIA AYQKAIQLDP NDANAYNNLG
AALYKQGKLE EAIAAYQKAI QLNPNLAEAY NNLGVALSDQ GKRDEAIAAY QKAIQLNPNL
AEAYNNLGVA LSDQGKRDEA IAAYQKAIQL NPNFALAYNN LGVALSDQGK RDEAIAAYQK
AIQLNPNFAL AYNNLGVALS DQGKRDEAIA AYQKAIQLNP NFALAYNNLG VALRNQGKRD
EAIAAYQKAI QLDPNDANAY NNLGLALRNQ GKRDEAITAY QKAIQLNPNF ALAYNNLGNA
LYSQGKREEA IAAYQKAIQL NPNFALAYNN LGNALSDQGK RDEAIAAYQK AIQLNPNFAL
AYNNLGNALS DQGKLNEAIA TYQKAIQLNP NFALAYNNLG NALKDQGKLN EAIAAYQKAL
SLPEDTSVTP TTAHTLAHNN LGLVYQPQGK LEEALREYEA ALKIDPKFEY AIKNRDAVLA
LLKQPTELAY TTNNYLPSDD PFIAPKRSVV VLTPIFPQGR NKDGTGFIIK RSGDKLWIVT
NRHIVVDAYY STPPRLCDYV EVQIYLGTKP SNARTQVIKG RVISSLEDPD LAIIEMEAPN
LPSDIQPLPI SSPSDNMKVT TIGHPEGNTW KRDEGKVVNV SDQRLLLEIS LAVGSSGSPI
LAENNGVIGI IYRIDQEGTG YAIPISRVMR QIETWGIK