Gene Information |
Locus tag | PCC8801_1300 |
Symbol | |
ID | 7102194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 1350045 |
End bp | 1352681 |
Gene Length | 2637 bp |
Protein Length | 878 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643474384 |
Product | TPR repeat-containing protein |
Protein accession | YP_002371521 |
Protein GI | 218246150 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR02521] type IV pilus biogenesis/stability protein PilW |
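The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive (which matches the stated Gene Length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1350045
END_BP = 1352681


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2637
print(protein_length(length_bp))  # 878
```

Both results agree with the Gene Length (2637 bp) and Protein Length (878 aa) rows of the table.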
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGC GATCGCTGTT ACCTTTGTTG TTAACTCTGT CCCTGGTAGG GGTTTTTACC CCTTCAGTGG TCTTGTCACA AAGTATCGAT CAATTATTTC AACAAGGCAG AACAGCCGGA AAGATGGGAA AATATACCGA AGCAGAAGCT ATCTTCCGTC GAGTAATTGA ATTAGACCCC AACTTAGCTG ATGCTTACAA CAATCTCGGT AATGCGCTGT ATTACCAAGG AAAACTAGAC GAGGCGATCG CAGCTTATCA AAAAGCCATC CAACTCAACC CCAACGATGC TGATGCTTAC AACAATCTCG GTAATGCGCT GTCTGACCAA GGAAAACTAG AGGAGGCGAT CGCAGCTTAT CAAAAAGCCA TCCAACTCAA CCCCAACTAT GCTGATGCTT ACTACAATCT AGGTATTGCG CTGTCTGACC AAGGAAAACT AGAGGAGGCG ATCGCAGCTT ATCAAAAAGC CATCCAACTC AACCCCAACT TTACTCAAGC TTACTACAAT CTAGGTATTG CGCTGTCTGA CCAAGGAAAA CTAGAGGAGG CGATCGCGGC TTATCAAAAA GCGATCCAAC TCAACCCCAA CTATGCTGAT GCTTACTACA ATCTAGGTAA TGCGCTGTTT GACCAAGGAA AACTAGACGA GGCGATCGCA GCTTATCAAA AAGCCATCCA ACTTGACCCC AACGATGCTA ATGCTTACAA CAATCTAGGT GCTGCACTGT ACAAGCAAGG AAAACTAGAA GAGGCGATCG CAGCTTATCA AAAAGCGATC CAACTCAACC CCAACTTAGC TGAAGCTTAC AACAATCTCG GTGTTGCGCT GTCTGACCAA GGAAAACGAG ACGAGGCGAT CGCAGCTTAT CAAAAAGCGA TCCAACTCAA CCCCAACTTA GCTGAAGCTT ACAACAATCT CGGTGTTGCG CTGTCTGACC AAGGAAAACG AGACGAGGCG ATCGCAGCTT ATCAAAAAGC GATCCAACTC AACCCCAACT TTGCTTTAGC TTACAACAAT CTAGGTGTTG CGCTGTCTGA CCAAGGAAAA CGAGACGAGG CGATCGCAGC TTATCAAAAA GCGATCCAAC TCAACCCCAA CTTTGCTTTA GCTTACAACA ATCTAGGTGT TGCGCTGTCT GACCAAGGAA AACGAGACGA GGCGATCGCA GCTTATCAAA AAGCGATCCA ACTCAACCCC AACTTTGCTT TAGCTTACAA CAATCTAGGT GTTGCGCTGA GAAACCAAGG AAAACGAGAC GAGGCGATCG CAGCTTATCA AAAAGCCATC CAACTTGACC CCAACGATGC TAATGCTTAC AACAATCTAG GTCTTGCGCT GAGAAACCAA GGAAAACGAG ACGAGGCGAT CACAGCTTAT CAAAAAGCGA TCCAACTCAA CCCCAACTTT GCTTTAGCTT ACAACAATCT AGGTAATGCG CTGTATTCCC AAGGAAAACG AGAGGAGGCG ATCGCAGCTT ATCAAAAAGC GATCCAACTT AACCCCAACT TTGCTTTAGC TTACAACAAT CTAGGTAATG CGCTGTCTGA CCAAGGAAAA CGAGACGAGG CGATCGCAGC TTATCAAAAA GCGATCCAAC TCAACCCCAA CTTTGCTTTA GCTTACAACA ATCTAGGTAA TGCGCTGTCT GACCAAGGAA AACTAAACGA GGCGATCGCA ACTTATCAAA AAGCGATCCA ACTTAACCCC AACTTTGCTT TAGCTTACAA CAATCTAGGT AATGCGCTGA AAGACCAAGG AAAACTAAAC GAGGCGATCG CAGCTTATCA AAAAGCCCTA 
AGTTTGCCCG AAGATACTTC AGTAACTCCA ACCACTGCTC ATACCTTGGC ACATAATAAT TTAGGCTTAG TTTATCAACC ACAGGGAAAG TTAGAGGAAG CTTTGCGGGA ATATGAGGCA GCCTTGAAAA TTGATCCCAA GTTCGAGTAT GCCATAAAGA ACCGAGATGC GGTGCTTGCA CTCCTCAAAC AACCTACGGA ACTAGCCTAT ACTACTAACA ATTATCTGCC CTCAGATGAC CCTTTCATTG CCCCTAAACG GTCTGTTGTG GTGCTAACTC CTATTTTCCC CCAAGGAAGA AATAAGGACG GTACAGGCTT TATTATTAAG CGCAGTGGGG ACAAACTCTG GATAGTTACT AACCGTCATA TCGTCGTTGA TGCCTACTAC AGTACACCAC CGCGCCTCTG TGACTATGTG GAAGTGCAAA TCTATCTAGG AACTAAACCG AGTAATGCGC GGACTCAGGT GATAAAAGGT CGTGTTATTT CTTCCCTAGA AGATCCCGAC CTCGCTATTA TTGAAATGGA AGCTCCTAAC CTCCCTTCGG ATATTCAACC CTTGCCTATT TCTAGCCCTA GCGATAACAT GAAAGTGACA ACCATTGGTC ATCCAGAAGG TAATACTTGG AAGCGAGATG AGGGAAAAGT TGTTAATGTC TCTGATCAAC GATTACTATT AGAAATAAGC CTAGCTGTGG GCAGTTCAGG CAGTCCTATT CTAGCAGAAA ATAATGGGGT TATTGGTATT ATTTATCGAA TCGATCAAGA GGGAACTGGC TATGCAATTC CTATCAGTCG AGTAATGAGA CAGATAGAAA CTTGGGGAAT TAAATAA
|
Protein sequence | MKLRSLLPLL LTLSLVGVFT PSVVLSQSID QLFQQGRTAG KMGKYTEAEA IFRRVIELDP NLADAYNNLG NALYYQGKLD EAIAAYQKAI QLNPNDADAY NNLGNALSDQ GKLEEAIAAY QKAIQLNPNY ADAYYNLGIA LSDQGKLEEA IAAYQKAIQL NPNFTQAYYN LGIALSDQGK LEEAIAAYQK AIQLNPNYAD AYYNLGNALF DQGKLDEAIA AYQKAIQLDP NDANAYNNLG AALYKQGKLE EAIAAYQKAI QLNPNLAEAY NNLGVALSDQ GKRDEAIAAY QKAIQLNPNL AEAYNNLGVA LSDQGKRDEA IAAYQKAIQL NPNFALAYNN LGVALSDQGK RDEAIAAYQK AIQLNPNFAL AYNNLGVALS DQGKRDEAIA AYQKAIQLNP NFALAYNNLG VALRNQGKRD EAIAAYQKAI QLDPNDANAY NNLGLALRNQ GKRDEAITAY QKAIQLNPNF ALAYNNLGNA LYSQGKREEA IAAYQKAIQL NPNFALAYNN LGNALSDQGK RDEAIAAYQK AIQLNPNFAL AYNNLGNALS DQGKLNEAIA TYQKAIQLNP NFALAYNNLG NALKDQGKLN EAIAAYQKAL SLPEDTSVTP TTAHTLAHNN LGLVYQPQGK LEEALREYEA ALKIDPKFEY AIKNRDAVLA LLKQPTELAY TTNNYLPSDD PFIAPKRSVV VLTPIFPQGR NKDGTGFIIK RSGDKLWIVT NRHIVVDAYY STPPRLCDYV EVQIYLGTKP SNARTQVIKG RVISSLEDPD LAIIEMEAPN LPSDIQPLPI SSPSDNMKVT TIGHPEGNTW KRDEGKVVNV SDQRLLLEIS LAVGSSGSPI LAENNGVIGI IYRIDQEGTG YAIPISRVMR QIETWGIK
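The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation sketch. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons are permitted, not in how internal codons are read). The listed gene sequence already begins with ATG, so it is assumed here to be given on the coding strand even though the gene lies on the minus strand of the replicon. The helper `translate` is a hypothetical name for illustration, not a function from any particular library.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAAACTGC GATCGCTGTT ACCTTTGTTG"
print(translate(first_blocks))  # MKLRSLLPLL
```

The output matches the first ten residues of the protein sequence. The same function applied to the final 27 bases of the gene (CAGATAGAAA CTTGGGGAAT TAAATAA) yields QIETWGIK, the last eight residues, followed by the TAA stop codon.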