Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_1746 |
Symbol | |
ID | 7101818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 1827925 |
End bp | 1830228 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643474814 |
Product | Tetratricopeptide TPR_2 repeat protein |
Protein accession | YP_002371949 |
Protein GI | 218246578 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGGCA TCGGTAAGAC AGAATTGGCT TTACAGTATG GTTGGAAGGA ATGGCACAAT AAAACCTATC CAGGGGGGAT ATGTTGGGTA AATGGGGCAG ATAGTGACCC AGGACTCAAT ATTTTATCGT TTGCGAGGCA ATATCTCAAG TTAACTATAC TCGATGAGGG GACATTAGCG GAAAGGGTAG CTTATTGTTG GCGCAATTGG TTAGCGGGTG ATACCCTGAT TATTTTTGAT GATGTGGGAG ATTATCAGCG AATTAAGGAC TTTTTGCCCC CAAAACAAGA GAAACGATTT AAGGTACTCA TTACTACCCG TCGGGAATTT TTAGCGGGGA CAATAGAGAA TTATGCTGTA GAAGTCTTAG ATGAAGAGCC AGCCGTTGAT TTATTGAGGT CTTACGTTAC AGATGGCAGA ATTGACGTAG AAATTGAGCA AGCTAAGCTA TTATGTCGAG ATTTAGGTTA TTTACCTCTA GCGTTAGAGT TGGTAGCGCG GTTATTGAAA CGGCGTAAAG ATTGGACTTT AGGCAAAATT AGGGATAAAT TAGCAGAAAA AGGCTTAGAT GAAAAAATTT TACAGAGAGA TCCCAAGTTT GCTGATGAAA TGACGGCGCA ACGAGGTGTT AAGGCTGCTT TTGACCTGAG TTGGCAAGAA TTGGATAGTG AACCAGCAGC GCAAAAATTA GCACTCTATT TAAGTTTATT TGCCCTTGCA CCGTTTCCCA AGTCTCTTAT TGAGGGTTTA TTCCCTGATG AGGATAAGGA TGAGATAGAG GAATGGTTAA CCGATAGCTT GGTTGACCTT AATTTGGTGC AATGTTTAGA TAATGGATGG TATGAACTGC ATACTCTGCT ACGTCGCTAT TTTCGTGATA AATTGGAGCT TTCTGCCGAT GTTGAGACAG CTAAAAAGGC TTATTGCCGT GCTATAGTAC CCATCGGGAA AGCAATACCA GAGACTACCA CCCTAGAAGA CATAGAACGG TTTGAACCTC TCATCCCTCA CTTGACCATT GCCGCAGATG AGTTAGTAAC ATGGGTAAAA GATGGTGATA TTTTGGGGTT ATTTACTGGT TTAGGTTCAT TTTATCGGGG ACAGGGACGC TATCAAGATG CAGAACCCTA TCTTGAACAC TGTCGTATCC TAACACGACA ACGGTTAGGG GATAATCACC CTCATGTTGC AGTTTCCCTC AATAATTTAG CATTACTCTA CGATTCCCAA GGAAGGTATT CAGAAGCTGA ACCCCTCTAT CAAAAAGCTT TGTCCCTTTA TAAACGTCTG TTAGGGAATA ATCACCCTAA TATGGCACAA TCCCTCAATA ATTTAGCAGA ACTCTACCGT AACCAAGGAA GATATGCAGA AGCAGAACTC CTCCATCAAG AAGCTTTGTC CCTGAGAAAA CGTCTGTTAG GGGATCATCA CCCTGATGTC GCACTATCCC TCAATAATTT AGCAGCACTT TACTATTCCC AAGGAAGGTA TTCAGAAGCT GAACCCCTAT TAAAAAAAGC CTTGTCCCTT TATAAACGTC TGTTAGGGGA TAATCACCCT CATATCGCAT CTTCCCTGAA TAATTTAGCA GGACTCTATG ATTCCCAAGG AAAGTATGGA GAAGCTGAAC CCCTCTATCA ACAAGCTTTG TCTCTGAGAA AACGACTGTT AGGGGATCAT CACCCTGATG TCGCACAATC CCTCAATAAT TTAGCAGAAC TCTACCGTAA CCAAGGAAGG TATGGAGAAG CAGAACCCCT CTATCAACAA GCTTTGTCTC TGAGAAAACG ACTGTTAGGG GATCATCACC CTGATGTCGC ACAATCCCTC AATAATTTAG CAGAACTCTA CCGTAACCAA GGAAGGTATG GAGAAGCAGA ACCCCTCCAT CAAGAAGCTT TGTCCCTGAG TAAACGACTG TTAGGAGATA ATCACCCTGA TGTCGCACAA TCCCTCAATA ATTTAGCATT ACTCTACAAT TCTCAAGGAA GGCATGGAGA AGCAGAACCC CTCCATCAAG AAGCTTTGTC TCTGAGAAAA CGACTGTTAG GGGATAATCA CCCTGATGTC GCACAATCCC TCAATAATTT ATCATTACTC TACGATTGCC AAGGAAGGTA TGCAGAAGCT GAACCTCTCT ATCAAGAAGC TATCGCTATT GCACTTCGTA CCCTGGGGGA AAATCATCCC CATACCCAAA CTATTTACAG GAATTATCTA TTGATGCTAT CAAAATTACC CGATGAAGAA TTAGCGCAAC GGTTTCCTGC GGAGTTGGTG GAGATAGTGC GAGGGTTGAG ATAA
|
Protein sequence | MGGIGKTELA LQYGWKEWHN KTYPGGICWV NGADSDPGLN ILSFARQYLK LTILDEGTLA ERVAYCWRNW LAGDTLIIFD DVGDYQRIKD FLPPKQEKRF KVLITTRREF LAGTIENYAV EVLDEEPAVD LLRSYVTDGR IDVEIEQAKL LCRDLGYLPL ALELVARLLK RRKDWTLGKI RDKLAEKGLD EKILQRDPKF ADEMTAQRGV KAAFDLSWQE LDSEPAAQKL ALYLSLFALA PFPKSLIEGL FPDEDKDEIE EWLTDSLVDL NLVQCLDNGW YELHTLLRRY FRDKLELSAD VETAKKAYCR AIVPIGKAIP ETTTLEDIER FEPLIPHLTI AADELVTWVK DGDILGLFTG LGSFYRGQGR YQDAEPYLEH CRILTRQRLG DNHPHVAVSL NNLALLYDSQ GRYSEAEPLY QKALSLYKRL LGNNHPNMAQ SLNNLAELYR NQGRYAEAEL LHQEALSLRK RLLGDHHPDV ALSLNNLAAL YYSQGRYSEA EPLLKKALSL YKRLLGDNHP HIASSLNNLA GLYDSQGKYG EAEPLYQQAL SLRKRLLGDH HPDVAQSLNN LAELYRNQGR YGEAEPLYQQ ALSLRKRLLG DHHPDVAQSL NNLAELYRNQ GRYGEAEPLH QEALSLSKRL LGDNHPDVAQ SLNNLALLYN SQGRHGEAEP LHQEALSLRK RLLGDNHPDV AQSLNNLSLL YDCQGRYAEA EPLYQEAIAI ALRTLGENHP HTQTIYRNYL LMLSKLPDEE LAQRFPAELV EIVRGLR
|
| |