Gene PCC8801_3268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3268 
Symbol 
ID7105919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3420122 
End bp3421447 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content36% 
IMG OID643476287 
ProductTPR repeat-containing protein 
Protein accessionYP_002373397 
Protein GI218248026 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTG TTATTTATCA ATCCTATAGA ACAGAAAACG TTCCTTCTTG GATTACTCAA 
TGTATGAAAA CGGTCAAAGA TTGGTCACAA GCCAATAACT TTGATTACCA GTTTATTGAT
GATCGGTTAT TTGAATATGT ACCTGAATGG TATCGCAAGA AAGTCAATAA TCAAATTCAT
TTAACTTCTG ACTTAGCTAG ACTTAATCTG GCTAAAGAAT ATCTCTCAAA GGGGTATGAT
CGGGCAATTT GGATTGATGC GGATATACTC ATATTCAATC AAGAGAAGTT TAATCTACCC
ATCGATCAGC CCTATCTATT TTGTCGAGAA ATTTGGCTTG ATTATGATAG GATAGGACAA
AAAGTTACTT GCCAAACAAG AGTAAATAAC TCTGTATCAG TATTTTTGAA AAACAATAGT
TTACTGGATT TTTATATCCA TGCTTGTGAA GAAATTGTTA AAAATAAAAG TCAAATTGTT
AACATATCTG TTGGTACAGT TTTCCTAACG TTATTGTATC AAGCAATAGG AGGAGAATTA
ATCAATAATG TGGGGTTATT TAGCCCTTTT ATTACTAATG ATATTGCCCA GTGTCAGGGA
ATTTTTAGCT CAATTTATAT GAAGGCTTTC GCTCATCCTA TCTACGCGGC TAATCTCTGT
ACCAGTTTTA GGGAGATGAA TTACTATGGG ATAATTGTAT CCGATGCCGT CTATGAACAG
GCGATCAATA ACTTATTACA AACTCAGGGA ATGGTGGTCA ACCAGTATTG TCCTGAAATA
ACCAACATTG AGCAGAAACT TACCATCAGA GAAGAAGATT ATCGAACAAT GGTTGTTTCT
ATTGAGGAAA ACTCTCAACC GCTAGACAGA GAGGCTAACC TGGCTTTAGC GATTAGACTA
CATAAAGCCA ATTTTTTGAC AGAAGCCGAA CAAATTTATC AGAACATTCT TAAAAATTAT
CCCGATGACC CAGAAGCTCT CTATTGGCTA GGAGTATTAT CTAATCAGTT AAGTCGGCAG
GAACAGGCAG AACAATTATT CAGAAAAAGT CTTAAAATTC AGCCAAATTT AGCTAAGGCT
TGGTTTAGCT TAGGTAATTT ATACCAAGGG CAAGGAAAGT TACAAGAGGG GATAGGGTAT
TATCAAAAAG CCTTAGACCT ACAACCAGAT TCAGCCTTTA TTCATAATAA TCTAGGCTAT
GCACTACAAC AACAAGGTCA ATGGGAAAAA GCGATCGCCT GTTATCAAAA AGCCTTAGAA
CTTCAACCTA ATTGCGTTGA AGCCAAGGTC AATTTAGATA ATGCCCTGAA GACTCAAGTA
ATCTGA
 
Protein sequence
MKTVIYQSYR TENVPSWITQ CMKTVKDWSQ ANNFDYQFID DRLFEYVPEW YRKKVNNQIH 
LTSDLARLNL AKEYLSKGYD RAIWIDADIL IFNQEKFNLP IDQPYLFCRE IWLDYDRIGQ
KVTCQTRVNN SVSVFLKNNS LLDFYIHACE EIVKNKSQIV NISVGTVFLT LLYQAIGGEL
INNVGLFSPF ITNDIAQCQG IFSSIYMKAF AHPIYAANLC TSFREMNYYG IIVSDAVYEQ
AINNLLQTQG MVVNQYCPEI TNIEQKLTIR EEDYRTMVVS IEENSQPLDR EANLALAIRL
HKANFLTEAE QIYQNILKNY PDDPEALYWL GVLSNQLSRQ EQAEQLFRKS LKIQPNLAKA
WFSLGNLYQG QGKLQEGIGY YQKALDLQPD SAFIHNNLGY ALQQQGQWEK AIACYQKALE
LQPNCVEAKV NLDNALKTQV I