Gene Cyan8802_1125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1125 
Symbol 
ID8390436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1155340 
End bp1157109 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content45% 
IMG OID644979140 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003136891 
Protein GI257059003 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.798248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.349528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGT TCTTATTTCT CTGTTTATTT ATTCTGGGAG TGTGGTTCGC CCTCAGTAGT 
TTTAAAGGGT TAGCAACGAA AGGCGAATTT AACTCCGTGA TTGTTAATTT CCGCGAAGAT
GTCCCCACTT CTGTTTTAAG TGAGGAAATT AGGACGATCG CACAAACCTA TCAAAAAACC
GCTAGTCTCA ATAGTATTTT TTCCATCGAT GATCATATTT ATACCCTAGA GGGCGATGGC
AACCTACTGA AAAAACTCAA ACAATCTTCT ATTAAACAGT ATATTGAATA TATCGAACCA
AACTATATTT ATCAAGCCCT AGAAGCCCCC AATGATCCCG ACTATAGCAA GCAATGGAAC
TTACATAATA TCAATATCGA ACGGGCTTGG GAAGACAGCA AAGGAGAAGG CGTAACCGTT
GCCGTTATTG ATACTGGAGT CAGTCGGGTT CCTGACTTAC GACAAACCGA ATTTGTCCAA
GGGTACGATT TTGTCAATGA TGGAAACAAC GCTGACGACG ATAACGGACA CGGAACCCAC
GTTGCCGGAA CCATTGCCCA ATCCACCAAT AATAACTATG GGGTTGCCGG GGTTGCCTAT
GGGGCTAAAA TTATGCCCCT GAAAGTCCTT TCCGCCGGAG GAGGGGGAAC CGTCGCTGAT
ATTGCCGAAG CCATCCGTTT TGCTGCCGAT CATGGCGCAG ATATCATTAA TATGAGTTTA
GGCGGCGGGG GCGAAAGTCA GGTGATGAAA GAAGCCATCG ACTACGCTGA CTCAAAAGGG
GTAGTCATTA TTGCTGCTGC TGGCAATGCT AATCAAAATT CGGCCTCCTA TCCCGCGCGT
TATCCCAAGG TGATCAGTGT TTCTGCCTTA GATCCTGCCG GGAAAAAAGC CCCCTATTCT
AACTACGGCG CAGGGGTCGA TATTTCTGCC CCAGGAGGCA GTGAAGCGGG CAAAATCCTC
CAAGAAACCA TCGATCCCAA AACGGGAGAA TCCGTATTTG CAGGGTTACA AGGAACCAGT
ATGGCAGCCC CCCATGTGGC GGGTGTGGCA GCGTTAATTA AAGCTTCTGG GATTAAAGAA
CCCTCTGAGG TGTTAAAGGT TCTTAAAGCC TCTTCGCGTA AGGTTCAAGA TGATCCGTTT
AATCATTTTG GGGCTGGACA ATTAGACGCA GGAGAAGCGG TTAAATTAGC CGTTAAAGGA
CAAATTACTT TCCGTGATTT CTTCCGATGG TTGCGCGATA ATGGCTATCT TAATCCTCGT
TTTTGGATTG ATGGCGGAGT CGTCGCTTTA ATGCCTAAAA TCTTAATGGT TTTGGGTTCC
TATCTTTTAG CTTGGTTGTT ACGGGTTTAT TTCCCCTTTC AATGGGGTTC GATGCTGAAT
TGGGGACTCA TTTTAGGCAG TTCAGGGTTA TTTTTCCTAC AAGGAGTCTA TATCTTTGAT
CTGCCTCAAT GGCCGTTTCG TGTCATGGGA AGTTCGGTTC CGGAATTAGC TAATGCTATT
CAAGGAACGG CTTTATTAAA TCCTTTATTA GCGAGTGTTT TAATTCCCTT TGTTTTGATT
GCTTTGTTGT TAGGGCATCG GCAAGGAAAA GGCTTTGCGA TCGGGATTTG TTTGGGGGTG
GCTTCCTGTT TAACGGTTCA TGCAGTGATG AGTCCTGAAG TATTATGGAT GCCTTCTGTT
AATCTTGCTC GAACTTTTTT AGGGGTTAAT GCCTTATTGT GTTTAGCATT AGCTTGGTTA
GCCACTAAAG GAGAGGGGAA AACGGCTTAA
 
Protein sequence
MKKFLFLCLF ILGVWFALSS FKGLATKGEF NSVIVNFRED VPTSVLSEEI RTIAQTYQKT 
ASLNSIFSID DHIYTLEGDG NLLKKLKQSS IKQYIEYIEP NYIYQALEAP NDPDYSKQWN
LHNINIERAW EDSKGEGVTV AVIDTGVSRV PDLRQTEFVQ GYDFVNDGNN ADDDNGHGTH
VAGTIAQSTN NNYGVAGVAY GAKIMPLKVL SAGGGGTVAD IAEAIRFAAD HGADIINMSL
GGGGESQVMK EAIDYADSKG VVIIAAAGNA NQNSASYPAR YPKVISVSAL DPAGKKAPYS
NYGAGVDISA PGGSEAGKIL QETIDPKTGE SVFAGLQGTS MAAPHVAGVA ALIKASGIKE
PSEVLKVLKA SSRKVQDDPF NHFGAGQLDA GEAVKLAVKG QITFRDFFRW LRDNGYLNPR
FWIDGGVVAL MPKILMVLGS YLLAWLLRVY FPFQWGSMLN WGLILGSSGL FFLQGVYIFD
LPQWPFRVMG SSVPELANAI QGTALLNPLL ASVLIPFVLI ALLLGHRQGK GFAIGICLGV
ASCLTVHAVM SPEVLWMPSV NLARTFLGVN ALLCLALAWL ATKGEGKTA