Gene Cyan8802_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1158 
Symbol 
ID8390469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1181031 
End bp1182257 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content43% 
IMG OID644979171 
ProductHtrA2 peptidase 
Protein accessionYP_003136922 
Protein GI257059034 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.201406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGACG CTAAACTTAA CCTGACTCCT TCTCAATCTT CTTCTTGGAA AAAACCGATT 
ACTTACCTAT CTTTGATGCT GTTAGGAATG GGAGTAGGCA TTGGAGGAGC CTATACTTTC
AGCCAGTCTC ATCTCTTAGC AGACACCAGT GAAGTTCTCA AACCGTCTGA AACGAGACAA
ACCCCTGAAC AATTAGTGTT TGCTCCGACG ACTGGAAATT TTGTTACCGA TGTGGTGACT
AAAGTCGGTC CTGCGGTGGT TAGAATTGAT GCTTCTCGAA CCGTTAAAAC AGAAGTGCCG
CCAATGTTTG AAGATCCGTT TTTCCGTCGT TTTTTTGGCT CCCAACTTCC TGAGATTCCC
GATGAAGAAA TTCAACGGGG AACGGGGTCT GGATTTATCT TAAGTCAGGA TGGAAAAATT
CTGACCAATG CCCATGTTGT TGATGGAGCT TCGGAAGTAA CTGTTACCCT TAAAGACGGC
CGTACTTTCA CGGGAAAAGT GTTAGGAACG GATGCGTTAA CGGATGTAGC CGTGATTAAA
ATTGAGGCTG ATAATTTACC GACGGTTCAA CAAGGTAATT CCGATAATCT TCAAGTGGGA
GAATGGGCGA TCGCCATTGG GAATCCCTTG GGATTAGATA ATACGGTCAC AACGGGAATT
ATTAGTGCCA CAGGACGTTT AAGTTCTCAA GTGGGTGTGG GAGATAAGCG GGTAGAATTT
ATTCAAACCG ACGCAGCCAT TAACCCCGGT AATTCCGGTG GACCCTTGCT CAACGCCAAT
GGGGAAGTTA TTGGAATGAA TACGGCTATC ATTCAAAATG CCCAAGGGAT CGGCTTTGCT
ATTCCCATTA ATAAGGCTGA AAAAATAGCT GAACAATTAA TCGCTAATGG GAAGGTTGAA
CACCCATTTT TAGGGATTCA GATGGTAGAA ATTACTCCTG AAATCAAACA AAAACTCAAG
CAGAGTCAAG AATTAAATGT AGTGGCTGAT CAAGGGGTCT TAATTGTTAA AGTTATGCCC
AATTCTCCGG CTGATCAAGC AGGATTAAAA CCTGGAGATG TGATCCAATC CATTGAGCAG
GAACCCCTCA AAAATCCTGG TCAAGTTCAA CAAGCGGTAG AAAAAACAGA CATAGGATCA
ACCCTACCTT TACAGGTTGA ACGCAATGGT CAGACTCTGG ATATTAGTAT TAAGGTTGGG
GTTTTACCGA ATCAGCCAAG TAGTTAA
 
Protein sequence
MKDAKLNLTP SQSSSWKKPI TYLSLMLLGM GVGIGGAYTF SQSHLLADTS EVLKPSETRQ 
TPEQLVFAPT TGNFVTDVVT KVGPAVVRID ASRTVKTEVP PMFEDPFFRR FFGSQLPEIP
DEEIQRGTGS GFILSQDGKI LTNAHVVDGA SEVTVTLKDG RTFTGKVLGT DALTDVAVIK
IEADNLPTVQ QGNSDNLQVG EWAIAIGNPL GLDNTVTTGI ISATGRLSSQ VGVGDKRVEF
IQTDAAINPG NSGGPLLNAN GEVIGMNTAI IQNAQGIGFA IPINKAEKIA EQLIANGKVE
HPFLGIQMVE ITPEIKQKLK QSQELNVVAD QGVLIVKVMP NSPADQAGLK PGDVIQSIEQ
EPLKNPGQVQ QAVEKTDIGS TLPLQVERNG QTLDISIKVG VLPNQPSS