Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_1158 |
Symbol | |
ID | 8390469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 1181031 |
End bp | 1182257 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644979171 |
Product | HtrA2 peptidase |
Protein accession | YP_003136922 |
Protein GI | 257059034 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.201406 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGACG CTAAACTTAA CCTGACTCCT TCTCAATCTT CTTCTTGGAA AAAACCGATT ACTTACCTAT CTTTGATGCT GTTAGGAATG GGAGTAGGCA TTGGAGGAGC CTATACTTTC AGCCAGTCTC ATCTCTTAGC AGACACCAGT GAAGTTCTCA AACCGTCTGA AACGAGACAA ACCCCTGAAC AATTAGTGTT TGCTCCGACG ACTGGAAATT TTGTTACCGA TGTGGTGACT AAAGTCGGTC CTGCGGTGGT TAGAATTGAT GCTTCTCGAA CCGTTAAAAC AGAAGTGCCG CCAATGTTTG AAGATCCGTT TTTCCGTCGT TTTTTTGGCT CCCAACTTCC TGAGATTCCC GATGAAGAAA TTCAACGGGG AACGGGGTCT GGATTTATCT TAAGTCAGGA TGGAAAAATT CTGACCAATG CCCATGTTGT TGATGGAGCT TCGGAAGTAA CTGTTACCCT TAAAGACGGC CGTACTTTCA CGGGAAAAGT GTTAGGAACG GATGCGTTAA CGGATGTAGC CGTGATTAAA ATTGAGGCTG ATAATTTACC GACGGTTCAA CAAGGTAATT CCGATAATCT TCAAGTGGGA GAATGGGCGA TCGCCATTGG GAATCCCTTG GGATTAGATA ATACGGTCAC AACGGGAATT ATTAGTGCCA CAGGACGTTT AAGTTCTCAA GTGGGTGTGG GAGATAAGCG GGTAGAATTT ATTCAAACCG ACGCAGCCAT TAACCCCGGT AATTCCGGTG GACCCTTGCT CAACGCCAAT GGGGAAGTTA TTGGAATGAA TACGGCTATC ATTCAAAATG CCCAAGGGAT CGGCTTTGCT ATTCCCATTA ATAAGGCTGA AAAAATAGCT GAACAATTAA TCGCTAATGG GAAGGTTGAA CACCCATTTT TAGGGATTCA GATGGTAGAA ATTACTCCTG AAATCAAACA AAAACTCAAG CAGAGTCAAG AATTAAATGT AGTGGCTGAT CAAGGGGTCT TAATTGTTAA AGTTATGCCC AATTCTCCGG CTGATCAAGC AGGATTAAAA CCTGGAGATG TGATCCAATC CATTGAGCAG GAACCCCTCA AAAATCCTGG TCAAGTTCAA CAAGCGGTAG AAAAAACAGA CATAGGATCA ACCCTACCTT TACAGGTTGA ACGCAATGGT CAGACTCTGG ATATTAGTAT TAAGGTTGGG GTTTTACCGA ATCAGCCAAG TAGTTAA
|
Protein sequence | MKDAKLNLTP SQSSSWKKPI TYLSLMLLGM GVGIGGAYTF SQSHLLADTS EVLKPSETRQ TPEQLVFAPT TGNFVTDVVT KVGPAVVRID ASRTVKTEVP PMFEDPFFRR FFGSQLPEIP DEEIQRGTGS GFILSQDGKI LTNAHVVDGA SEVTVTLKDG RTFTGKVLGT DALTDVAVIK IEADNLPTVQ QGNSDNLQVG EWAIAIGNPL GLDNTVTTGI ISATGRLSSQ VGVGDKRVEF IQTDAAINPG NSGGPLLNAN GEVIGMNTAI IQNAQGIGFA IPINKAEKIA EQLIANGKVE HPFLGIQMVE ITPEIKQKLK QSQELNVVAD QGVLIVKVMP NSPADQAGLK PGDVIQSIEQ EPLKNPGQVQ QAVEKTDIGS TLPLQVERNG QTLDISIKVG VLPNQPSS
|
| |