Gene Information |
Locus tag | PCC8801_1457 |
Symbol | |
ID | 7103657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1530335 |
End bp | 1531588 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643474533 |
Product | peptidase M50 |
Protein accession | YP_002371670 |
Protein GI | 218246299 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
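The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the record; the 1-based inclusive convention is an assumption.
START_BP = 1530335
END_BP = 1531588


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp,", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1254 bp) and Protein Length (417 aa) fields.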
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCAAGGAA AATGGCAAAT CGGTTCTTTA TTAGGAATCC CTCTCTATCT TGATCCGTCT TGGTTTATTA TTCTCCTGTT TGTGACCTTG GTTAATGCAG CAGAGATTAG CACGCAAAGA TTAGGGGGAA ATCTGCCAGG TTTGGGATGG TTAGCTGGAT TTATCATGGC TTTATTGCTA TTTGGGTCGG TTCTGCTTCA CGAATTGGGA CACAGCTTAG CCGCTCGCGC TCAGGGGATT AAGGTTAATT CAATTACACT ATTTCTCTTT GGGGGAGTGG CCTCGATCGA TCGAGAATCG AAGACCCCGG TCGGAGCCTT TTGGGTAGCG ATCGCAGGTC CTTTGGTCAG TTTTGGTCTG TTTATTTTGT TTTTTAGTCT CATTCAATGG GTGAATATTT CCAGCTTTGT CCCATCCGTT ACTCAAGAGC TTGGGAATAT TAAGTCATTA TTAAGGTATA TGTTGGGAGA TTTAGCCCGA ATTAATCTGG TTTTAGGGAT TTTTAATTTA ATTCCAGGGT TGCCCCTCGA TGGGGGACAA ATTTTAAAGG CGATTGTTTG GAAACTAACG GGCGATCGCT TTACGGGGGT TCGTTGGGCA GCAGCGAGTG GTAAATTAAT TGGTTGGGTG GGAATCTCGA CGGGATTGTT TTTTGTCTTG ACAACAGGGG GTTTAAGTCC CGTTTGGATC GCGTTGATCG GTTGGTTTGT TCTGCGTAAT GCTGATACCT ATGATCGCTT GACCGCTTTG CAAGAAAGTT TACTCAAAAT TGTGGCGGCT GAAGCGATGA GTCATGATTT TCGGGTGATT AATGCTCATC TAACCTTAAA CCAATTTGCT CAAGAATATA TTCTCAGAGA TTTGAATACG TCTTTAGTGT ATTATGCTGC GTCTGAAGGT CGTTATCGGG GACTCATTCG TGTTCAAGAT TTACAGTTAA TTGAGCGTTA TCTCTGGGAA AATCAAACGT TAATCGATAT TGTGCATCCT TTAACGGAGA TTCCTTCCGT TATAGAAAAG ACTCCCTTAG CAGAGGTAAT TGAAACGCTA GAATCTATTA GCGATCGTTC TGTAACAGTA TTATCTCCGG CGGGAGCCGT TGCAGGAGTT ATTGATCGCG CAGATATTGT GAAAATTATC GCTATACGCC ATAATCTTCC GATTCCTGAC AATGAAATCC ATCGGATCAA AGCTGAAGGA ACCTATCCCC CTTATTTACA ACTCCCTGCG ATCGCTAAAA GTCTTCATGA TTAG
|
Protein sequence | MQGKWQIGSL LGIPLYLDPS WFIILLFVTL VNAAEISTQR LGGNLPGLGW LAGFIMALLL FGSVLLHELG HSLAARAQGI KVNSITLFLF GGVASIDRES KTPVGAFWVA IAGPLVSFGL FILFFSLIQW VNISSFVPSV TQELGNIKSL LRYMLGDLAR INLVLGIFNL IPGLPLDGGQ ILKAIVWKLT GDRFTGVRWA AASGKLIGWV GISTGLFFVL TTGGLSPVWI ALIGWFVLRN ADTYDRLTAL QESLLKIVAA EAMSHDFRVI NAHLTLNQFA QEYILRDLNT SLVYYAASEG RYRGLIRVQD LQLIERYLWE NQTLIDIVHP LTEIPSVIEK TPLAEVIETL ESISDRSVTV LSPAGAVAGV IDRADIVKII AIRHNLPIPD NEIHRIKAEG TYPPYLQLPA IAKSLHD
|
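The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a hedged sketch of how the gene sequence could be cleaned, its GC content estimated, and its translation compared against the protein sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments are shared by translation table 11; table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in s if base in "GC") / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGCAAGGAA AATGGCAAAT CGGTTCTTTA"))
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field is one way to confirm that the two fields agree; `gc_percent` on the same input can be compared with the GC content field.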