Gene Information |
Locus tag | CPS_0788 |
Symbol | |
ID | 3522613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 807552 |
End bp | 809372 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637283253 |
Product | thermostable serine protease |
Protein accession | YP_267537 |
Protein GI | 71282140 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
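The coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (as the reported length implies) and a CDS that ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(807552, 809372)  # Start bp and End bp from the record
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1821 606
```

Both values match the Gene Length and Protein Length fields of the record.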
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.282797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTACG TTAGTAACAA TCAAATCGAA ATCAGCTCAC TAATAAAACG CCAAAGCAAA AGTTTTGTAC TAGGCTCATT ATTATTACCC TTAACTTTCA GCAATTTCGC GATAGCGAAT CCTCCTGAAC ATTCAGCAGC TTTTGCAAAA GGTCAAATTT TAGTACAACC ACAACCGGGC TTATCAGAGC AGAACTTCCA AAAGATCCTA GGCAAGCACA AAGCCAGCTC AAAAGGTAAG CTTCATCAAT TACGCACCCA CATTATCAAT GTACCGGCAA ATGCAGAACA AGCTATTGTT AAAGCGTTAT CTAATAATCC AAACATTGAA TTTGCTGAGG TAGACATACT CGTTAAACCA AGCGAAATTA TCGCCAACGA CACCTATTAC AACAATGCTT GGCACCTTAA TAAAATGCAA TTACCTACTG CTTGGGAAAC AGCAAAGGGT AATGGCGTGG TAGTAGCTAT TCTAGATACT GGTGTTAATA GTAATCACAC CGATTTGTCA GCCAATATGA TTGCCGGTTG GAACAGTGTT AGTCGAAACA GCGAAACGAG TGACATCTAT GGTCACGGTA CTAAGGTTGC CGGTGTTGTA GCGGCAATAA GTGATAATAA TAATGGCGTA ACCTCTATCG CTTGGCACGC ATCAATTATG CCTATACGAA TTACTAACGA TAGTTCAGGT TATGCCTATT GGAGCGATAT AGCGAATGGA TTAACTTGGG CAGCCGACAA TGGCGCCGAC ATTGCCAATA TCAGTTATCA AGTAACTACA AGCTCATCAG TAACAAATGC GGCGCAATAC ATGCGTAGTA AAGGCGGTTT GGTTGTCGCT TCAGCAGGTA ACAGTGGTGC AGACCTAAAC TGTACAGATA ATCCAAGTAT TATCACAGTA TCAGCCACAG ACAGTGCTGA TAATAAAGCC AGCTGGTCGG ATTACGGAAA TTGTATCGAT GTATCGGCGC CGGGATCAGG GATTTGGACC ACGACCAAAA GTGGTGGCTA TGGTGCTGTA AACGGTACTT CTTTTGCCAG CCCAGCAACG GCAGCAACCT TAGCACTGAT CAAATCAGCT AACCTGAATC TTAGTAATGA TGAACTTGAA AATATTTTAG AGGCTAGCGC AGATAAGTCC AAAAATGGCG GTGTATTTAA TAGCTATTAT GGTCATGGCA GAATTGATGC TGCAGCTGCG GTTGCCATGG TAGTAAACAC ACCTACTATC GATCAGCAAG CACCAACAGT TGTTATAACC TCTCCAACAG AAAATAGCGT GCAAACAGGT ACCTTTAATA TCACTGCCAA TGCACAAGAC AATATGGCTG TGAGCTCTGT TAGCCTATAT GCCAATGGTG TATTAATTGG TACTGATACT GTCGCGCCAT TTAGCGCTAA CTTTAACAGT AATAATATTG CTGATGGTAA CGTAGCTTTC ACTGCTCAGG CGTATGATGC AACAGGCAAT CAAGGTAATT CTTCAACGTA TTGGCTGACT ATCGATAATA TTGTTGATGT TGCAGACACC ATAGCACCGA GTGTGAGTAT TACTAACCTC GTTAATGGCA GTAGCATAAG CGGTAATCAA GCGATTAAAG TGAGTGCAAC AGACAATACG GCGGTGACTA AAATTGAATT GTATATTGAC GGCCAGTTAA AAACGCAAAC GACTGAATCA ATACTGTCTT ATAGTTGGAA TACCCGTAAA GTGGCTAACG GATACCATAC CATTGTAACA AAGGCATTTG ATGCTGCCGG CAATCAGAAT CAAACCAGTA TTCAAGTCAA TGTGCAGGCA 
CGTAAAAAGG GCCGTAAATA G
|
Protein sequence | MKYVSNNQIE ISSLIKRQSK SFVLGSLLLP LTFSNFAIAN PPEHSAAFAK GQILVQPQPG LSEQNFQKIL GKHKASSKGK LHQLRTHIIN VPANAEQAIV KALSNNPNIE FAEVDILVKP SEIIANDTYY NNAWHLNKMQ LPTAWETAKG NGVVVAILDT GVNSNHTDLS ANMIAGWNSV SRNSETSDIY GHGTKVAGVV AAISDNNNGV TSIAWHASIM PIRITNDSSG YAYWSDIANG LTWAADNGAD IANISYQVTT SSSVTNAAQY MRSKGGLVVA SAGNSGADLN CTDNPSIITV SATDSADNKA SWSDYGNCID VSAPGSGIWT TTKSGGYGAV NGTSFASPAT AATLALIKSA NLNLSNDELE NILEASADKS KNGGVFNSYY GHGRIDAAAA VAMVVNTPTI DQQAPTVVIT SPTENSVQTG TFNITANAQD NMAVSSVSLY ANGVLIGTDT VAPFSANFNS NNIADGNVAF TAQAYDATGN QGNSSTYWLT IDNIVDVADT IAPSVSITNL VNGSSISGNQ AIKVSATDNT AVTKIELYID GQLKTQTTES ILSYSWNTRK VANGYHTIVT KAFDAAGNQN QTSIQVNVQA RKKGRK
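The protein sequence follows from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand and contains only A, C, G, and T. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# These are the standard-code assignments, which table 11 shares.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein.rstrip("*")  # drop the terminal stop codon, if present


# First three 10-base blocks and the final 21 bases of the gene sequence.
print(translate("ATGAAGTACG TTAGTAACAA TCAAATCGAA"))  # MKYVSNNQIE
print(translate("CGTAAAAAGG GCCGTAAATA G"))           # RKKGRK
```

The two fragments reproduce the first ten and last six residues of the protein sequence listed above.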