Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CHU_0052 |
Symbol | degQ |
ID | 4186973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cytophaga hutchinsonii ATCC 33406 |
Kingdom | Bacteria |
Replicon accession | NC_008255 |
Strand | + |
Start bp | 63736 |
End bp | 65187 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638070051 |
Product | serine protease |
Protein accession | YP_676686 |
Protein GI | 110636479 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00292411 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTAGCGG TAGTGTCTTC TATCTTTGGA GGCATTGTAG CTTTAGTCGG CTACCAGTAT TTTGTCAAAA AGAATGAATT TACGTCCATT GAATCGATGC AGCGCGCATC CTTTGCAAAT TTTTCAGATA CATCTGCTAT CGCAGTGCCT GCGGGTCTCA ATTTTATTCA TGCCGCAGAA TTGACTACAC CGGCAGTAGT GCATATAAAA ACAACGTATA TGCCTGAAAC TACACGCCCT AAAACGCGTG ATGAAGAATT GTTCCGGTAC TTCTATGGTG ATCCGTATGA AAATTACAAC CAGCCGCGCG AAGCTTCCGG CTCCGGCGTT ATTGTTACCG GCGGCGGATA TATCGTAACA AATAATCACG TGGTTGATAA AGCATCTAAG ATTCAGGTTG TATTAAATGA CAAAAGAACC TACGATGCAA AACTGATCGG AACAGATCCG ACAACGGATC TGGCATTGAT TAAAATTGAA GGTGAAAATC TTCCGTTTGT AGTGTATGGC AACTCGGATC AGGTGCGTAT CGGTGAGTGG GTACTGGCTG TAGGAAATCC GTTTAACTTA ACGTCTACGG TTACCGCAGG TATTATCAGC GCTAAAACAA GAAGCATCAA TATCCTGAGA GATAAAGATA ACATGGCGAT TGAATCTTTT CTGCAGACAG ATGCGGTAGT AAACCCGGGC AATAGCGGTG GTGCATTGGT AAACTTAAGA GGAGAACTGA TTGGTATCAA TACGGCTATT GCAAGTCCTA CGGGAGCATA TGCAGGTTAT TCGTTTGCTG TACCGGTATC TCTTGTTAAA AAAGTGATTG ATGACATCAT GAACTATGGC CAGGTGCAAC GTGGTTTATT AGGTGTGGTG ATTCAGGATA TGACGCCTGC TTTAGCAAAG GAAAAAACAA TCGATTTTAT TTCAGGAGTT TATGTGAGTG CCGTTAATCA GGGAAGTGCA GCAGACCTGG GAGGTATTAA AGAAGGCGAT ATTGTAACAA AGATCAATGA CATCAACATC GGCGCAACAA CACAATTGCA GGAAGTAGTG GCGCGCTACA GACCCGGCGA CAAATTGAAA GTTAAGTATG TGCGCAAAGG AAAAGAACTT GAAACTTCGG TTACCTTAAA AAATAAATTA GGCGATATGG CCATTGTTGC TAAAGACGAC AACTCTGTTA AAACGAAGCT TGGCGCAGAC TTACAGCCGG TATCGGGTGG TGAAATGAGT GTGCTGGAAA TTTCCGGCGG TGCAAAGGTT GCAAAATTAT TTAGCGGTAA ATTAAAAGAA GCAGGCGTAA GAGAAGGATT TATTATTACT TCCATCGATA AAAAACCTGT CAGCTCGCCG GAAGATGTTG TCCGCATTCT TGAATCTACT ACCAATGGCG GTATCTTGAT GGAAGGTATT TATCCGAATG GAAAAAAAGA ATTCTACGGC ATTGGTTGGT AA
|
Protein sequence | MLAVVSSIFG GIVALVGYQY FVKKNEFTSI ESMQRASFAN FSDTSAIAVP AGLNFIHAAE LTTPAVVHIK TTYMPETTRP KTRDEELFRY FYGDPYENYN QPREASGSGV IVTGGGYIVT NNHVVDKASK IQVVLNDKRT YDAKLIGTDP TTDLALIKIE GENLPFVVYG NSDQVRIGEW VLAVGNPFNL TSTVTAGIIS AKTRSINILR DKDNMAIESF LQTDAVVNPG NSGGALVNLR GELIGINTAI ASPTGAYAGY SFAVPVSLVK KVIDDIMNYG QVQRGLLGVV IQDMTPALAK EKTIDFISGV YVSAVNQGSA ADLGGIKEGD IVTKINDINI GATTQLQEVV ARYRPGDKLK VKYVRKGKEL ETSVTLKNKL GDMAIVAKDD NSVKTKLGAD LQPVSGGEMS VLEISGGAKV AKLFSGKLKE AGVREGFIIT SIDKKPVSSP EDVVRILEST TNGGILMEGI YPNGKKEFYG IGW
|
| |