Gene CHU_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_0052 
SymboldegQ 
ID4186973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp63736 
End bp65187 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content43% 
IMG OID638070051 
Productserine protease 
Protein accessionYP_676686 
Protein GI110636479 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00292411 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGCGG TAGTGTCTTC TATCTTTGGA GGCATTGTAG CTTTAGTCGG CTACCAGTAT 
TTTGTCAAAA AGAATGAATT TACGTCCATT GAATCGATGC AGCGCGCATC CTTTGCAAAT
TTTTCAGATA CATCTGCTAT CGCAGTGCCT GCGGGTCTCA ATTTTATTCA TGCCGCAGAA
TTGACTACAC CGGCAGTAGT GCATATAAAA ACAACGTATA TGCCTGAAAC TACACGCCCT
AAAACGCGTG ATGAAGAATT GTTCCGGTAC TTCTATGGTG ATCCGTATGA AAATTACAAC
CAGCCGCGCG AAGCTTCCGG CTCCGGCGTT ATTGTTACCG GCGGCGGATA TATCGTAACA
AATAATCACG TGGTTGATAA AGCATCTAAG ATTCAGGTTG TATTAAATGA CAAAAGAACC
TACGATGCAA AACTGATCGG AACAGATCCG ACAACGGATC TGGCATTGAT TAAAATTGAA
GGTGAAAATC TTCCGTTTGT AGTGTATGGC AACTCGGATC AGGTGCGTAT CGGTGAGTGG
GTACTGGCTG TAGGAAATCC GTTTAACTTA ACGTCTACGG TTACCGCAGG TATTATCAGC
GCTAAAACAA GAAGCATCAA TATCCTGAGA GATAAAGATA ACATGGCGAT TGAATCTTTT
CTGCAGACAG ATGCGGTAGT AAACCCGGGC AATAGCGGTG GTGCATTGGT AAACTTAAGA
GGAGAACTGA TTGGTATCAA TACGGCTATT GCAAGTCCTA CGGGAGCATA TGCAGGTTAT
TCGTTTGCTG TACCGGTATC TCTTGTTAAA AAAGTGATTG ATGACATCAT GAACTATGGC
CAGGTGCAAC GTGGTTTATT AGGTGTGGTG ATTCAGGATA TGACGCCTGC TTTAGCAAAG
GAAAAAACAA TCGATTTTAT TTCAGGAGTT TATGTGAGTG CCGTTAATCA GGGAAGTGCA
GCAGACCTGG GAGGTATTAA AGAAGGCGAT ATTGTAACAA AGATCAATGA CATCAACATC
GGCGCAACAA CACAATTGCA GGAAGTAGTG GCGCGCTACA GACCCGGCGA CAAATTGAAA
GTTAAGTATG TGCGCAAAGG AAAAGAACTT GAAACTTCGG TTACCTTAAA AAATAAATTA
GGCGATATGG CCATTGTTGC TAAAGACGAC AACTCTGTTA AAACGAAGCT TGGCGCAGAC
TTACAGCCGG TATCGGGTGG TGAAATGAGT GTGCTGGAAA TTTCCGGCGG TGCAAAGGTT
GCAAAATTAT TTAGCGGTAA ATTAAAAGAA GCAGGCGTAA GAGAAGGATT TATTATTACT
TCCATCGATA AAAAACCTGT CAGCTCGCCG GAAGATGTTG TCCGCATTCT TGAATCTACT
ACCAATGGCG GTATCTTGAT GGAAGGTATT TATCCGAATG GAAAAAAAGA ATTCTACGGC
ATTGGTTGGT AA
 
Protein sequence
MLAVVSSIFG GIVALVGYQY FVKKNEFTSI ESMQRASFAN FSDTSAIAVP AGLNFIHAAE 
LTTPAVVHIK TTYMPETTRP KTRDEELFRY FYGDPYENYN QPREASGSGV IVTGGGYIVT
NNHVVDKASK IQVVLNDKRT YDAKLIGTDP TTDLALIKIE GENLPFVVYG NSDQVRIGEW
VLAVGNPFNL TSTVTAGIIS AKTRSINILR DKDNMAIESF LQTDAVVNPG NSGGALVNLR
GELIGINTAI ASPTGAYAGY SFAVPVSLVK KVIDDIMNYG QVQRGLLGVV IQDMTPALAK
EKTIDFISGV YVSAVNQGSA ADLGGIKEGD IVTKINDINI GATTQLQEVV ARYRPGDKLK
VKYVRKGKEL ETSVTLKNKL GDMAIVAKDD NSVKTKLGAD LQPVSGGEMS VLEISGGAKV
AKLFSGKLKE AGVREGFIIT SIDKKPVSSP EDVVRILEST TNGGILMEGI YPNGKKEFYG
IGW