Gene CHU_3006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_3006 
SymbolhtrA 
ID4183739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp3448566 
End bp3449984 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content40% 
IMG OID638072995 
Productperiplasmic serine protease 
Protein accessionYP_679589 
Protein GI110639380 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.337572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.40182 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATT TTGTATTCTT ATTGATTTCA ATAATTGCGG GATTCTCAGG CGCTGCGCTG 
TTTCATAAAA CCGTACCCGT TAACAGCAGC ACTACTACAG ATCAGACCAT TGTCCCGATC
GTGCATCAGG CGGCTTATAG CAGCCCTGTT CAAAGTAATA CTGCTACGGA TTTCACGCTT
GCTTCTGCAA TCAGTACACC CAGTGTTGTG TATATAACGA CCGTTTCTGC CAACCAAAAT
ACGAACAATT GGTTTGACTG GTATTTTAAT GGCAATGGTA ATAATTTTGT TGCCGGATCA
GGTTCCGGTG TTATTTATTC TGCAGACGGC TACATCATTA CCAATAACCA CGTTATTCAG
CGGGCAACCA AGATTGAAGT GGTACACAAC AGAACTACCT ATACAGCGAA AATTGTTGGT
ATAGATCCCT CCTCTGATCT GGCGGTATTA AAAATTGAAG GCGAAAATCT TCCTGCAGTT
AAAATCGGAA GTTCGGCAGA TATAAAAATC GGTGAATGGG TTCTTGCCGT TGGCAATCCA
TTTAACCTCA CCTCCACGGT CACGGCTGGT ATTGTTTCTG CTAAAGGAAG AAATATTAAT
ATTGTAAACA GCAGCTTCCC TATCGAATCC TTTATACAGA CAGATGCTGC AATTAATCCC
GGCAATTCCG GCGGAGCACT TGTAAATACA AAAGGTGAAC TTATCGGTAT TAATACAGCT
ATCTTATCTA AAACAGGTAG CTACACCGGT TATGGGTTTT CAGTTCCGGT AGATATTGTA
AAAAAAATCG TTGCTGATCT GATTAAATAT GGTGTTGTTC AAAAAGCATT TATCGGATTA
GAAGTAAGCG AAGTAAATTC TACAATTGCA AAAGAATTAA AACTTTCTGA TCTGGATGGA
ACCTATATCA CCTATCTTCA AAAGGGTTCT GCTGCAGAAA AAGCAGGCTT GCAAAAAAAT
GATGTTCTCC TTAAATTAAA TGATAAATCA ATAACTTCAC GAAGCGATTT TGATGAATAT
ATTGCATACA AATCACCGGG TGAAAAAATT AAAATTACCT ATAAGAGAGA TCACGTACTT
AAAGAAGCTT ATGTGACATT AACAAATGAA GATGGCAACA CGGAAATTGT TAAACACGAA
GTATTTTCCT CTCAATCCTT AGGTGCCGAT TTCAGTACGC TGCCTAAGGT AGAGAAAGAA
AAAATGGGTC TCGCAAATGG TGTGCGTATC GTTGCCGTTC GCAGTGGCTT GATCAGCAGG
CTTGGCTTAC AGGAAGGGTT TATTATTACA TCGATTAACC GTACACCTGT ATCAACGCCG
GCTGAAGTGG CAAGCATTCT TGAGAACATT CGCGGGCAGG TAATCATTGA AGGTATTGCC
AGCAGTGGTT CCCGCGCTAT TTACCAGTAT TATTTTTAA
 
Protein sequence
MKHFVFLLIS IIAGFSGAAL FHKTVPVNSS TTTDQTIVPI VHQAAYSSPV QSNTATDFTL 
ASAISTPSVV YITTVSANQN TNNWFDWYFN GNGNNFVAGS GSGVIYSADG YIITNNHVIQ
RATKIEVVHN RTTYTAKIVG IDPSSDLAVL KIEGENLPAV KIGSSADIKI GEWVLAVGNP
FNLTSTVTAG IVSAKGRNIN IVNSSFPIES FIQTDAAINP GNSGGALVNT KGELIGINTA
ILSKTGSYTG YGFSVPVDIV KKIVADLIKY GVVQKAFIGL EVSEVNSTIA KELKLSDLDG
TYITYLQKGS AAEKAGLQKN DVLLKLNDKS ITSRSDFDEY IAYKSPGEKI KITYKRDHVL
KEAYVTLTNE DGNTEIVKHE VFSSQSLGAD FSTLPKVEKE KMGLANGVRI VAVRSGLISR
LGLQEGFIIT SINRTPVSTP AEVASILENI RGQVIIEGIA SSGSRAIYQY YF