Gene CHU_3351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_3351 
SymbolhsdS 
ID4185046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp3830174 
End bp3831238 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content33% 
IMG OID638073340 
Producttype I site-specific deoxyribonuclease S subunit 
Protein accessionYP_679930 
Protein GI110639720 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00146581 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0102246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGATTTC CTGAGTTTGA TGAGGAATGG GAAGAGAAAA CGTTGGGGGA GATCTGTGAA 
ATGCAAGCTG GAAAATTCGT TAGTGCTAGT GAAATAAAAG AGCAGCATTT TGACGGCTTA
TTTCCTTGTT ATGGTGGAAA TGGATTAAGA GGTTATACTA AATCATATAA TTACGATGGT
AAATATTCCT TAATTGGTCG ACAGGGAGCA TTATGTGGCA ATGTAAATTT TGCTAATGGA
AAATTTCATG CAACAGAGCA TGCAGTGGTT GTCACCCCGT TAAATGGCAT TAATACAGTT
TGGATGTTTT ACTTGTTAAC AAATTTGAAT TTAAATCAAT TTGCTACAGG CATGGCCCAA
CCAGGACTAT CTGTACAAAA TTTAGAAAAG GTTGAGAGTA CAATTCCTAA AGCTATAGAT
GAGCAAGAAA AAATTGCTTC TTTTCTAACG CTAATTGACG GACGTATCTC AACTCAAAAC
AAAATAATTA AGGAATTAGA ATTACTTATT AAATCAATTA GCCAAATTAT ATTTCATGGA
CACAGATATA AATTCAAAAA AGCAAGCTTA GGTTCAATCT GCACTATAAA AAAAGGCGAG
CAAATTAACA GTTCGGTGTT AAGTGAATCA GGACTTTACG CAGTAATGAA TGGAGGAATT
ACTCCATCGG GATATTACTC ACAATATAAT TGTGTTGGTA ATACTATCTC TATTAGCGAA
GGAGGAAATT CATGCGGCTA TGTCCAATTC AATGATAAGA AATTTTGGAG CGGGGGACAT
TGTTACACAC TATCCGAAAT CAACGCAGAA ATTTCTAATA AATACCTATA TTACTTTATG
AAATTCTCTG AGAATTTAAT AATGTCTCTT CGCGTAGGCT CGGGATTACC TAATATCCAG
AAAAAAGATC TTGAAAAATT CAATGTAGCC TTTCCTGAAA TAAATCAACA GTATCAAATC
TCTAAATTTT TGGATCTTTT AACAGAAAAG ATCCAAGTTG AAAAATCTCT TAAAACTTCC
TTAATAAGGC AGAAGCAGTA TGTACTAAAA AAAATGTTCA TATAA
 
Protein sequence
MRFPEFDEEW EEKTLGEICE MQAGKFVSAS EIKEQHFDGL FPCYGGNGLR GYTKSYNYDG 
KYSLIGRQGA LCGNVNFANG KFHATEHAVV VTPLNGINTV WMFYLLTNLN LNQFATGMAQ
PGLSVQNLEK VESTIPKAID EQEKIASFLT LIDGRISTQN KIIKELELLI KSISQIIFHG
HRYKFKKASL GSICTIKKGE QINSSVLSES GLYAVMNGGI TPSGYYSQYN CVGNTISISE
GGNSCGYVQF NDKKFWSGGH CYTLSEINAE ISNKYLYYFM KFSENLIMSL RVGSGLPNIQ
KKDLEKFNVA FPEINQQYQI SKFLDLLTEK IQVEKSLKTS LIRQKQYVLK KMFI