Gene CHU_2939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2939 
SymbolhsdS 
ID4185580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp3360658 
End bp3361956 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content33% 
IMG OID638072926 
Producttype I restriction-modification system 
Protein accessionYP_679523 
Protein GI110639314 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0304782 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTA AACTACAAAA ATATCCTGCA TACAAAGATT CGGGTGTGGA ATGGTTGGGT 
GAGATACCGA AGCATTGGGA ATGTATTCGG ATGAAACATT TGTTCAGAGA TTATTCTGAA
AAAAATAAAC AGAATGAAGA ATTGCTGTCC GTTACACAAA ATCAAGGTGT TGTTCCTAGG
AGTTGGGTAG AAAGTAGAAT GGTTATGCCG TCTGGCGCAT TAGAGTCTTT TAAGTTTATA
CAAAAAGGCG ATTTTGCTAT AAGCTTAAGG TCGTTTGAGG GTGGTTTAGA GTATTGCCAT
CATGATGGTA TAATTAGCCC AGCTTATACT GTTTTAAAAA CTAAGAGAAA AATTGCCAAT
CAGTATTATA AATATTTGTT TAAGTCATCA GCTTTTATTT CAGAATTACA AACAAGCATT
GTTGGAATAC GAGAAGGAAA AAATATTAGC TATCCTGAGC TATCTTATTC TCTTCTACCA
ATTCCTAAAA TTGATGAGCA ATCCTGTATC GCCACTTTTC TTGACGACAA AACTGCTAAG
ATAGACCAAG CGATATCTAT CAAACAAAAG CAAATAGAAC TACTTAAAGA ACGCAGACAA
ATTCTCATAC ACAAAGCCGT TACCCGTGGG TTGAATCCAA AGGTTAAAAT GAAGGACAGC
GGTGTTGAAT GGATTGGCGA GGTGCCGGAG GGTTGGGAGG TGAAGAAATT ACTTGGATTA
TGTAATTTCA TTAGAGGGAA TTCAAGTTTT GGAAAAGATG ATTTACTAAA TGATGGAGAG
TATGTTGCAT TACAATATGG GAAAACATAT AAAGTAAATG AAGTGAATGA AGAATATAAT
TATTTTGTCA ATAATGAATT TTACAAGGCA AGCCAAATAG TAAATTACGG TGATACTATA
ATTATTGCTA CATCAGAGAC AATTGAAGAA TTGGGGCATA CTGCATATTA TAAAAGAAAT
GATTTAGGTT TAATCGGAGG TGAACAAATA TTGTTGAATC CAAATAATGA TAAAATAAAT
AGTCATTATT TATACTTTAC TTCAAGAGTG TTTTCAAAAG AACTAAGGAA ATATGCAACT
GGAATAAAAG TTTTTAGATT TAATATAAAT GACTTGAAAA CTATATACAT TGCTATTCCG
CCTCTTTCAG AACAACAGCA AATTGTGGAA TATATAGAAA CCACCACAGC CAAAATTGCC
ACTGCCATTT CCCTCAAAGA AAATGAAATA GAAAAGTTAA AAGAATACAA GGCGAATTTG
GTTAATAGTG CGGTAACGGG TAAAATAAAG GTGAGTTAG
 
Protein sequence
MKTKLQKYPA YKDSGVEWLG EIPKHWECIR MKHLFRDYSE KNKQNEELLS VTQNQGVVPR 
SWVESRMVMP SGALESFKFI QKGDFAISLR SFEGGLEYCH HDGIISPAYT VLKTKRKIAN
QYYKYLFKSS AFISELQTSI VGIREGKNIS YPELSYSLLP IPKIDEQSCI ATFLDDKTAK
IDQAISIKQK QIELLKERRQ ILIHKAVTRG LNPKVKMKDS GVEWIGEVPE GWEVKKLLGL
CNFIRGNSSF GKDDLLNDGE YVALQYGKTY KVNEVNEEYN YFVNNEFYKA SQIVNYGDTI
IIATSETIEE LGHTAYYKRN DLGLIGGEQI LLNPNNDKIN SHYLYFTSRV FSKELRKYAT
GIKVFRFNIN DLKTIYIAIP PLSEQQQIVE YIETTTAKIA TAISLKENEI EKLKEYKANL
VNSAVTGKIK VS