Gene Ccel_2726 details


Gene Information

Locus tag: Ccel_2726
Symbol:
ID: 7311360
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 3284580
End bp: 3287705
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 39%
IMG OID: 643609638
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_002507017
Protein GI: 220930108
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
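
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the reported protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - (1 if has_stop_codon else 0)


length_bp = gene_length(3284580, 3287705)   # 3126
length_aa = protein_length(length_bp)        # 1041
```

Both values agree with the Gene Length and Protein Length fields of this record.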


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCTG AGAATCAAAT TGAAAAGATG TTTATTGACA TTTTAACTAT GAGGGGAAAT 
CAATGGACAT ATCGTGGTGA TATAAAAACA GAAATTGCCC TTTGGAGCAA TTTGCGCGGG
CATATCAACA GGATTAATCT TGCGGCATTG GAAGGTATAT TGCTTACTGA TAGTGAATTT
GAGCAGATAA AAAATGAGTT TACGCGATTA ACCCAAACAC CATTTTTAGC TGCTGTATGG
TTAAGAGGGG AGAATGGGAT AGCTCAAATA CCCATTGAGC GAGAAGATAT TTCTAAGGGT
AAGATAACAC TTACACTTTT CAGCAATAAG GATATTGCTG GGGGGATCTC CAGTTATGAG
GTTGTAAGCC AAATTGTGCC GACAACAGAA GGTAGAACTA CGCGTGGAGA TGTTACTCTG
CTTATAAACG GATTGCCCAT AATACATATA GAATTGAAAA ATGAGGTTGC CAAAGACGGC
TATCGGCAGG CGTTTAAGCA GATTGAACGT TATGCGCAAT CAGGTTTTTT TAATGGTATA
TATGCTACTG TGCAGATCTT CATAATCAGC AATAAAGTTT CAACAAGATA CTTTGCTCGC
CCGAGAACAA ACAATGATTT TGAGTCTGCT AAGAAATTCC TATTCAACTG GCGCGAGCAG
GATAATACTC CAGTAGAGGA TTTATACGAC TTCACGCGTA AGGTTCTCAG TATACCAATG
GCGCATGAGC TTATAAGTCG ATTTACTATT TTGGTGGATG ATAAGAAGAA TCAAAAGTTT
GTGATGGTGC TGCGTCCGTA TCAGATACAT GCGATAAAGA CAATAATGCA ACAGGCCTAT
AGTCATAAGG GAGGTTTTGT CTGGCATGCC ACCGGTTCAG GCAAAACGAT AACCAGCTTT
GTCGCTACCA AGCTACTTGC TCAGTCGGCA GTTTCAGTTT CTCGAACAGT GATGATTGTT
GACCGCAGAG ACTTAGACAG CCAAACCAAA GATGAGTTTT CTAAATTTGC TTCTGAATAC
AACACAGGGC TTTCCACGGG AGATGCCACA GACAATACGC TTATTGTAGG TATCAATAAC
CGACGTGAGC TGGTGCATAA TTTCATAAGC AAAAAGAACA GCAACACCAT TATTGTTACA
ACAATCCAGA AGTTGAGTCA TGCAATACGT GACTGTGCAG AAGCAGAGGT AAATAAATTT
GAGAAACTTA AGGGTGAGCA CATTGTGTTT ATTGTAGATG AGTGTCATCG TGCTGTTTCT
GATAAGGAAA TGAAAGCAAT CAAGAAGTTT TTTCCAAAAT CTACCTGGAT AGGCCTTACT
GGTACTCCGA TTTTCGAGGA GAACAAAAAA CAGGAAAATG GCACTTATGC CCGCACGACA
TTCGATCAAT ATGGTGAGCT TCTTCATGCT TATACTACCA AGAACGCCAT GGATGATAAA
TCTGTGCTAG ATTTTCAGGT AGAGTATCAT TCACTTATGA GTGAGGATGA GGAAACTCGA
CTGTACCTAA GTAAGCTCAG TAAAAAATAT CCTGATGATG ACCCACAGAT AAAACTGCTT
TCTATGACTG ATGCTATAGA TAAGGAGGCA CTTTTAGAAA CCTCAGACTA TGAGAATGAT
GCCTATATCG AAGCCATGTT GAAAAAAATA TTCCGTCATC AGTCTATACT GGAAAAATTC
AGGGTTATAG ATGGTATTCC CACAATGAGC GCTATTCTTA CTACTCATTC CATAGCTCAG
GCAAAACTGA TATACCATAA ACTGCTCGTA CTCAAAAAAA CAGGAGAACT TATCATAGGT
AATCCATTAG ATGAGCGCAG ACGACTAAAT GATCCTGAAT TCCCGCGTGT GGCAATTACT
TATTCGGTTG CTGAAAACCA AGATGGAATA ATTGATGCGA ACAATGAAAT ATCTGAAATT
ATGGAGCAGT ACAATGCTAT GTTCGGTACA AAATATACGG ATATAAATCT GTATAATCAA
AATATTAATA AACGCCTTGC CAGAAAAGAG GCGCAATATC AGAAAGATGG GCAGTGGCTG
GATTTTGTAA TCGTTGTGGA TAGGCTCTTG ACAGGCTTTG ACGCACCTAC GATTCAAACA
CTTTATGTGG ACAGAGAATT GCGCTATCAA AAATTACTGC AAGCATTTTC TCGGACTAAC
CGCACATATC CTGGCAAAAA CATCGGCATG ATCGTTACAT TTCGCAAACC GAAAACAATG
GTAGCGAATG TTAGAGATGC AATAAAACTG TTTTCAAACG AGGAGAGGGA TTGGGAAAAT
CTTGTGCCAA AGCAGTATGC AGAGGTTAAG CAAAGCTTCA AGGTCGCTTA CAAGAATTTT
GAGGAAGCTT ATAAAGAATT GGCTAAAGAC CCGGAAAACA TCAAAAAGCA ACTCAGTACC
ATTAAAACTT TTCAAACCTT GAGTAAGCTT GAAGAAGCCA TAAAAAGCTA TGAGGAATAT
GAGGATGAGT TTGATAAGTT TCGCAATATA ACTCAAACCA TTTCGGACGA ATTAGGCAAT
ATTGAGAATC TGAAGGCCGA AGTCAAAGAA ACGTTTGCTG AACACAATAT GGGTGACGAG
GAAATTGCGG AATTGTTTAA AATAGAGTTT TCGTCAGATC AACGCGCGAC TCTTGAAGAG
AAAATTGACA GTTATTACAT AGCTCAGCTT CTTAGGGATA TACAAAAAGA GGACAATAAA
CAGAGATTTG ATGAAATAAT CAAAAAAAAG CCCCCTATTG TTAAAGTGGC CTATGATGAG
ATATTAGGTA CGCTTTCAGA TGAACAGGAG GTCATAGACA AAGTTGCATC TCATTTTAAG
AAGCTAATTG GTGAGATAAT TGCTGAAACA GCAGCTGTGC TCAAGGTTTC GGATGACGAT
TTGTTGGTTA GCTTTCATGA ATACCGTAAC GATAAGGTGG AGGTTCCATA CATCAACGTG
ATTATTGACA AATCTTCCAT AACACAAGAG GAATTTGAAA GGAAGTTTCA TAAAAAGTTT
AGAGAGCGAC GCCGAACGAT AGAGGCCTAC TGGAAAGCAA CTATTGAGGA TAAATTGTTA
CCCCTAAGAG AAGAACTTTC AAATTTGGCT ATTGAATTAA AGAAAGCAGA GGTAACACAA
TCATGA
 
Protein sequence
MTPENQIEKM FIDILTMRGN QWTYRGDIKT EIALWSNLRG HINRINLAAL EGILLTDSEF 
EQIKNEFTRL TQTPFLAAVW LRGENGIAQI PIEREDISKG KITLTLFSNK DIAGGISSYE
VVSQIVPTTE GRTTRGDVTL LINGLPIIHI ELKNEVAKDG YRQAFKQIER YAQSGFFNGI
YATVQIFIIS NKVSTRYFAR PRTNNDFESA KKFLFNWREQ DNTPVEDLYD FTRKVLSIPM
AHELISRFTI LVDDKKNQKF VMVLRPYQIH AIKTIMQQAY SHKGGFVWHA TGSGKTITSF
VATKLLAQSA VSVSRTVMIV DRRDLDSQTK DEFSKFASEY NTGLSTGDAT DNTLIVGINN
RRELVHNFIS KKNSNTIIVT TIQKLSHAIR DCAEAEVNKF EKLKGEHIVF IVDECHRAVS
DKEMKAIKKF FPKSTWIGLT GTPIFEENKK QENGTYARTT FDQYGELLHA YTTKNAMDDK
SVLDFQVEYH SLMSEDEETR LYLSKLSKKY PDDDPQIKLL SMTDAIDKEA LLETSDYEND
AYIEAMLKKI FRHQSILEKF RVIDGIPTMS AILTTHSIAQ AKLIYHKLLV LKKTGELIIG
NPLDERRRLN DPEFPRVAIT YSVAENQDGI IDANNEISEI MEQYNAMFGT KYTDINLYNQ
NINKRLARKE AQYQKDGQWL DFVIVVDRLL TGFDAPTIQT LYVDRELRYQ KLLQAFSRTN
RTYPGKNIGM IVTFRKPKTM VANVRDAIKL FSNEERDWEN LVPKQYAEVK QSFKVAYKNF
EEAYKELAKD PENIKKQLST IKTFQTLSKL EEAIKSYEEY EDEFDKFRNI TQTISDELGN
IENLKAEVKE TFAEHNMGDE EIAELFKIEF SSDQRATLEE KIDSYYIAQL LRDIQKEDNK
QRFDEIIKKK PPIVKVAYDE ILGTLSDEQE VIDKVASHFK KLIGEIIAET AAVLKVSDDD
LLVSFHEYRN DKVEVPYINV IIDKSSITQE EFERKFHKKF RERRRTIEAY WKATIEDKLL
PLREELSNLA IELKKAEVTQ S
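
As a hedged illustration, the gene sequence can be translated and compared with the protein sequence. The sketch below assumes the sequence is pasted as text in space-separated 10-base blocks, as shown above; all function names are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard table (it differs in the permitted start codons), so the standard codon-to-amino-acid mapping is used here.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of the standard table, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACACCTG AGAATCAAAT TGAAAAGATG"))  # MTPENQIEKM
```

Applying `translate` to the full gene sequence and comparing the result with `clean_sequence` of the protein sequence would confirm the record end to end; `gc_fraction` can be used the same way against the GC content field.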