Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_0066 |
Symbol | |
ID | 7312058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 74147 |
End bp | 77449 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643606995 |
Product | type III restriction protein res subunit |
Protein accession | YP_002504434 |
Protein GI | 220927525 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAC TATTCCCTGA CTGGCAAGGT AAGATTACAC TGCCTCTGAG TCCTCTAAGA AGAGAGGTTG TAGGTGGAAA TGCAGTATGC CAGCAAGCCT TTCGTGGAGG AGACACAATT GCAGAGGCAG AAGCTTTAGA TAATGAAGGT TACTATCTTG TCAAAACACG TTTATTATTC CCATTTAATG AACTGCTACT CCGCCGCAAA GGGACACGTC ATATTGAAAC CAACTATTTA GTGCTGTTAG CTCCGGATCT AAAAAGAATC TATGAACTTG GCCCAGATAC ACGACTGGAA TGGGATGATC TTGGCTTACT GACTAGGTCT TTTAGGAACC CTGAACAAAT ACTTGAGAGT TGGAAGGGTA AATTCTCTTT TCGGGAGGAG GACAATGTGA CTGGGGCGAA AGGTCTCCGC CCACCACAGA TTGGAGCTTT ACATGCGATA GCTGCACATT TTGCAGTAGG GAAAAACTTT GAACCTGCGA CAGTCGTACT GCCTACTGGA ACAGGCAAAA CTGAGACTAT GCTTTCAATG CAAGTTTACA AAAGGCTTTT GAGAACTCTT GTAATCGTTC CAAGTGATGC CCTACGAACA CAGATTTATA AAAAATTCTT GACATTAGGG GTACTCCCCG ATCTTGAAGT TGTTCCGTCT GAAACTATAG GACCTAATGT CACAAGACTT GTAACTGGTA TTCGCTCTAA GGAAGAGGCA TTAGCGATAA TTGAAAAATC TAATATCATT ATTACTTTAC CCAAAACTCT AAGTGCATCA GATCCAGAAG CACTTAGAAT TCTTACTGAA TACTGCACAG ATCTTATTGT AGATGAAGCA CATCATATCC CCGCATCCCA ATGGTCACTC GTGTGCCAAA TGTTTTCCGA TAAACGGATT ACACAGTTTA CCGCTACTCC TTTTCGTCGT GATAACAAAC GTATAGATGG AAAAATCATC TTTAACTATA AATTAGGCGA TGCTCAGGCT GCAGATTATT ATCGACCTAT CAATTTGAAA ACCATAGAAG AATTTGGTGA TAAAGAACAA CGTGACTTAT GTATAGCAGA GGAAGCTCTG ACTGCACTTC GCCGGGATCG CAATGAGCTC CACCTTGATC ACATACTTAT GGCACGATGC GAAAGTAAGG AACGTGCAGT GAATGTGTTT GAACTATATC AGAAACTTGC ACCAGAAATG TTACCGCAAC TTGTATATTC TGGACCTGGT CGAATACATA AAAACCGCGA GGCACTAGAG AAACTGATTC AGAGGAAGAA AAACGCCGCA CAAATAGTAG TTTGTGTTGA TATGTTAGGA GAAGGCTTTG ACCTACCAAA CCTGAAAGTA GCTGCATTAC ACGATACTCA TAAGTCGCTA GCAATTACTT TACAATTCAT TGGTCGGTTT ACACGTAAAG GAAATTGGGA ACAAATTGGT GAAGCCACTG TCGTGGTAAA TATTGCTGAT TCTGAAACAG AAACAAAATT AGAAGCGTTA TACGCTGAGG GTGCTGATTG GGATAGTTTG ATAAAAAGAT TGAGTGAGGA CCGAATTGAT CAAGAAATCC GTCTACAAAA AATAGTACAT AATTTACGAG AAAGTGGAGA TTTACATACT TTTCTTTCAC TGTGGAACCT TCGTCCTTCA CTATCTATGC AGGTATTCAA AACTCAATGC CAAAATTGGA GCCCTGAGAA CTATTTCAAT GTATTTTCAA AAGATTCAAA AAGCTGGCAT GCCATAAGCA AAGACAATGA CATTCTGATT GCTGTTGTCC ACCGTAAAGC ACGCGTACGA TGGGGTAACT ACCAGAACCT GTTTGATCGA CAATATCATT TATTAATGGC AAGATGGGAC AAAGAGAATG GTGCGCTTTT TATTCACGCA AGTGACTATG AAGAGCTGAA GAGCACTACT TTGGCTAAAG AAATAACAAA CGATAAGACG GTACTTCTAC AGGGGCCGGT TATTTTTAAT ATTTTGAACA ATGTTGAGCT TCCTCTCGTA AAGAATCTTG GTTCATCACG AATAGGTGCA ATTAGCTTTA CATCGTATTT TGGACCAAAT GTAACAGAGG GTTTGGCACT AATTGAACGA GCTGAATCTA CTTTAAATAA TATTGCCTGT GTTGGCTACG AGAATGGGGA GCGTGTTCTT TGGGGAGGAA CTCAAAGGCG TGGGAAAATA TGGCAACAAT CATCAGGGAC AATATCACAG TGGATTGACT GGACAGCACA TACTTGGGAG AAAGTTTCCA CTCAAGATAA TATTGACGTA GCAAATATTA CGAATGAATT TTTACGTCCT CATAAACTTA GTCAACCATA CAATCAATAT CCAATAAGTG TACAATGGGG AGAACAGGTA CAGGCTTCAT TCAGCAATAA TCAATCAATC GTCTTTGATA CAACGGAAGT TCCTCTTTAC TTAGTTGACC TTCAAATATC CGAAGTCAGA GATAACGGTG AAATTATTAT TCGTTTATCA AGTGATACAA ATTCTTCAGA ATATAGTTTC CAGATTAATG ATGACTCTGA GTCAGGATAC TACTATAAAA AAATATCGGG ACCAGATGTG TATTTTGCTA AAGGTACTAA CATAAAAACT GAAGTGTGCG AGTACTTCGT GGTTGATCCA GTGATAGTAA GGTATGTCGA TGGTACATAT TCATATAACT GCTATCATAT ACCGATTCCT CTAAAAGCCG GCGAGTATCC CAGAGAACGA ATAGAAGTCT GGGATTGGGC ATCTATTCCG TTGAACAAGG AATCTATCGG CAAGACCGGT AATAAAAATA CAATCCAGTA TCAAAGTTTT TTGACCATTT CCGATAAGTA TGATGTTGTT TTTAATGATG ATGGTAAAGG CGAGGCTGGT GACCTTGTTT GCTTGAAGGA TATTGACGAG AGTACAATCA AGCTTTGCCT TGTTCATTGT AAAGGGGCTC TTGGTGGTCA GATTTCGAAT GACATCGGAA ATTTCTACAC GCTTTGTGGG CAGGCTCAGA AAAGCATAAC CGTCAAACAT ATGGGCATGA CACGTCTTTA TAATGACCTC AAACGCCGTC ATGAAATATG GGCTAGAGCA GGTTATTCGA GATTCCTTAA AGGTGATATG AAGTATCTTT CATATTTCAA GGAGAAATCC CGGCGTTCTA AATTGCAGTT TGAGGTGATT ATAGTACAGC CAGGAGGATC AAAAGCAGCC CTTAGCACAG ACATTCTCAA ACTCTTGGGA ACGACAGAGC TTTTTCTGAA GACTACGACT CAGGGTAACT TACGTGTAGT GGTATCTCCG TAA
|
Protein sequence | MMKLFPDWQG KITLPLSPLR REVVGGNAVC QQAFRGGDTI AEAEALDNEG YYLVKTRLLF PFNELLLRRK GTRHIETNYL VLLAPDLKRI YELGPDTRLE WDDLGLLTRS FRNPEQILES WKGKFSFREE DNVTGAKGLR PPQIGALHAI AAHFAVGKNF EPATVVLPTG TGKTETMLSM QVYKRLLRTL VIVPSDALRT QIYKKFLTLG VLPDLEVVPS ETIGPNVTRL VTGIRSKEEA LAIIEKSNII ITLPKTLSAS DPEALRILTE YCTDLIVDEA HHIPASQWSL VCQMFSDKRI TQFTATPFRR DNKRIDGKII FNYKLGDAQA ADYYRPINLK TIEEFGDKEQ RDLCIAEEAL TALRRDRNEL HLDHILMARC ESKERAVNVF ELYQKLAPEM LPQLVYSGPG RIHKNREALE KLIQRKKNAA QIVVCVDMLG EGFDLPNLKV AALHDTHKSL AITLQFIGRF TRKGNWEQIG EATVVVNIAD SETETKLEAL YAEGADWDSL IKRLSEDRID QEIRLQKIVH NLRESGDLHT FLSLWNLRPS LSMQVFKTQC QNWSPENYFN VFSKDSKSWH AISKDNDILI AVVHRKARVR WGNYQNLFDR QYHLLMARWD KENGALFIHA SDYEELKSTT LAKEITNDKT VLLQGPVIFN ILNNVELPLV KNLGSSRIGA ISFTSYFGPN VTEGLALIER AESTLNNIAC VGYENGERVL WGGTQRRGKI WQQSSGTISQ WIDWTAHTWE KVSTQDNIDV ANITNEFLRP HKLSQPYNQY PISVQWGEQV QASFSNNQSI VFDTTEVPLY LVDLQISEVR DNGEIIIRLS SDTNSSEYSF QINDDSESGY YYKKISGPDV YFAKGTNIKT EVCEYFVVDP VIVRYVDGTY SYNCYHIPIP LKAGEYPRER IEVWDWASIP LNKESIGKTG NKNTIQYQSF LTISDKYDVV FNDDGKGEAG DLVCLKDIDE STIKLCLVHC KGALGGQISN DIGNFYTLCG QAQKSITVKH MGMTRLYNDL KRRHEIWARA GYSRFLKGDM KYLSYFKEKS RRSKLQFEVI IVQPGGSKAA LSTDILKLLG TTELFLKTTT QGNLRVVVSP
|
| |