Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2256 |
Symbol | |
ID | 4809994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2684571 |
End bp | 2685908 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107662 |
Product | primary replicative DNA helicase |
Protein accession | YP_001038651 |
Protein GI | 125974741 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0305] Replicative DNA helicase |
TIGRFAM ID | [TIGR00665] replicative DNA helicase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000224677 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTGG GTTCTTTTGG CCGGATACCC CCGCAAAATA TTGAGGCTGA GCAGTCGGTA CTGGGTGCCA TACTGCTGGA CAAAGAAGTT CTGTCAAGTG TAACGGAGAT AATTTCAAGC CAGGACTTTT ACAGAGAGGA CCATAGGGAA ATATTCGAAG CAATAATGGA CCTTTATGAA AAGGGAGAAC CTATTGACCT CATTACAGTT GCGGAACAGC TTAAGGTAAG AGGGAGTCTG GAGGCGGTAG GTGGGCTTGA GTATCTTACG AATCTGGCAA GTTTGGTTCC TACCACTGCA AATGCAAAAC ATTATGCAAA AATAGTCGAA GAAAAATCTA TATTAAGAAG GCTTATCAAG GCTTCCAGTG AAATAATCAA TATGGGTTAT GAAGCCGCGG AAGAGGTTTC CTATGTTTTG GACAAGGCTG AAAAAAGCAT ATTTGACGTA CTTCAGAAGA GAAACACCCA GGGGTTTGCC CTGATTAAAG ATGTATTGAT TGATACTTTC AACCGGCTTG AGGAGCTTTA CAACAATAAA GGATACATCA CAGGAATTCC CACCGGATTT GTGGATTTGG ACTACAAGAC TGCAGGGCTT CAAAATTCGG ACTTGATATT GATTGCGGCA AGACCGGCCA TGGGGAAGAC TTCTTTTGTA CTGAACATAG CTCAATATGC TGCCATTCAC GCAAAGGTGC CTGTCGCCAT ATTCAGTCTG GAAATGTCGA AAGAACAGTT GGTAAACAGA ATGCTATGCT GTGAAGCCAT GGTGGACAGT CAGAAGATGA GAACCGGAAA GCTGGAGGAC AGTGACTGGC AGAAAGTAGC AAGAGCATTG GGGCCTTTGT CCGAAGCGCC GATTTATATT GATGACACCC CTGGACTTTC TGTCGCGGAA ATAAGGGCAA AATGCAGAAG ACTCAAGCTG GAAAAAAATT TGGGTCTGGT TGTCATAGAT TACCTTCAGC TTATGCAGGG AAGGGGAAAA AGTGAGAGCA GGCAGCAGGA AATATCGGAG ATATCTAGGT CTCTTAAGAT ACTGGCAAAG GAGATAAACG TACCTGTGCT GACTTTGTCC CAGCTTAGCC GTGCCCCGGA GTTAAGGTCG GATCACAGGC CTATTTTAAG CGACCTGAGG GAATCCGGCG CAATAGAGCA GGATGCGGAT ATAGTAATGT TCTTATACAG GGATGACTAT TACAATCCCG ATACTGAAAA GAAGAATATT GCCGAAGTGA TAATTGCAAA GCACAGAAAC GGTTCGACGG GTACTGTGGA ACTGGCATGG CTGGGTCAGT ATACGAAGTT TGCGAACCTT GAGAAATATA GACAATAG
|
Protein sequence | MDLGSFGRIP PQNIEAEQSV LGAILLDKEV LSSVTEIISS QDFYREDHRE IFEAIMDLYE KGEPIDLITV AEQLKVRGSL EAVGGLEYLT NLASLVPTTA NAKHYAKIVE EKSILRRLIK ASSEIINMGY EAAEEVSYVL DKAEKSIFDV LQKRNTQGFA LIKDVLIDTF NRLEELYNNK GYITGIPTGF VDLDYKTAGL QNSDLILIAA RPAMGKTSFV LNIAQYAAIH AKVPVAIFSL EMSKEQLVNR MLCCEAMVDS QKMRTGKLED SDWQKVARAL GPLSEAPIYI DDTPGLSVAE IRAKCRRLKL EKNLGLVVID YLQLMQGRGK SESRQQEISE ISRSLKILAK EINVPVLTLS QLSRAPELRS DHRPILSDLR ESGAIEQDAD IVMFLYRDDY YNPDTEKKNI AEVIIAKHRN GSTGTVELAW LGQYTKFANL EKYRQ
|
| |