Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2876 |
Symbol | |
ID | 4809156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3399024 |
End bp | 3401249 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108295 |
Product | ATP-dependent DNA helicase PcrA |
Protein accession | YP_001039267 |
Protein GI | 125975357 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0210] Superfamily I DNA and RNA helicases |
TIGRFAM ID | [TIGR01073] ATP-dependent DNA helicase PcrA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTAT TGAAAGATTT GAATAAGGAA CAAAGGGAAG CGGTACTTCA CGTTGACGGC CCCTTGCTTG TTCTTGCCGG TGCGGGAAGC GGAAAGACAA AAGTGCTTAC GCACAGGATT GCGTATTTAA TTAAAGAGAA AAATGTACAT CCTGCCAGTA TCCTTGCCAT AACTTTTACC AACAAGGCTG CAAGGGAAAT GAGAGAAAGA ATAGACCGGC TTGTTGAAGA TGTCAGTGAC AGTATATGGG TAAGTACTTT TCATTCCATG TGTGTAAGGA TACTAAGAAG GGATATAGAA AAAATAGATT ATGATAAAAA CTTTGTGATA TTTGACTATG CCGATCAGCA AAATGTTGTA AAAGACTGCC TGAAGGAACT TAATCTCAGT GATAAAAACT TTCCTCCGAA GTCAATATTG GAGATGATCG GAAGGGCAAA GGACGAGCTT ATCACTCCGG ACTCTTATTT AAAAATGTAT TCCGGAGATT TCAGGATGGA GAAGATCGCC CGGGTCTATG AGCTTTATCA AAAGAAGCTG AAGCAAAATA ACGCTTTGGA TTTTGACGAT ATTATAATGC TTACGATAAA GCTTTTTTTG GACAATCCCG AAGTGCTGAA TTATTACCAG AGAAAATTCA AATATATTCT GGTGGATGAG TATCAGGACA CCAACACAGC CCAGTATTCC CTTGTAAGCC TTTTGGCACA GGGGTACAGA AATCTTTGTG TTGTTGGGGA CGACGACCAG TCCATCTATG GTTGGAGAGG CGCCAATATA AGAAATATTC TGGATTTTGA GAAGGAGTTT AAGGATGCAA AAGTAATAAA GCTTGAGCAG AACTACCGTT CAACCCAGAT TATCCTTGAT GCGGCAAACC ATGTTATCAA GAACAATGTG GGAAGAAAAG CCAAAAGGCT TTGGACAAAC AATAAAGGAG GAGACGGGAT TAGATATCTC GAGTGCCTGA ATGAGCATGA AGAGGCGTAT TTTGTGGCAA GTGAGATAAA GAGGCTCTGC AGGGAGCAGA ACCGCTCTTA CAAGGATTTT GCCGTACTTT ATCGTATAAA TGCCATGTCC CGTGTTATTG AGGATGAGCT TATGAGGGAA GGAATAAGCT ACAAGATATT TGGAGGGCTT AGGTTCTATG ACAGAAAAGA AATAAAAGAT GTAATTGCCT ATCTTCGTGT TATTCAAAAT CCTTCGGATA ACATAAGTCT AAAGAGGATA ATAAATGAGC CTAAAAGAGG TATTGGAAAT GCGACCATAG ATACTGCAGA GAGACTGGCA AATGAAAGGG GAGTCAGCAT TTTCTCAATT ATTTCATCGG CGGCGGAAAT TCCCGAGCTT GCCCGGGCTT CGTCAAAGCT TGAAAAGTTT GTCTCGCTGA TAAACAGCCT GAGGGCCCAA AGTATGGTGA TGACCGCATC GGAAATGATT GAGGAGGTCC TTGAGAGGAC GGGCATTTTG GAGGCGTATA GACAGGAAAA TACCCTGGAA GCCCAGAGCA GAATAGAAAA CATAAAAGAG TTGCTCTCCG TTGCAATTGA ATTTGAAAAT GAAAGCGAGG AAAAGAGCCT TACGGATTTT CTTGCCCATG TGTCCCTGGT TTCGGATGTC GATACGATGG ATGAAAACAG TGAATATGTT GCTCTTATGA CCCTTCACAG CGCAAAGGGA TTGGAGTTTC CGGTGGTATT TCTGGTGGGA ATGGAGGAAG GTATTTTCCC GGGTTACAGG TCAATGACCA ATGAATCGGA GCTGGAGGAG GAGAGGAGGC TTTGTTATGT AGGCATAACG AGGGCGAAAG AAAACCTTTA CATGACAAGC ACCTTCAGCC GTACGTTGTT TGGAAATACC ACCTACAACA GAGTTTCAAG ATTTGTTAAA GAAATTCCCG AGGAGCTTTT TGATTTCGGC GGGGATAAAA AGAAAGCAAA GGATGAGAAC ATTGGCGAAG GAACGGCAAA AAAGAGCGGT ACATTAAACA AGGACTTTAA CTCTTCTAAA AGCTTTAATG CGGTAAGCTT TAAGCCTGTG CAAAGGACGG AGACGTCAAA AACGGAGCTT AAAGTTGGTG ACGTGGTGCG GCACAAAGTC TTCGGGGAAG GAATTATAAC AAAAAGAGAG CCTGACGGCG ATGATTTCAA ACTGGAGATA CATTTTAAGG GAAAAGGCAT GAAGAGGCTT TTGGAAAGCT ATGCGAATCT GACTAAGGTG AATTAA
|
Protein sequence | MDLLKDLNKE QREAVLHVDG PLLVLAGAGS GKTKVLTHRI AYLIKEKNVH PASILAITFT NKAAREMRER IDRLVEDVSD SIWVSTFHSM CVRILRRDIE KIDYDKNFVI FDYADQQNVV KDCLKELNLS DKNFPPKSIL EMIGRAKDEL ITPDSYLKMY SGDFRMEKIA RVYELYQKKL KQNNALDFDD IIMLTIKLFL DNPEVLNYYQ RKFKYILVDE YQDTNTAQYS LVSLLAQGYR NLCVVGDDDQ SIYGWRGANI RNILDFEKEF KDAKVIKLEQ NYRSTQIILD AANHVIKNNV GRKAKRLWTN NKGGDGIRYL ECLNEHEEAY FVASEIKRLC REQNRSYKDF AVLYRINAMS RVIEDELMRE GISYKIFGGL RFYDRKEIKD VIAYLRVIQN PSDNISLKRI INEPKRGIGN ATIDTAERLA NERGVSIFSI ISSAAEIPEL ARASSKLEKF VSLINSLRAQ SMVMTASEMI EEVLERTGIL EAYRQENTLE AQSRIENIKE LLSVAIEFEN ESEEKSLTDF LAHVSLVSDV DTMDENSEYV ALMTLHSAKG LEFPVVFLVG MEEGIFPGYR SMTNESELEE ERRLCYVGIT RAKENLYMTS TFSRTLFGNT TYNRVSRFVK EIPEELFDFG GDKKKAKDEN IGEGTAKKSG TLNKDFNSSK SFNAVSFKPV QRTETSKTEL KVGDVVRHKV FGEGIITKRE PDGDDFKLEI HFKGKGMKRL LESYANLTKV N
|
| |