Gene Cthe_2876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2876 
Symbol 
ID4809156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3399024 
End bp3401249 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content42% 
IMG OID640108295 
ProductATP-dependent DNA helicase PcrA 
Protein accessionYP_001039267 
Protein GI125975357 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID[TIGR01073] ATP-dependent DNA helicase PcrA 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTAT TGAAAGATTT GAATAAGGAA CAAAGGGAAG CGGTACTTCA CGTTGACGGC 
CCCTTGCTTG TTCTTGCCGG TGCGGGAAGC GGAAAGACAA AAGTGCTTAC GCACAGGATT
GCGTATTTAA TTAAAGAGAA AAATGTACAT CCTGCCAGTA TCCTTGCCAT AACTTTTACC
AACAAGGCTG CAAGGGAAAT GAGAGAAAGA ATAGACCGGC TTGTTGAAGA TGTCAGTGAC
AGTATATGGG TAAGTACTTT TCATTCCATG TGTGTAAGGA TACTAAGAAG GGATATAGAA
AAAATAGATT ATGATAAAAA CTTTGTGATA TTTGACTATG CCGATCAGCA AAATGTTGTA
AAAGACTGCC TGAAGGAACT TAATCTCAGT GATAAAAACT TTCCTCCGAA GTCAATATTG
GAGATGATCG GAAGGGCAAA GGACGAGCTT ATCACTCCGG ACTCTTATTT AAAAATGTAT
TCCGGAGATT TCAGGATGGA GAAGATCGCC CGGGTCTATG AGCTTTATCA AAAGAAGCTG
AAGCAAAATA ACGCTTTGGA TTTTGACGAT ATTATAATGC TTACGATAAA GCTTTTTTTG
GACAATCCCG AAGTGCTGAA TTATTACCAG AGAAAATTCA AATATATTCT GGTGGATGAG
TATCAGGACA CCAACACAGC CCAGTATTCC CTTGTAAGCC TTTTGGCACA GGGGTACAGA
AATCTTTGTG TTGTTGGGGA CGACGACCAG TCCATCTATG GTTGGAGAGG CGCCAATATA
AGAAATATTC TGGATTTTGA GAAGGAGTTT AAGGATGCAA AAGTAATAAA GCTTGAGCAG
AACTACCGTT CAACCCAGAT TATCCTTGAT GCGGCAAACC ATGTTATCAA GAACAATGTG
GGAAGAAAAG CCAAAAGGCT TTGGACAAAC AATAAAGGAG GAGACGGGAT TAGATATCTC
GAGTGCCTGA ATGAGCATGA AGAGGCGTAT TTTGTGGCAA GTGAGATAAA GAGGCTCTGC
AGGGAGCAGA ACCGCTCTTA CAAGGATTTT GCCGTACTTT ATCGTATAAA TGCCATGTCC
CGTGTTATTG AGGATGAGCT TATGAGGGAA GGAATAAGCT ACAAGATATT TGGAGGGCTT
AGGTTCTATG ACAGAAAAGA AATAAAAGAT GTAATTGCCT ATCTTCGTGT TATTCAAAAT
CCTTCGGATA ACATAAGTCT AAAGAGGATA ATAAATGAGC CTAAAAGAGG TATTGGAAAT
GCGACCATAG ATACTGCAGA GAGACTGGCA AATGAAAGGG GAGTCAGCAT TTTCTCAATT
ATTTCATCGG CGGCGGAAAT TCCCGAGCTT GCCCGGGCTT CGTCAAAGCT TGAAAAGTTT
GTCTCGCTGA TAAACAGCCT GAGGGCCCAA AGTATGGTGA TGACCGCATC GGAAATGATT
GAGGAGGTCC TTGAGAGGAC GGGCATTTTG GAGGCGTATA GACAGGAAAA TACCCTGGAA
GCCCAGAGCA GAATAGAAAA CATAAAAGAG TTGCTCTCCG TTGCAATTGA ATTTGAAAAT
GAAAGCGAGG AAAAGAGCCT TACGGATTTT CTTGCCCATG TGTCCCTGGT TTCGGATGTC
GATACGATGG ATGAAAACAG TGAATATGTT GCTCTTATGA CCCTTCACAG CGCAAAGGGA
TTGGAGTTTC CGGTGGTATT TCTGGTGGGA ATGGAGGAAG GTATTTTCCC GGGTTACAGG
TCAATGACCA ATGAATCGGA GCTGGAGGAG GAGAGGAGGC TTTGTTATGT AGGCATAACG
AGGGCGAAAG AAAACCTTTA CATGACAAGC ACCTTCAGCC GTACGTTGTT TGGAAATACC
ACCTACAACA GAGTTTCAAG ATTTGTTAAA GAAATTCCCG AGGAGCTTTT TGATTTCGGC
GGGGATAAAA AGAAAGCAAA GGATGAGAAC ATTGGCGAAG GAACGGCAAA AAAGAGCGGT
ACATTAAACA AGGACTTTAA CTCTTCTAAA AGCTTTAATG CGGTAAGCTT TAAGCCTGTG
CAAAGGACGG AGACGTCAAA AACGGAGCTT AAAGTTGGTG ACGTGGTGCG GCACAAAGTC
TTCGGGGAAG GAATTATAAC AAAAAGAGAG CCTGACGGCG ATGATTTCAA ACTGGAGATA
CATTTTAAGG GAAAAGGCAT GAAGAGGCTT TTGGAAAGCT ATGCGAATCT GACTAAGGTG
AATTAA
 
Protein sequence
MDLLKDLNKE QREAVLHVDG PLLVLAGAGS GKTKVLTHRI AYLIKEKNVH PASILAITFT 
NKAAREMRER IDRLVEDVSD SIWVSTFHSM CVRILRRDIE KIDYDKNFVI FDYADQQNVV
KDCLKELNLS DKNFPPKSIL EMIGRAKDEL ITPDSYLKMY SGDFRMEKIA RVYELYQKKL
KQNNALDFDD IIMLTIKLFL DNPEVLNYYQ RKFKYILVDE YQDTNTAQYS LVSLLAQGYR
NLCVVGDDDQ SIYGWRGANI RNILDFEKEF KDAKVIKLEQ NYRSTQIILD AANHVIKNNV
GRKAKRLWTN NKGGDGIRYL ECLNEHEEAY FVASEIKRLC REQNRSYKDF AVLYRINAMS
RVIEDELMRE GISYKIFGGL RFYDRKEIKD VIAYLRVIQN PSDNISLKRI INEPKRGIGN
ATIDTAERLA NERGVSIFSI ISSAAEIPEL ARASSKLEKF VSLINSLRAQ SMVMTASEMI
EEVLERTGIL EAYRQENTLE AQSRIENIKE LLSVAIEFEN ESEEKSLTDF LAHVSLVSDV
DTMDENSEYV ALMTLHSAKG LEFPVVFLVG MEEGIFPGYR SMTNESELEE ERRLCYVGIT
RAKENLYMTS TFSRTLFGNT TYNRVSRFVK EIPEELFDFG GDKKKAKDEN IGEGTAKKSG
TLNKDFNSSK SFNAVSFKPV QRTETSKTEL KVGDVVRHKV FGEGIITKRE PDGDDFKLEI
HFKGKGMKRL LESYANLTKV N