Gene Cthe_2144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2144 
Symbol 
ID4811191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2547344 
End bp2548987 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content43% 
IMG OID640107548 
ProductDNA polymerase III subunits gamma and tau 
Protein accessionYP_001038541 
Protein GI125974631 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR00678] DNA polymerase III, delta' subunit
[TIGR02397] DNA polymerase III, subunit gamma and tau 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00119695 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCATATC TGGCTTTGTA TAGAAAATGG AGGCCCTTGG TTTTTGAAGA TGTGGTTGAA 
CAGGAACACG TTGTAAGAAC GCTTAGGAAC AGCATCTGCT CCGGACGTAT AGCTCATGCA
TATCTCTTCT GCGGTACGAG AGGTACAGGA AAAACCACAA TGGCAAAAAT ATTTTCGAGA
GCCGTAAACT GTTTGAATCC AAAAGACGGT GACCCGTGTA ATCAGTGTGA GATTTGCCAG
GGGATACTTA ACGGGAGTCT TTTGGATGTC ATAGAAATAG ATGCTGCTTC AAACAACAGT
GTGGATGATA TAAGGGTTCT CAGGGATGAA GTTATATATA CACCTTCGAA AGCCAGATAC
AAAGTTTATA TTATTGACGA AGTTCACATG CTTTCAACAG GCGCGTTTAA TGCCCTTTTA
AAGACACTTG AAGAGCCTCC GGCCCATGTG ATATTTATTC TTGCCACCAC CGAACCGCAC
AAGCTTCCTG CAACCATACT GTCCCGCTGC CAAAGGTTTG ATTTCAGAAG AATTCCGGTT
GACAGTATTG TAAAGAGAAT TGAGTATATT GCAAAAGAAA GCGGAGTGGA AATACGAAGG
GAAGCTTCCA AATTGATAGC AAAGCTATCC GACGGTGCTT TAAGGGATGC CATAAGCATA
TTGGATCAGT GCATATCCTT GGGAAGCAAG GAGCTGACCT ATGAGGATGT CTTGTCTGTG
GTAGGCCTTG TGACAGATAC TTTTATCGCT GAGGTTGTGG ATGCCATAAA AAACAAAGAG
GTAGGCAGAG TGTTGAATGC CGTTGATGAG CTGGTGATGG AGGGCAAGAA TATCGGCCAG
TTTGTTTCGG AGCTGGTGAT GTATTACAGG AATTTAATGA TTTGCAGTTC GCTATCAAAT
CCCGAAGATA TAATTGATGC CTCTGCGGAC TCAATTCAAA GAATGAAGGA ACAATGCAAA
GGTTTGGAGT TGTTTGAGAT TGTAACGGTG ATAAAGGAGC TGTCTTCACT GGAAGCGGCG
TTAAAATGGT CCACACATCC CAGGGTTCTT TTGGAAACAA CACTTATCAA GCTTTGCGAA
AACAGGTTGG ACCCGGGAGA TGCCGGGGTA TTGGAGAGAA TCCGGCTGCT TGAGAAGAAT
GTTAATGATA TTTTGGAAAA GGGTGTTGCC ATACCCCAAA ATGGTTCTGA CGGTTCGGGC
GGTTCTCAAA AGGATTCCGG AAGTGTGGAT TCGGATGAAA AAAATTCGTC ACCTGAAGTT
AACGAGCGTA TTGAAAAAAA GAATATTGCA AAAAATGTAA AGGGAATTGA AGTGTGGGGC
AAGGTGCTGG ACGAGCTGAA AAGCAGGGGC AGAATGGCTG TATATGCCTA TCTTTTGGAT
ACCAAACTTA TAGAGTTGGG TTCCAACCAA GTCGGGATAG TATTTAAAAA TAACGGCTGT
AAAATGCTTG TTGAAAAGTC CGAGAACCTT GAAGTGATAG AAGAGTGCCT TCGCGAATGC
CTTGGAAAAG AGGTCAGGGT TAAATGTTTT GACGAAGAGG ACATTGTTGA TACAGGAAAA
AATGATGAAG AGGACAAGCT GGTTGAGAAA GCGCAGGATT TTGCTCAAAA GTTTGATGTT
GAGGTCAATA TAATTGACGA ATAA
 
Protein sequence
MSYLALYRKW RPLVFEDVVE QEHVVRTLRN SICSGRIAHA YLFCGTRGTG KTTMAKIFSR 
AVNCLNPKDG DPCNQCEICQ GILNGSLLDV IEIDAASNNS VDDIRVLRDE VIYTPSKARY
KVYIIDEVHM LSTGAFNALL KTLEEPPAHV IFILATTEPH KLPATILSRC QRFDFRRIPV
DSIVKRIEYI AKESGVEIRR EASKLIAKLS DGALRDAISI LDQCISLGSK ELTYEDVLSV
VGLVTDTFIA EVVDAIKNKE VGRVLNAVDE LVMEGKNIGQ FVSELVMYYR NLMICSSLSN
PEDIIDASAD SIQRMKEQCK GLELFEIVTV IKELSSLEAA LKWSTHPRVL LETTLIKLCE
NRLDPGDAGV LERIRLLEKN VNDILEKGVA IPQNGSDGSG GSQKDSGSVD SDEKNSSPEV
NERIEKKNIA KNVKGIEVWG KVLDELKSRG RMAVYAYLLD TKLIELGSNQ VGIVFKNNGC
KMLVEKSENL EVIEECLREC LGKEVRVKCF DEEDIVDTGK NDEEDKLVEK AQDFAQKFDV
EVNIIDE