Gene Cthe_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2372 
Symbol 
ID4811022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2835874 
End bp2836974 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content39% 
IMG OID640107783 
ProductDNA polymerase III subunit beta 
Protein accessionYP_001038767 
Protein GI125974857 
COG category[L] Replication, recombination and repair 
COG ID[COG0592] DNA polymerase sliding clamp subunit (PCNA homolog) 
TIGRFAM ID[TIGR00663] DNA polymerase III, beta subunit 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAG TTTGTTCCAA AGAACAGCTA ATGGAAGGAA TCAACGTCGT GCAAAAAGCA 
GTGCCGACAA AAGCCACTCT AACCATACTG GAAGGAATAT TGCTGGAAGC ATACGACAAT
TTTAAAATGA CCGGAAATGA TTTGGAACTG GGAATAGAAT GCCTTATAGA TGCAGACATT
CTGGAAAAAG GATCTATAGT CTTAAATTCA AAAATGTTCG GAGACATAGT AAGAAGACTT
CCCGACTCAG AGGTACTTAT TGAAGTTAAA GAGAACAATA CAGTTATCAT TGAATGTGAC
AACTCTCACT TTGAGTTAAG GGGTATGCCT TCTGACAGCT TTCCGTCACT GCCTTCAATC
GAAAAAGAGA ACATGATCAA AGTCAGCCAA AAGGCAATCA GGGATATGAT AAGACAAACA
CTTTTTGCCG TAAGTACGGA AGGAACCAGA CCGATACTTA CCGGTTCACT TATTGAATGT
GCAGGAAACG AAATTACCTT CGTTTCAATA GACGGATTCA GAATGGCTCT GAGAAAAAAC
TTTAACAACG AAGGATTTTC CGAATTCAGT GTTGTCGTAC CCGCAAAAAC CCTCAGCGAG
ATAGGCAAAA TCTTACAGCC GGTTGATGAA GATATTTACA TATATAGTTC TCAAAACCAG
ATACTGTTTG AAATTGGAAA TTGCAAAGTT GTATCAAGAC TTTTAGAGGG TGAATATCTA
AACTATAAAA GTATTATACC ACCGGAATAT GAAACCAGCG TAAGACTTAG AACCGAGGAC
CTTTTGTCCA GCCTTGAAAG GGCGTCATTG ATTACTTCGG ACGAAAAGAA ATACCCGGTT
AAATTTAATA TTATAGACGA TAAAATCATA ATTACCTCCA ACACTGAAAT AGGAGCAGTA
AGGGAAGAAA TCAGAGTCGA AGTAAACGGC AGCAACATGG AAGTGGGCTT CAACCCCAGA
TATTTTATCG AAGCGCTCAG GGTCATAGAT GACGAGCTGG TTGACATATA CTTCAATTCA
AGTGTCGGTC CGTGTACAAT AAGACCTCTT GAAGGCGACA GTTTTGCATA CATGATACTT
CCGGTAAGAA TAAATAAATA A
 
Protein sequence
MKIVCSKEQL MEGINVVQKA VPTKATLTIL EGILLEAYDN FKMTGNDLEL GIECLIDADI 
LEKGSIVLNS KMFGDIVRRL PDSEVLIEVK ENNTVIIECD NSHFELRGMP SDSFPSLPSI
EKENMIKVSQ KAIRDMIRQT LFAVSTEGTR PILTGSLIEC AGNEITFVSI DGFRMALRKN
FNNEGFSEFS VVVPAKTLSE IGKILQPVDE DIYIYSSQNQ ILFEIGNCKV VSRLLEGEYL
NYKSIIPPEY ETSVRLRTED LLSSLERASL ITSDEKKYPV KFNIIDDKII ITSNTEIGAV
REEIRVEVNG SNMEVGFNPR YFIEALRVID DELVDIYFNS SVGPCTIRPL EGDSFAYMIL
PVRINK