Gene Athe_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2107 
Symbol 
ID7408816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2234257 
End bp2235861 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content39% 
IMG OID643716473 
ProductDNA polymerase III, subunits gamma and tau 
Protein accessionYP_002573956 
Protein GI222530074 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR02397] DNA polymerase III, subunit gamma and tau 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00560473 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATAG CACTTTACAG GAAATATAGG CCTAAGGTGT TTGAAGATGT TGTTGCACAA 
GAGCATATAA CAAGGACTTT AAAAAATCAA ATAAAACAGG ACAAAGTAGC TCACGCATAC
ATCTTTTGCG GCCCGAGAGG CACTGGCAAG ACAACAACTG CAAAAATAAT GTCAAGAGCT
GTGAACTGTT TGAATCCCAA GGATGGGAAT CCGTGCAACG AATGTGAGAT ATGCAGAAGC
ATCTTGGATG AAAAGACGCT TGATGTTTTG GAGATTGACG CTGCATCAAA CACAAGTGTC
AACGACGTGA GGCAAATCAG GGACGAGGTC AGGTACCCAC CTTCTGTTTG CAAAAAGAAG
GTATATATAA TAGACGAGGT GCACATGCTC TCAACAGGGG CTTTCAACGC GCTTTTAAAA
ACCCTTGAAG AGCCCCCCTC CCATGCACTT TTTATTCTGG CAACCACAGA TATTCAAAAA
GTACCTGCAA CCATTCTTTC AAGATGTCAG AGGTTTGATT TTAAAAGGAT TTCTGTAAAG
GACATATATG AAAGGCTCAA AAAGATTGTT CAGATGGAAA ATATATCAAT TGACGATAAT
GCCCTGTATT TGATCTCACA AAAGGCAGAA GGTGCTCTGA GGGATGCTTT GACCATATTA
GAAAGGTGCA TGAATACATC TGATGAACAT ATAACCTACA AGTTTGTTGC AAACCTTTTA
GGTGTTACAT CAACCGAGAT AGTGAAAGAA TATATTGCTG CTATTGTAGA AAATGATTCC
AACAAAGGAC TCAAAGTTAT AAATAGGCTT TGGGATGAAG GAATGGATGT AAATACTTTT
TTAGAAGAAG CTGTGAAGCT TTTGAGAAGT GCACTGATTT TACGACTTGG TGCAAAAGAT
GTTTTGGTTG ACATGCTTGA AAGTGATAAA GATTTTGTCA TTAACATATC AAACCTTGTT
GATTCAAACA GACTTGTTTC AATCATAAAG ATGCTTATTG ACACTGCCAA CCAGATACGC
TGGACAAGAT TTCCAAAGGT TTTGCTTGAG ATAAATACAA TAAAACTTTG CGATAGCCAG
TTTGACACCT CATTTGAAAC GCTCATTGAA AGAGTTCGAA AACTTGAGAC AAAGCTTTCT
CAGCTTGCCG AAAATCCCAA GGCTTTTGAA GCCATGAAAC TTGACAAGGC TCAATCTACA
AAACAAGAGC AAAAGATCTC GCATATAGCT GACAAAAGCG CAGAAGGTGT GGACAGCAAT
GCATCTTTTT CATGGTCTGA GATTTTGAGC AGGTGGCAGG AAATAAAAGA GGCTATCAAG
GAGGAAAAGC CGGGACTTTC GCATGTTCTT CAAAATGCCA GCCTGAGGTT AGAAAATGGT
GTGAAGGTAT GTTTTAAGCA GGAAGATAGT GTGTTTGCAG AGGTTTTGAG CAGAAACATG
GAGTATTTTA AGTCAATTCT AAAGAGGATT GTGGGGTATG AAGGTGAGGT CTCTGTTGAT
GTTGAAAAGC AAGAGCCATT TAAAGAAAAT ACTGTGTCTG ACCAAGAGAT AATAAACAAG
CTCAAGGACA TCTTCCCTGA CACAGAGATT ACTATAAAAG AGTGA
 
Protein sequence
MHIALYRKYR PKVFEDVVAQ EHITRTLKNQ IKQDKVAHAY IFCGPRGTGK TTTAKIMSRA 
VNCLNPKDGN PCNECEICRS ILDEKTLDVL EIDAASNTSV NDVRQIRDEV RYPPSVCKKK
VYIIDEVHML STGAFNALLK TLEEPPSHAL FILATTDIQK VPATILSRCQ RFDFKRISVK
DIYERLKKIV QMENISIDDN ALYLISQKAE GALRDALTIL ERCMNTSDEH ITYKFVANLL
GVTSTEIVKE YIAAIVENDS NKGLKVINRL WDEGMDVNTF LEEAVKLLRS ALILRLGAKD
VLVDMLESDK DFVINISNLV DSNRLVSIIK MLIDTANQIR WTRFPKVLLE INTIKLCDSQ
FDTSFETLIE RVRKLETKLS QLAENPKAFE AMKLDKAQST KQEQKISHIA DKSAEGVDSN
ASFSWSEILS RWQEIKEAIK EEKPGLSHVL QNASLRLENG VKVCFKQEDS VFAEVLSRNM
EYFKSILKRI VGYEGEVSVD VEKQEPFKEN TVSDQEIINK LKDIFPDTEI TIKE