Gene Moth_0027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0027 
Symbol 
ID3830893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp29099 
End bp30868 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content64% 
IMG OID637827960 
ProductDNA polymerase III, subunits gamma and tau 
Protein accessionYP_428910 
Protein GI83588901 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR00678] DNA polymerase III, delta' subunit
[TIGR02397] DNA polymerase III, subunit gamma and tau 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000154817 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGCCCAGT ACCAGGCCCT GTACCGGCAG TGGCGGCCTC GCACCTTTGC CGAGGTGGTG 
GGGCAGGAAC ATATAACCAG GACCCTTCGT AACGCCCTGC GTACCGGGCG CCTGGTTCAC
GCCTACCTCT TCTGTGGCCC CAGGGGTACG GGGAAAACCA GCACGGCGAA AATACTGGCC
CGGGCCATCA ATTGCCTGGC ACCCCGGGAG GGGGAACCCT GTAACGAATG CGCCAATTGC
CGGCGTATCC TGGCCGGAAA CTCCCTGGAC GTCCTGGAGA TGGACGCCGC CTCCAACCGG
GGGATTGACG AGATCCGCAA TCTGATTGAG AAAATACCTC TGGGTCCGGT AGAAGGCAGG
TACAAGGTTT ATATTATCGA TGAAGTCCAT ATGCTGACCC AGGAGGCCTT TAATGCCCTC
CTGAAAACCC TAGAAGAGCC CCCGGCCCAT GCAGTCTTTA TTCTGGCTAC CACCGAGCCG
CGCAAGGTTC TGCCGACCAT CCTATCCCGC TGCCAGCGCT TTGATTTTCA CCCCCTGACG
GTACAGGCCA TTGCCGGTCG CCTTCAGGAA GTGGCAGCAG CCAACGGGGT GGAGATTGAG
CCCGGAGCTT TGAGCCTCTT ATCCCGCAAG GCTGCCGGCG GCTTGCGGGA TGCCCTCAGC
CTCCTGGACC AGATTCTGGC CAGCGGCACC AGGGGACCGG TAACGGCCGG GCAGGTAGCA
GTTACCCTGG GCACGGCGCG ACTGGACACC CTCCTGGCCC TGACCGATGC CCTGGCTACC
GGCGATGGTG CCGGGGTGTT GAACCTGGTA GATAAAGCCC TGGCATCCGG TATCGAGCCC
CGGCGCCTGC TTGAGGACTT ACTCGATCAT ACCCGCAACC TTCTGCTCTT AAAAGTGGAC
CCCGGCGCCG GCTCCCTGAC CGGCCTTCTG CCTGAGGAAG TGGAGCAGGT GGCGGCTCAA
GCCCGGCAGT TTGACCACCA TCGCCTGCTA GACCTCATGG AAAGGCTACA GCAAGGTGGA
GCGGCTCTGC GCCGGAGCAA CCAGCCGCGG GTCATCCTGG AGATGACCCT GGCTGGCTTC
CTGGTTGCCC CCGGCCCCTC CCTGGAAGGC CTGGCCCGGC GGGTGGCGGA ACTGGAGGCA
CGTCTGGCCG CCCTGGAAGG TTCCGCTCCC AGCAGGACCC GGGAAAAGGT TGCCACCAGG
AACCAAGGAG AAGGGGGCAG GCCGCCGGTT GCCCCGGGGA GCCGGGCTGA TGCCGGCGGG
CAGGAACGGG GAAACAGCGG CCCACTTGAG GCCCCCGGGG CAGGGTCTCC TGCTGGACGG
GCCCGAGTAA TTAACCGTTC CCGGGAATTC CCGCAGGCTG CCGGCAGGGA TCGGGCAGGG
CTGGCCGGGC CGGAGACCGC GACCGGCCTG GAACCCGGTT CGCCGGCATC TCCGGGGCTG
GAACTGGCCG TAGTGCAGGA ACACTGGCCG GAAGTCCTGG CTGCCGCCCG CAGGGAAAGC
ATCCAGCTTC AGGCCTTCCT GCGGGAGGGC GAACCGGTAG CAATAGATGG TGATACCCTG
ACTCTGGCCG TCAAAGCCGA TTTTCACCGG GGCATGCTGG AGCAGCCTGG TAACAGGCAG
AAGGTGGAGA AAGCCCTGGC TGCGGTCTTC GGCCGGCCTT TAAAAGTAGT CATTACCTCC
GGGAAACCCT CCCCACCGGG GGACACCGGC GATACCTTGA CCCGGTTGGT GAACTTTTTC
GGGGCGGATA AAGTGGAGAT CAAGGACTGA
 
Protein sequence
MAQYQALYRQ WRPRTFAEVV GQEHITRTLR NALRTGRLVH AYLFCGPRGT GKTSTAKILA 
RAINCLAPRE GEPCNECANC RRILAGNSLD VLEMDAASNR GIDEIRNLIE KIPLGPVEGR
YKVYIIDEVH MLTQEAFNAL LKTLEEPPAH AVFILATTEP RKVLPTILSR CQRFDFHPLT
VQAIAGRLQE VAAANGVEIE PGALSLLSRK AAGGLRDALS LLDQILASGT RGPVTAGQVA
VTLGTARLDT LLALTDALAT GDGAGVLNLV DKALASGIEP RRLLEDLLDH TRNLLLLKVD
PGAGSLTGLL PEEVEQVAAQ ARQFDHHRLL DLMERLQQGG AALRRSNQPR VILEMTLAGF
LVAPGPSLEG LARRVAELEA RLAALEGSAP SRTREKVATR NQGEGGRPPV APGSRADAGG
QERGNSGPLE APGAGSPAGR ARVINRSREF PQAAGRDRAG LAGPETATGL EPGSPASPGL
ELAVVQEHWP EVLAAARRES IQLQAFLREG EPVAIDGDTL TLAVKADFHR GMLEQPGNRQ
KVEKALAAVF GRPLKVVITS GKPSPPGDTG DTLTRLVNFF GADKVEIKD