Gene Tmz1t_1948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1948 
Symbol 
ID7084416 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2189236 
End bp2192463 
Gene Length3228 bp 
Protein Length1075 aa 
Translation table11 
GC content74% 
IMG OID643698973 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_002355595 
Protein GI217970361 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCCCG CCTACGCCGA ACTCCACTGC CTGAGCAACT TCAGCTTCCA GCGCGGCGCC 
TCGCACCCCG AGGAGCTCGT CGCGCAGGCG GCAGCCTTCG GCTACAGCGC GATCGCCCTC
ACCGACGAAT GCTCCTTCGC CGGCGTGGTG CGCGCCCACC GCGCGCTGCA GGCCCTGCCC
GAAGCCGGCC GGCCGCGCCT GATCGTCGGC TGCGAGCTGC GCCTCGCCGA CGGGCCCTGC
CTCGTCCTGC TCGCCGCCGA TCGCGCCGGC TACGGCCGGC TCTCGCGCCT GCTCACCCAG
GCCAGGCGGA GCGCCGACAA GGGCCGCTAC CACCTCACGC GCGCGATGCT CGACGCCACC
CGCGCAGACG AGGCGGCCGC CGAAGCCCTG CCCGGCTGCC TGGCGCTGCT GATCCCACCC
GAGGAATTCC CGCCGCCGGG TGCCGCCGGC GACGCGGCCG TCGCCCTGCG CAGCGAAGCG
CGCTGGCTGG CGAAGCGCTT CCCCGGACGC TCCTGGATCG CCCACACCCC GCGCCTGGAC
GGACGCGATC CCGCCCGCCT GCACTGGATC CTGCGCATCG CCCAGGCCAC CGGCGTCCCT
GCGGTCGCCG CCGGCGGCGC GCTCATGCAC GAGGCCGCGC GGCGCCCGCT CGCCGACGTG
ATGAGCGCGC TGCGCCTGCA CTGCAGCGTG GCCGAGGCCG GCCTGGCGCT CGCGCCCAAT
GCCGAGCTGC GCCTGCACGA GCGCGCGACC CTCGCCCGCC GCCATCCTCC CGAGCTGCTC
GCCGAGACCG TGGCGGTGGC GGCGCGCTGC CGCTTCGACC TCGACGAGCT GCGCTACGAG
TACCCCGCCG AGCTCGTCCC CGCGGACGAG ACCCCGACGC GCTGGCTGCG CCGGCTGGTC
GAGGGCGGGC TGGCCTGGCG CTACGGCAAG GCCGGCAAGG AGCGCCGCGC AGCCGCGCCT
GCCACGATCG CGGCCGCACC GCCACCGGCC GAAGACCCCG CCCCGCCCAC GGTGCGCGCG
CAGATCGAGC ACGAGCTCGC GCTCATCGCC GAGCTCGGCT ACGAGCCCTA CTTCCTCACC
GTGCACGACA TCGTGCGCTT CGCCCGCGAG CGCGGCATCC TGTGCCAGGG CCGCGGCTCG
GCGGCCAACT CGGTGGTGTG CTGGGCACTG GGCATCACCG AGGTCGATCC GCAGCTGGGC
ATCATGCTGG TCGAGCGCTT CATCTCGCGA GAGCGCGACG AGCCGCCCGA CATCGACGTC
GACTTCGAGC ACGAGCGCCG CGAGGAGGTC ATCCAGTACC TCTACCGCAA GTACGGCCGC
GAACGCGCCG CGCTCGCCGC CACCGTGATC CGCTACCGCG CACGCAGCGC GCTGCGCGAC
GTCGGCCGCG CGCTCGGCCT CGACGAGACG CAGATCGAGC GCCTCACCCG CGAGCACCAC
TGGTTCGACG GCCGCCACAT CCTGCCCGAG CGCCTCGCGG AGGCCGGGCT CGACCCCGCC
AGCCCGGTCA CGCAAAGGCT GGTCGCACTC ACCGAGGAAC TCATCGGCTT TCCGCGCCAC
CTCTCCCAGC ACGTCGGCGG CTTCGTCATC GCGCGCGGCC GGCTCGACGA GCTGGTGCCG
GTGGAGAACG CCGCCATGTC CGATCGCACC GTGATCCAGT GGGACAAGGA CGACCTCGAC
GCGGTCGGCC TGATGAAGAT CGACGTGCTC GCGCTCGGCA TGCTGTCGGC GCTGCGTCGC
GGCCTCGCCC TGGTGTCGGC ATGGCGCGGC GAGGCGCTGA CGCTGGCGAC CATCCCGCGC
GAACGAAAGG AGGTCTACGA AATGCTGTCG CGCGCCGATT CGGTCGGCGT GTTCCAGGTC
GAGTCGCGCG CGCAGATGAC CATGCTGCCG CGCCTGAGGC CGCAACGCTT CTACGACCTG
GTGGTGGAAG TCGCGATCGT GCGCCCCGGC CCGATCCAGG GCGGCATGGT CCATCCCTAC
CTGCAGGCGC GCGAGCGTGC GGCGCGCGGC GAGGATCCGT TGGACGGCCT GCGCGAAGAG
ATCCGCGGTG TGCTGGCGCG CACGCTGGGC GTGCCGATCT TCCAGGAGCA GGTGATGCAG
CTCGCGGTGG TCGCGGCCGA CTTTACCGGC GGCGAGGCCG ACCAGCTGCG CCGCGCGATG
GGTGCCTGGC GGCGCAAGGG CGAGCTCGAG CGCTACCGCC AGAAGCTGCT CGACGGGCTG
GCGAACAACG GCTACGACCC CGACTTCGCC CAGCGCCTGT GCCAGCAGAT CGAGGGCTTC
GGCAGCTACG GCTTTCCCGA ATCCCACGCC GCCAGCTTCG CGCTGCTGGT GTATGCCTCG
GCCTGGCTCA AGTGCTTCGC GCCCGCGGCC TTCCTCGCCG CGCTGCTGAA CAGCCAGCCG
ATGGGCTTCT ACGCGCCCGC GCAGCTGATC CAGGACGCCC GCCGCCATGG GGTGGAGGTG
CGCGCGGCGG ACGTCGGCGC CAGCGCGTGG GACTGCATGC TGGAGACCGC CGACGGCGCG
CAGCCCGCGG TGCGCCTCGG CCTGCGCATG ATCCGCGGCC TCGGCCGCGA GGCCGCCACG
CGCATCGCCG CCGCGCGTGG CGAACACGCC TTCGCCGACA CCCGGGACCT CGCCGCGCGC
GCGCGGCTCG ACACCGGCGA GCTGCGCACC CTCGCCGCCG GCGGTGCCCT CGCCAGCCTC
GCCGGTCATC GCCGCCAGGC CCTGTGGCAG GCGAGCGGCG CCGCGCCCCT GCCCGGCCTG
CTCGCCGAAG CCCCTGGCGG CGACGTCGCC GCCACGCTGG ACGCCCCCGC CGAGGCCGAA
GACCTGCTCG CCGACTACGC CCGCCTCGGC TTCACCCTCG GTCGCCACCC GCTCGCCTTC
GTGCGCGAGC AGCTCGCCCG CCTGCGCTTT CTCACCGCCG CCGACATCAC CGCCGCGCCC
GACCGCATGC TCGCCCGCGG CGCCGGCCTG GTCACCTGCC GCCAGCGCCC CGGCACCGCG
AAGGGCACGC TCTTCCTCAC CCTGGAAGAC GAGACCGGGC TCACCAACGT GATCGTCCGC
CCCGAGCTCT TCGAACAGCA GCGCCGCATC CTGCTCGGGG CGCGCCTGAT GGGCGTGTTC
GGCCAGATCC GCCGTCAGGG CCGGGTCGTG CATCTGGTGG CGAGCCGGGT GGTCGACCAC
TCGCCCCTGC TCGGCAGCCT CGCCGCGCGC AGCCGGGATT TTCACTGA
 
Protein sequence
MLPAYAELHC LSNFSFQRGA SHPEELVAQA AAFGYSAIAL TDECSFAGVV RAHRALQALP 
EAGRPRLIVG CELRLADGPC LVLLAADRAG YGRLSRLLTQ ARRSADKGRY HLTRAMLDAT
RADEAAAEAL PGCLALLIPP EEFPPPGAAG DAAVALRSEA RWLAKRFPGR SWIAHTPRLD
GRDPARLHWI LRIAQATGVP AVAAGGALMH EAARRPLADV MSALRLHCSV AEAGLALAPN
AELRLHERAT LARRHPPELL AETVAVAARC RFDLDELRYE YPAELVPADE TPTRWLRRLV
EGGLAWRYGK AGKERRAAAP ATIAAAPPPA EDPAPPTVRA QIEHELALIA ELGYEPYFLT
VHDIVRFARE RGILCQGRGS AANSVVCWAL GITEVDPQLG IMLVERFISR ERDEPPDIDV
DFEHERREEV IQYLYRKYGR ERAALAATVI RYRARSALRD VGRALGLDET QIERLTREHH
WFDGRHILPE RLAEAGLDPA SPVTQRLVAL TEELIGFPRH LSQHVGGFVI ARGRLDELVP
VENAAMSDRT VIQWDKDDLD AVGLMKIDVL ALGMLSALRR GLALVSAWRG EALTLATIPR
ERKEVYEMLS RADSVGVFQV ESRAQMTMLP RLRPQRFYDL VVEVAIVRPG PIQGGMVHPY
LQARERAARG EDPLDGLREE IRGVLARTLG VPIFQEQVMQ LAVVAADFTG GEADQLRRAM
GAWRRKGELE RYRQKLLDGL ANNGYDPDFA QRLCQQIEGF GSYGFPESHA ASFALLVYAS
AWLKCFAPAA FLAALLNSQP MGFYAPAQLI QDARRHGVEV RAADVGASAW DCMLETADGA
QPAVRLGLRM IRGLGREAAT RIAAARGEHA FADTRDLAAR ARLDTGELRT LAAGGALASL
AGHRRQALWQ ASGAAPLPGL LAEAPGGDVA ATLDAPAEAE DLLADYARLG FTLGRHPLAF
VREQLARLRF LTAADITAAP DRMLARGAGL VTCRQRPGTA KGTLFLTLED ETGLTNVIVR
PELFEQQRRI LLGARLMGVF GQIRRQGRVV HLVASRVVDH SPLLGSLAAR SRDFH