Gene Tmz1t_3956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3956 
Symbol 
ID7873602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4352685 
End bp4354193 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content72% 
IMG OID643700893 
ProductDNA polymerase III, epsilon subunit 
Protein accessionYP_002890916 
Protein GI237654602 
COG category[L] Replication, recombination and repair 
COG ID[COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases 
TIGRFAM ID[TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.323553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACGCCCA CCCTCCGCAG CTTTCCCGGC GGCCTCGCCT TTGTCGACAT CGAGACCACC 
GGCGGCCCCG CCCAACGCGA ATCCATCACC GAGATCGGCA TCGTCCAGGT GGATGAGGAC
GGCGTGCGCG AGTGGTCGAC GCTGGTGCGT CCCGCGTCGC GCATCCCAGA GACCATCCAG
CGCCTCACCG GCATCGACGA CGACATGGTC GCCGACGCAC CGCGCTTCGA GGACATCGCC
GACGAGGTCT TCGACCGTCT CGACGGCCGC CTCTTCGTCG CCCACAACGC ACGCTTCGAC
CACGGCCACC TGCGCGCCGC CTTCCGCCGC GCCGGGCTCG ACATGCGGCC GCAGGTGCTG
TGCACCGTCA AGCTGTCGCG CCGGCTGTTC CCCGACCACC GCCGCCACGG TCTCGACCAC
CTCATCGAGC GCCACGGCCT GGCGGTGGCC GACCGCCACC GTGCGCTCGG CGACGCCCGG
CTGCTGTGGC AGTTCTGGCA GAAGATCCAC GAACGCTTTC CGCCCGGTCA CATCGATGCC
GCGGTGCGCG AACTCATCGG CCACCCCAGC CTGCCCCCCC ACCTCGACCC CGAGCAAATC
GCCGACCTGC CCGACACGCC GGGGGTGTAT CTGTTCTACG GCGAGCGCGG GGGCGAGAGC
AGCCAGCTTG GGGGCAACGA TGAAGCCGAC GCCGAAGCCG AGGCCGATCC GCTCGGACCC
GGCAGGGCGC GCACCGGCGC GCGCGACCGC AAGCGCCACG CGCCGCTGCA GGACTTGCCG
CTCTACATCG GCAAGAGCAC GCGGCTGCGC AGCCGGGTGT TGTCGCACTT CGCCGCCGAC
CACAGCAGCG ACCGCGAGCT CAGCCTCTCC CAGCAGGTGC GCCGCATCGA ATGGATCGCG
ACCGCCGGCG AGATCGGCGC GCTGCTGAAG GAAGCCGAAC TGGTCAAGCG CCTGCAGCCC
ACCCACAACC GCCAGCTGCG CCGCAACCGC GAGCTGTGCA CCTGGCGGCT CGCCACCGAC
ATCGTCGGCG ACTGGCGGCT GGAGCTGGTG CATGCGGCCG ACCTCGACTT CGGCCGCCGC
GACGACCTCT ACGGTTTCTT CCGCACCCGC CGCGAGGCCA CCAACCGGTT GCGCGCGCTC
GCCCGCGACC ACGCCCTGTG CCCGCCGCTG CTCGGCCTGG AGAAACCCCC GCAAGGTGCG
CGCTGCTTCG ACTTCCAGTT GAAGCGCTGC CGTGGCGCCT GCCACGGCGG CGAATCCCCC
CAGGCCCACG CCCTGCGCCT GATCGAGGCC CTGCACGCGC TGAAGGTCGA GCACTGGACC
TGGCCCGGCC CGGTCGGCCT GCGCGAGGGC GAGGCCATCC ACGTCGTCGA CGGCTGGCGC
TGGCTCGGCA CCGCCACCGA CGAAGCCATG CTCGCCGACC TGCTGGAGGC CGGCCGCCCG
GCCTTCGACC ACGACATCTA CAAGATCCTG GTCAAGGCGG TGAGGCGGCT GCCGGTGGTG
CAGCTCTAA
 
Protein sequence
MTPTLRSFPG GLAFVDIETT GGPAQRESIT EIGIVQVDED GVREWSTLVR PASRIPETIQ 
RLTGIDDDMV ADAPRFEDIA DEVFDRLDGR LFVAHNARFD HGHLRAAFRR AGLDMRPQVL
CTVKLSRRLF PDHRRHGLDH LIERHGLAVA DRHRALGDAR LLWQFWQKIH ERFPPGHIDA
AVRELIGHPS LPPHLDPEQI ADLPDTPGVY LFYGERGGES SQLGGNDEAD AEAEADPLGP
GRARTGARDR KRHAPLQDLP LYIGKSTRLR SRVLSHFAAD HSSDRELSLS QQVRRIEWIA
TAGEIGALLK EAELVKRLQP THNRQLRRNR ELCTWRLATD IVGDWRLELV HAADLDFGRR
DDLYGFFRTR REATNRLRAL ARDHALCPPL LGLEKPPQGA RCFDFQLKRC RGACHGGESP
QAHALRLIEA LHALKVEHWT WPGPVGLREG EAIHVVDGWR WLGTATDEAM LADLLEAGRP
AFDHDIYKIL VKAVRRLPVV QL