Gene Tmz1t_3813 details


Gene Information

Locus tag: Tmz1t_3813
Symbol:
ID: 7874055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4208064
End bp: 4210883
Gene Length: 2820 bp
Protein Length: 939 aa
Translation table: 11
GC content: 69%
IMG OID: 643700755
Product: DNA polymerase I
Protein accession: YP_002890779
Protein GI: 237654465
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
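
The coordinates, gene length, and protein length in this record can be cross-checked arithmetically. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4208064, 4210883)
print(length)                  # 2820
print(protein_length(length))  # 939
```

Both values agree with the Gene Length and Protein Length fields listed for Tmz1t_3813.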


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.273799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCACCC TGCTGCTCGT CGACGGCTCC AGCTATCTCT ATCGCGCCTT CCATGCCCTG 
CCCGACCTGC GCAATTCGCA GGGCGAGCCG ACGGGCGCGA TCCGGGGGGT GCTGTCGATG
CTAAGGCGGC TGGAAAGCGA CTACAAGGCA GAATTTCGCG CCTGCGTCTT CGACGCCAAG
GGCAAGACTT TCCGCGACGA CTGGTATCCC GAATACAAGT CCCACCGCCC GCCGATGCCC
GACGACCTGC GCGCGCAGAT CGCGCCGCTG CACGAGGCGG TGCGGGCCGA GGGCTGGCCG
CTGCTGTCGG TGGAGGGCGT GGAAGCGGAC GACGTCATCG GCACGCTCAC CCGCCTGGCG
CTCGAGCGCG GCTGGGAGGT GGTGATCTCG ACCGGCGACA AGGACCTCAC CCAGCTCGTG
CGCCCCGGCG TGCGCTGGGT CAACACGATG AGCGAGGAAG TGCTCGACGA GGCCGGCGTG
GCGGCCAAGT TCGGCGTGCC GCCCGAGCGC ATCGTCGACT ACCTCGCGTT GGTGGGCGAC
ACGGTGGACA ACGTGCCGGG CGTGGAGAAG TGCGGTCCCA AGACCGCGGT GAAGTGGCTG
ACCGAGTACG GCACGCTCGA CAACCTCGTC GCCAACGCCG ACAAGGTGGG CGGCAAGGTC
GGCGAGAACC TGCGCAGGCA TCTGGACTTT CTGCCGCTGG GCCGGAAGCT GGTGACGGTG
GCGACGGATG TGGAGCTGCC GGTGGGGCTG GACGAGCTGC CGGCGCGCGC GGACGACAAG
GCGGCGCTGC GCGCGCTCTA CGAGCGCTTC GAGTTCAAGT CATGGCTGAA GGATCTGGAT
GGCGGTGCGG AGAGTGTCGC CGCCTCGAAG GCCGCGTCCG ATGCGCGCCG CTTCGACCGC
CAGGAAAATC GGGGACAGAC CCCGATTTCC GAGGCTGCGA CCACGACGCC AGGGGAGGGC
GGGGTGTGCG CAATCTCCGC GGATGAGCAA ATGAAAGACC TGACCCCGGT CTATCTTCCC
ATCCTCGACT GGCCGACCTT CGACGCCTGG CTGGCGAAGA TCTCCGCTGC CGAGCTCACC
GCCTTCGACA CCGAGACCAC CAGCCTCGAC CCGATGGCGG CCAGGCTGGT CGGGATGTCC
TTCGCGGTCG AGCCGGGCGA GGCGGCCTAC CTGCCGCTGG CGCACCGCGG TGCCGCGCAG
TTGCCAGAGG CGCCGGAGCA GTTGCCGCTG GACGAGGTGC TCGACAGGCT CAGGCCCTGG
TTCGAATCCG ACCGGCACGC CAAGCTCGGG CAGAACCTCA AGTACGACGC CCACGTGCTC
GCCAACCACG GTATCGCGCT GCGCGGCATC GCGCATGACA CCCTGCTCGA ATCCTACGTG
CTGGAGAGCG ACAAGGCGCA CGACATGGAT TCGCTCGCCA AGCGCCATCT CGGCCTCGCG
ACGATCCCTT ATACGGAGGT CTGCGGCAAG GGTGCCAAGC AGATCGGCTT CGACGAGGTC
GATGTCGCGC GCGCCACCGA ATACGCCGCC GAGGACGCCG ATGTCACCCT GCGCCTGCAC
CAGCACCTGT GCCCGCAGCT CCAGGCCCTG CCGCAGCTCG AAGCGCTTTA CCGCGAGCTC
GAGATGCCGG TGCTCGGCGT GCTGCAGCGC ATGGAACGCA CCGGGGTGCT GATCGACCCC
TTCCTGCTCG GCCAGCACTC GGAGGAGCTC GGCCGCCGCC TGTACACGCT CGAGGGCGAG
GCACATGCGC TCGCCGGCCA GCCCTTCAAC CTCGGCTCGC CCAAGCAGCT CGGCGAGATC
CTGTTCGGCA AGCTCGGCCT GCCCGTGGTC AAGAAGACCG CCACCGGCCA GCCCTCCACC
GACGAGGAGG TGCTGGAGAA GCTCGCCGAG GACTACCCCT TGCCCAAGCT CCTGCTCGAG
CACCGCGGCC TGTCCAAGCT CAAGAGCACC TACGCCGACA AGCTGCCGCG CATGGTCAAT
CCGCGCACCG GCCGCGTGCA CACCAGCTTC TCGCAGGCCA CCGCGGTCAC CGGCCGGCTG
TCGAGCTCCG AGCCCAACCT GCAGAACATC CCGATCCGCA CGCCCGAAGG CCGCCGCATC
CGCGCCGCCT TCATCGCGCC GCGCGATCAT CTCATCGTCT CGGCCGACTA CTCGCAGATC
GAGCTGCGCA TCATGGCCCA CCTTTCGGAC GACGCCCGCC TGCTCGAAGC CTTCGCCCGC
GGCGAGGACG TGCACCGCGC CACCGCCGGC GAGGTCTTCG GCGTGGCGCC GGCCGAGGTG
ACGAGCGAGC AGCGCCGCTA CGCCAAGGTG ATCAACTTCG GTCTCATCTA CGGCATGAGC
GCGCATGGCC TGGCCAAGAA CCTCGGCATC GAGCGCGCCG CCGCGCAGGC CTGGATCGAC
CGCTACTTCG CGCGCTACCC CGGCGTTGCA GAGTACATGG ACCGCACCCG CGCCGAGGCG
CGCGAGCAGG GCTACGTCGA GACCGTCTTC GGCCGTCGCC TCTACCTGCC CGACATCCGC
GCCAGCCAGG CCGGCCGCCG CGCCGGCGCC GAGCGTGCGG CGATCAACGC GCCGATGCAG
GGCACGGCCG CCGACCTCAT CAAGAAAGCG ATGATCGCGA CCGACGCCTG GCTGGCTTCC
TGCGGCTTGA AGTCGAAGCT GGTGCTGCAG GTGCACGACG AGCTGGTGCT CGAGGTACCC
GGGGCCGAGC TCGAAACGGT GCGGGAAGAG CTGCCCCGGC TGATGGGCGG CGTCGCCACG
CTGAAGGTGC CGCTGCTGGT GGAGGTCGGC GCCGGCGACA ACTGGGACGA GGCCCACTGA
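
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a sketch of how a GC content figure such as the one in the Gene Information section could be recomputed, assuming a plain G+C fraction over all bases (the helper name `gc_content` is hypothetical):

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First ten-base block of the gene sequence above.
print(f"{gc_content('ATGCCCACCC'):.0%}")  # 70%
```

Passing the full gene sequence as one string, line breaks included, gives the value for the whole gene.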
 
Protein sequence
MPTLLLVDGS SYLYRAFHAL PDLRNSQGEP TGAIRGVLSM LRRLESDYKA EFRACVFDAK 
GKTFRDDWYP EYKSHRPPMP DDLRAQIAPL HEAVRAEGWP LLSVEGVEAD DVIGTLTRLA
LERGWEVVIS TGDKDLTQLV RPGVRWVNTM SEEVLDEAGV AAKFGVPPER IVDYLALVGD
TVDNVPGVEK CGPKTAVKWL TEYGTLDNLV ANADKVGGKV GENLRRHLDF LPLGRKLVTV
ATDVELPVGL DELPARADDK AALRALYERF EFKSWLKDLD GGAESVAASK AASDARRFDR
QENRGQTPIS EAATTTPGEG GVCAISADEQ MKDLTPVYLP ILDWPTFDAW LAKISAAELT
AFDTETTSLD PMAARLVGMS FAVEPGEAAY LPLAHRGAAQ LPEAPEQLPL DEVLDRLRPW
FESDRHAKLG QNLKYDAHVL ANHGIALRGI AHDTLLESYV LESDKAHDMD SLAKRHLGLA
TIPYTEVCGK GAKQIGFDEV DVARATEYAA EDADVTLRLH QHLCPQLQAL PQLEALYREL
EMPVLGVLQR MERTGVLIDP FLLGQHSEEL GRRLYTLEGE AHALAGQPFN LGSPKQLGEI
LFGKLGLPVV KKTATGQPST DEEVLEKLAE DYPLPKLLLE HRGLSKLKST YADKLPRMVN
PRTGRVHTSF SQATAVTGRL SSSEPNLQNI PIRTPEGRRI RAAFIAPRDH LIVSADYSQI
ELRIMAHLSD DARLLEAFAR GEDVHRATAG EVFGVAPAEV TSEQRRYAKV INFGLIYGMS
AHGLAKNLGI ERAAAQAWID RYFARYPGVA EYMDRTRAEA REQGYVETVF GRRLYLPDIR
ASQAGRRAGA ERAAINAPMQ GTAADLIKKA MIATDAWLAS CGLKSKLVLQ VHDELVLEVP
GAELETVREE LPRLMGGVAT LKVPLLVEVG AGDNWDEAH
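
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the conventional TCAG ordering and translates until the first stop codon. It assumes the amino acid assignments of translation table 11 match those of the standard code (the two tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGCCCACCC TGCTGCTCGT CGACGGCTCC"))  # MPTLLLVDGS
```

The output matches the first block of the protein sequence above.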