Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3813 |
Symbol | |
ID | 7874055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4208064 |
End bp | 4210883 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643700755 |
Product | DNA polymerase I |
Protein accession | YP_002890779 |
Protein GI | 237654465 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.273799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCACCC TGCTGCTCGT CGACGGCTCC AGCTATCTCT ATCGCGCCTT CCATGCCCTG CCCGACCTGC GCAATTCGCA GGGCGAGCCG ACGGGCGCGA TCCGGGGGGT GCTGTCGATG CTAAGGCGGC TGGAAAGCGA CTACAAGGCA GAATTTCGCG CCTGCGTCTT CGACGCCAAG GGCAAGACTT TCCGCGACGA CTGGTATCCC GAATACAAGT CCCACCGCCC GCCGATGCCC GACGACCTGC GCGCGCAGAT CGCGCCGCTG CACGAGGCGG TGCGGGCCGA GGGCTGGCCG CTGCTGTCGG TGGAGGGCGT GGAAGCGGAC GACGTCATCG GCACGCTCAC CCGCCTGGCG CTCGAGCGCG GCTGGGAGGT GGTGATCTCG ACCGGCGACA AGGACCTCAC CCAGCTCGTG CGCCCCGGCG TGCGCTGGGT CAACACGATG AGCGAGGAAG TGCTCGACGA GGCCGGCGTG GCGGCCAAGT TCGGCGTGCC GCCCGAGCGC ATCGTCGACT ACCTCGCGTT GGTGGGCGAC ACGGTGGACA ACGTGCCGGG CGTGGAGAAG TGCGGTCCCA AGACCGCGGT GAAGTGGCTG ACCGAGTACG GCACGCTCGA CAACCTCGTC GCCAACGCCG ACAAGGTGGG CGGCAAGGTC GGCGAGAACC TGCGCAGGCA TCTGGACTTT CTGCCGCTGG GCCGGAAGCT GGTGACGGTG GCGACGGATG TGGAGCTGCC GGTGGGGCTG GACGAGCTGC CGGCGCGCGC GGACGACAAG GCGGCGCTGC GCGCGCTCTA CGAGCGCTTC GAGTTCAAGT CATGGCTGAA GGATCTGGAT GGCGGTGCGG AGAGTGTCGC CGCCTCGAAG GCCGCGTCCG ATGCGCGCCG CTTCGACCGC CAGGAAAATC GGGGACAGAC CCCGATTTCC GAGGCTGCGA CCACGACGCC AGGGGAGGGC GGGGTGTGCG CAATCTCCGC GGATGAGCAA ATGAAAGACC TGACCCCGGT CTATCTTCCC ATCCTCGACT GGCCGACCTT CGACGCCTGG CTGGCGAAGA TCTCCGCTGC CGAGCTCACC GCCTTCGACA CCGAGACCAC CAGCCTCGAC CCGATGGCGG CCAGGCTGGT CGGGATGTCC TTCGCGGTCG AGCCGGGCGA GGCGGCCTAC CTGCCGCTGG CGCACCGCGG TGCCGCGCAG TTGCCAGAGG CGCCGGAGCA GTTGCCGCTG GACGAGGTGC TCGACAGGCT CAGGCCCTGG TTCGAATCCG ACCGGCACGC CAAGCTCGGG CAGAACCTCA AGTACGACGC CCACGTGCTC GCCAACCACG GTATCGCGCT GCGCGGCATC GCGCATGACA CCCTGCTCGA ATCCTACGTG CTGGAGAGCG ACAAGGCGCA CGACATGGAT TCGCTCGCCA AGCGCCATCT CGGCCTCGCG ACGATCCCTT ATACGGAGGT CTGCGGCAAG GGTGCCAAGC AGATCGGCTT CGACGAGGTC GATGTCGCGC GCGCCACCGA ATACGCCGCC GAGGACGCCG ATGTCACCCT GCGCCTGCAC CAGCACCTGT GCCCGCAGCT CCAGGCCCTG CCGCAGCTCG AAGCGCTTTA CCGCGAGCTC GAGATGCCGG TGCTCGGCGT GCTGCAGCGC ATGGAACGCA CCGGGGTGCT GATCGACCCC TTCCTGCTCG GCCAGCACTC GGAGGAGCTC GGCCGCCGCC TGTACACGCT CGAGGGCGAG GCACATGCGC TCGCCGGCCA GCCCTTCAAC CTCGGCTCGC CCAAGCAGCT CGGCGAGATC CTGTTCGGCA AGCTCGGCCT GCCCGTGGTC AAGAAGACCG CCACCGGCCA GCCCTCCACC GACGAGGAGG TGCTGGAGAA GCTCGCCGAG GACTACCCCT TGCCCAAGCT CCTGCTCGAG CACCGCGGCC TGTCCAAGCT CAAGAGCACC TACGCCGACA AGCTGCCGCG CATGGTCAAT CCGCGCACCG GCCGCGTGCA CACCAGCTTC TCGCAGGCCA CCGCGGTCAC CGGCCGGCTG TCGAGCTCCG AGCCCAACCT GCAGAACATC CCGATCCGCA CGCCCGAAGG CCGCCGCATC CGCGCCGCCT TCATCGCGCC GCGCGATCAT CTCATCGTCT CGGCCGACTA CTCGCAGATC GAGCTGCGCA TCATGGCCCA CCTTTCGGAC GACGCCCGCC TGCTCGAAGC CTTCGCCCGC GGCGAGGACG TGCACCGCGC CACCGCCGGC GAGGTCTTCG GCGTGGCGCC GGCCGAGGTG ACGAGCGAGC AGCGCCGCTA CGCCAAGGTG ATCAACTTCG GTCTCATCTA CGGCATGAGC GCGCATGGCC TGGCCAAGAA CCTCGGCATC GAGCGCGCCG CCGCGCAGGC CTGGATCGAC CGCTACTTCG CGCGCTACCC CGGCGTTGCA GAGTACATGG ACCGCACCCG CGCCGAGGCG CGCGAGCAGG GCTACGTCGA GACCGTCTTC GGCCGTCGCC TCTACCTGCC CGACATCCGC GCCAGCCAGG CCGGCCGCCG CGCCGGCGCC GAGCGTGCGG CGATCAACGC GCCGATGCAG GGCACGGCCG CCGACCTCAT CAAGAAAGCG ATGATCGCGA CCGACGCCTG GCTGGCTTCC TGCGGCTTGA AGTCGAAGCT GGTGCTGCAG GTGCACGACG AGCTGGTGCT CGAGGTACCC GGGGCCGAGC TCGAAACGGT GCGGGAAGAG CTGCCCCGGC TGATGGGCGG CGTCGCCACG CTGAAGGTGC CGCTGCTGGT GGAGGTCGGC GCCGGCGACA ACTGGGACGA GGCCCACTGA
|
Protein sequence | MPTLLLVDGS SYLYRAFHAL PDLRNSQGEP TGAIRGVLSM LRRLESDYKA EFRACVFDAK GKTFRDDWYP EYKSHRPPMP DDLRAQIAPL HEAVRAEGWP LLSVEGVEAD DVIGTLTRLA LERGWEVVIS TGDKDLTQLV RPGVRWVNTM SEEVLDEAGV AAKFGVPPER IVDYLALVGD TVDNVPGVEK CGPKTAVKWL TEYGTLDNLV ANADKVGGKV GENLRRHLDF LPLGRKLVTV ATDVELPVGL DELPARADDK AALRALYERF EFKSWLKDLD GGAESVAASK AASDARRFDR QENRGQTPIS EAATTTPGEG GVCAISADEQ MKDLTPVYLP ILDWPTFDAW LAKISAAELT AFDTETTSLD PMAARLVGMS FAVEPGEAAY LPLAHRGAAQ LPEAPEQLPL DEVLDRLRPW FESDRHAKLG QNLKYDAHVL ANHGIALRGI AHDTLLESYV LESDKAHDMD SLAKRHLGLA TIPYTEVCGK GAKQIGFDEV DVARATEYAA EDADVTLRLH QHLCPQLQAL PQLEALYREL EMPVLGVLQR MERTGVLIDP FLLGQHSEEL GRRLYTLEGE AHALAGQPFN LGSPKQLGEI LFGKLGLPVV KKTATGQPST DEEVLEKLAE DYPLPKLLLE HRGLSKLKST YADKLPRMVN PRTGRVHTSF SQATAVTGRL SSSEPNLQNI PIRTPEGRRI RAAFIAPRDH LIVSADYSQI ELRIMAHLSD DARLLEAFAR GEDVHRATAG EVFGVAPAEV TSEQRRYAKV INFGLIYGMS AHGLAKNLGI ERAAAQAWID RYFARYPGVA EYMDRTRAEA REQGYVETVF GRRLYLPDIR ASQAGRRAGA ERAAINAPMQ GTAADLIKKA MIATDAWLAS CGLKSKLVLQ VHDELVLEVP GAELETVREE LPRLMGGVAT LKVPLLVEVG AGDNWDEAH
|
| |