Gene Information |
Locus tag | Teth514_1234 |
Symbol | |
ID | 5878027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1275073 |
End bp | 1277007 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641541584 |
Product | DNA-directed DNA polymerase |
Protein accession | YP_001662864 |
Protein GI | 167039879 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | |
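The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: the variable names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon, which is consistent with the listed values but not stated by the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1275073
end_bp = 1277007

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"gene length: {gene_length} bp (leftover bases: {remainder})")
print(f"protein length: {protein_length} aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length rows.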
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000117572 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAACCC TCAGTATAGA TATAGAAACA TTTAGCAGCG TAGACCTCGC CAAAAGCGGG GTCTATCGTT ACGCTGAAGC ACCGGATTTT GAGATTTTAC TTTTCGGTTA CAGTGTGGAT GCTGGGCCGG TTCAAGTAGT AGATTTCGCT TGTGGCGAAA AAATACCTAA AGAAATACAG CAAGCGATTC TAGATAATAA TGTTACCAAA TGGGCGTTTA ATGCGCAGTT TGAGCGGATT TGCTTATCGA AGTACTTTGG TATACACCTT GCACCTGATT CTTGGCGTTG CACAATGGTT TGGTCTGCGT ACCTTGGGCT TCCTTTATCT TTGGAAGGAG CAGCAATCGT AACAGGAGCA GACAAGAAAA AGCTAACAAA AGGTAAAGAG CTCATTCGAT ATTTTTCAGT TCCATGCAAA CCAACAGTAA CTAATGGTGG TCGTACACGA AATCTACCTG AACATGCTCC AGAGAAGTGG AATAGCTTCA AAGCATATAA TCTTCGAGAC GTTGAAGTTG AACTTTCTAT ACAGGAAAAG CTTCAAAAAT TTCCTATGCC TGAAGAGGAA TGGAATAACT ATATTCTAGA TCAGCAAATC AATGATCGGG GTATTCAGTT AGATTTGGAA TTGGTAAAAA AAGCGATTCA ATGTGATGAA AAAGTAAGAG AAGAGCTAAC AAGCAGACTA AAAGAACTAA CTGATCTTGA TAATCCTAAT TCAGTTGTTC AAATGAAGTC CTGGCTATCG GAAAAGGGTC TGGAAACAGA CAGTCTTGAT AAGGCATCAG TTAAGGCACT ATTAAAGGAA GCTCCAGAAC ATCTGAGTGA AGTACTGGAA CTGAGACAGT TACTAGCCAA ATCCAGTGTA AAGAAATACA CCGCAATGGA AAATGCCGTA TGCACTGACG GAAGAGCTCG GGGACTACTA CAATTCTACG GTGCTAATCG AACCGGTAGA TTTGCAGGAA GGCTTATACA AGTTCAAAAT CTCCCTCAGA ACCATTTGTC GGATCTGGAA CAGGCACGTC GATTGGTTAG AGGTGGCCAT TTTGATGCCT TGGAAATATT GTATGACTCT ATTCCGGGAG TTTTGTCTGA ATTAATTCGA ACCGCTTTTG TGCCAAAGAA AGGATATAAG TTTATCGTTG CTGACTTTAG TGCAATAGAG GCTAGAGTTA TTGCTTGGCT TGCAGGCGAG ACATGGAGAA ACGAAGTGTT TGCTACTCAT GGAAAGATTT ATGAAGCATC TGCTTCCCAG ATGTTTAAAG TCCCCTTAGA AGAAGTCACA AAAGGCAGTC CTCTAAGACA AAAAGGAAAA ATTGCGGAGC TGGCCTTAGG TTATGGTGGT TCTGTAGGTG CATTAAAAGC AATGGGTGCA CTTGATATGG GTCTTACCGA AGAAGAATTA AAACCTTTGG TCTACGCATG GCGAAATGCA AATCCTAATA TTGTACGACT TTGGTGGGAT GTTGATCGTG CTGTCAAAGA AGCTGTAACA GAAAGGTGCA GAACTGAAAC TCATCGTATC CGTTTTGAGT ATCGTAGCGG GATGCTATTA ATATGGCTTC CTTCTGGCAG ACAACTTACT TATGTTAAAC CAAGAATGGG TATTAACAGC TTTGGCAGTG AAGCAGTGAC TTATGAAGGA GTTGGTGCCA CAAAGAAATG GGAGCGCATT GAAAGTTATG GTCCGAAGTT TGTTGAGAAC ATCGTGCAGG CCATTTCAAG AGATCTTCTT TGCCATTCCA TGCGAAATCT GGATGAATCA GGACTAAATA TTGTCATGCA TGTCCATGAC 
GAGGTAGTTT TAGAGGTTCC TTTAGAAATA TGCGTACAAG ACGTCTGTGT TCTTATGGGT CAGGTACCGC CTTGGGCGCA TGGGCTTCTA CTTCGTGCGG ATGGATTTGA ATGCGATTTT TATAAAAAAG ATTAA
|
Protein sequence | MRTLSIDIET FSSVDLAKSG VYRYAEAPDF EILLFGYSVD AGPVQVVDFA CGEKIPKEIQ QAILDNNVTK WAFNAQFERI CLSKYFGIHL APDSWRCTMV WSAYLGLPLS LEGAAIVTGA DKKKLTKGKE LIRYFSVPCK PTVTNGGRTR NLPEHAPEKW NSFKAYNLRD VEVELSIQEK LQKFPMPEEE WNNYILDQQI NDRGIQLDLE LVKKAIQCDE KVREELTSRL KELTDLDNPN SVVQMKSWLS EKGLETDSLD KASVKALLKE APEHLSEVLE LRQLLAKSSV KKYTAMENAV CTDGRARGLL QFYGANRTGR FAGRLIQVQN LPQNHLSDLE QARRLVRGGH FDALEILYDS IPGVLSELIR TAFVPKKGYK FIVADFSAIE ARVIAWLAGE TWRNEVFATH GKIYEASASQ MFKVPLEEVT KGSPLRQKGK IAELALGYGG SVGALKAMGA LDMGLTEEEL KPLVYAWRNA NPNIVRLWWD VDRAVKEAVT ERCRTETHRI RFEYRSGMLL IWLPSGRQLT YVKPRMGINS FGSEAVTYEG VGATKKWERI ESYGPKFVEN IVQAISRDLL CHSMRNLDES GLNIVMHVHD EVVLEVPLEI CVQDVCVLMG QVPPWAHGLL LRADGFECDF YKKD
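The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned up and its GC fraction computed follows. The helper names `strip_blocks` and `gc_fraction` are hypothetical, and the record does not say how its GC content figure was derived; the sketch assumes it is simply the share of G and C bases in the gene sequence.

```python
def strip_blocks(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
example = "ATGAGAACCC TCAGTATAGA TATAGAAACA"
clean = strip_blocks(example)
print(len(clean), round(gc_fraction(clean), 3))
```

Passing the full Gene sequence field through the same two helpers would give a value to compare against the GC content row.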