Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1748 |
Symbol | |
ID | 3832893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1801134 |
End bp | 1803086 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637829672 |
Product | PHP-like |
Protein accession | YP_430592 |
Protein GI | 83590583 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | [TIGR01856] histidinol phosphate phosphatase HisJ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00399585 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.370986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCAACC TGGAGCTGGC CTGGGCCCTG GCGGAAATGG GCGACTTGCT GGAGTTAAAG GGGGAGGAGC CCTTTAAGGT GCGGGCCTAC CATCGTGCCG CCCGTTCCCT GGAGAACCTG GAGGAAGAGG CGGCCGATCT ATACGCCCGC GGCGCCCTGG AGGAGATACC CGGCGTGGGC AAGAACCTGG CCAAAAAGCT CGCCGAACTC CTGACTACAG GCCGCTCTAC CTTTCTCGAC AATCTCCGCC GGGAAGTGCC GCCGGGCCTG CGGGAGATGC TGGCCATCCC GGGGCTGGGC AGCCGTACCG TCCGCCAGAT TCACCAGGGA CTGGGGATTA CGACCCTGGC TGAGCTGGAA CAGGCGGCCC GGGAGAGGCG CATCCGCACC CTGCCGGGTC TGGGCAGCAA GACGGAACTG GCCATTCTGC GGGGGCTGGA GATGCTGCGG GAGGTCCAGG ACCGGGTACC CCTGGGGGTG GCCCGGCCCC TGGCCCTGTT GTTGCGGGCT CAACTCCTGG CCCTGCCGGG GGTGGTCCGG GCGGAAATAG CCGGGAGCGT CCGCCGCGGT AAAGAAATGG TGGGGGATAT TGATCTGGTG GCCGCCGTCG AGCCGGACAA CCAGGTGGCG GCAGTCCTGG TCCGCCACCC CCAGGTCAAG GAAGTCCTGG CCAGGGAACC GGACCGCCTG GCCCTGCAGA CGAACCTGGG CCTGAAGATC GAAGTGATCA TGGTTCCCCC GGAGGATTTC CCGGCCACCC TCTTTTATGC CACCGGGTCA AAGGCGCATC GCCGGGCCCT GCTTCGCCTG GCCGCCGAAA GAGGCCTTGG GGCGGCCGAC CTGGGCCTGG TTACCCCGCG CTGGCTGGCC GAGGAGGAGG ACGTGCTGGC CGGGGGAACT ACGGAAGCCC CGGGAAAGGG CGGCGGGTCC CATGGGGAAG CAGCTGCCGC CTTTGCAACA TCCGGGGCGA CGGCTAAGGA GGATACCCCC GGGGTCGCCG GCGGTGCTCC TGGTACCGGC GTCCCCCCTG CACACGCCGG CGCACCCCTT ACACATGCCG GTACCGGGAC CAATGCACGG GAAGAACACG CCGGGGTGAG GGAGCCGGTT GAAGCTGCCT TTTACCAGCG CCTGGGTTTA CCTTACATCG TCCCCGAACT CCGGGAAGAC CGGGGAGAGC TCGCAGCCGC CCGGCGGGGG GAACTGCCCC ACCTTGTTAC CCTCGCCGAT ATCCGTGGCG ACCTGCATAT GCACAGCCGC TACAGCGACG GAGTGGAGAC CATTGCCGCC ATGGCCGCGG CAGCCAGGGC CAGGGGCTAC CAGTATATCG CCATCACCGA CCACTCCCGC TCCCTGACGG TGGCCCGGGG CCTGAGCCTT GAACAGTTAA AGGCCCAGCG GGAAGAGATT GCCCGCCTGA ATGAGGAACT GGAAGGCATC ACCATCCTGG CCGGGATCGA AGTGGACATC CTGGCCGACG GCCGCCTGGA CTACGAAGAT GAGGTTTTAA AGGAATTCGA TCTGGTTATC GCCTCCATCC ATTCCGGCTT CCGCCAGGAG AGGGAGCAAA TCATGGCCCG CCTGGAAGCG GCCCTGCGCA ACCCTTATGT GGATATCCTG GGACACCCCA CCGGCCGCAT GCTGGGCCGG CGGCAGCCCT ACGCCGTAGA TGTCAAGAGG GTTATAGAAC TGGCGGCGGA GACGGGGACC ATCCTGGAGA TCAACGCCAG CCCCGAACGG CTGGATCTAA ACGATACCTC GGCCCGCCTG GCCAAAGAAT ACGGCGTACC CATCGCCATT GATACCGATG CCCATGACCC TCACCGTCTC GCGGACATGG AGTACGGCGT CCTCACCGCC CGGCGCGGTT GGCTGGAACC CGCGGACGTA GTCAACACCT GGGAACTGGA ACGGCTGCTG GCCGGGTTGA AGCGGAACAG GCACGGGGCG TAA
|
Protein sequence | MTNLELAWAL AEMGDLLELK GEEPFKVRAY HRAARSLENL EEEAADLYAR GALEEIPGVG KNLAKKLAEL LTTGRSTFLD NLRREVPPGL REMLAIPGLG SRTVRQIHQG LGITTLAELE QAARERRIRT LPGLGSKTEL AILRGLEMLR EVQDRVPLGV ARPLALLLRA QLLALPGVVR AEIAGSVRRG KEMVGDIDLV AAVEPDNQVA AVLVRHPQVK EVLAREPDRL ALQTNLGLKI EVIMVPPEDF PATLFYATGS KAHRRALLRL AAERGLGAAD LGLVTPRWLA EEEDVLAGGT TEAPGKGGGS HGEAAAAFAT SGATAKEDTP GVAGGAPGTG VPPAHAGAPL THAGTGTNAR EEHAGVREPV EAAFYQRLGL PYIVPELRED RGELAAARRG ELPHLVTLAD IRGDLHMHSR YSDGVETIAA MAAAARARGY QYIAITDHSR SLTVARGLSL EQLKAQREEI ARLNEELEGI TILAGIEVDI LADGRLDYED EVLKEFDLVI ASIHSGFRQE REQIMARLEA ALRNPYVDIL GHPTGRMLGR RQPYAVDVKR VIELAAETGT ILEINASPER LDLNDTSARL AKEYGVPIAI DTDAHDPHRL ADMEYGVLTA RRGWLEPADV VNTWELERLL AGLKRNRHGA
|
| |