Gene Moth_1748 details

Gene Information

Locus tag: Moth_1748
Symbol:
ID: 3832893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1801134
End bp: 1803086
Gene Length: 1953 bp
Protein Length: 650 aa
Translation table: 11
GC content: 66%
IMG OID: 637829672
Product: PHP-like
Protein accession: YP_430592
Protein GI: 83590583
COG category: [L] Replication, recombination and repair
COG ID: [COG1796] DNA polymerase IV (family X)
TIGRFAM ID: [TIGR01856] histidinol phosphate phosphatase HisJ family
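
The start and end coordinates, gene length, and protein length in this record are consistent with one another. The sketch below checks that arithmetic in Python. It assumes 1-based inclusive coordinates and a CDS length that counts the stop codon; neither is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1801134, 1803086)
print(length)                  # 1953
print(protein_length(length))  # 650
```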


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00399585
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.370986
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCAACC TGGAGCTGGC CTGGGCCCTG GCGGAAATGG GCGACTTGCT GGAGTTAAAG 
GGGGAGGAGC CCTTTAAGGT GCGGGCCTAC CATCGTGCCG CCCGTTCCCT GGAGAACCTG
GAGGAAGAGG CGGCCGATCT ATACGCCCGC GGCGCCCTGG AGGAGATACC CGGCGTGGGC
AAGAACCTGG CCAAAAAGCT CGCCGAACTC CTGACTACAG GCCGCTCTAC CTTTCTCGAC
AATCTCCGCC GGGAAGTGCC GCCGGGCCTG CGGGAGATGC TGGCCATCCC GGGGCTGGGC
AGCCGTACCG TCCGCCAGAT TCACCAGGGA CTGGGGATTA CGACCCTGGC TGAGCTGGAA
CAGGCGGCCC GGGAGAGGCG CATCCGCACC CTGCCGGGTC TGGGCAGCAA GACGGAACTG
GCCATTCTGC GGGGGCTGGA GATGCTGCGG GAGGTCCAGG ACCGGGTACC CCTGGGGGTG
GCCCGGCCCC TGGCCCTGTT GTTGCGGGCT CAACTCCTGG CCCTGCCGGG GGTGGTCCGG
GCGGAAATAG CCGGGAGCGT CCGCCGCGGT AAAGAAATGG TGGGGGATAT TGATCTGGTG
GCCGCCGTCG AGCCGGACAA CCAGGTGGCG GCAGTCCTGG TCCGCCACCC CCAGGTCAAG
GAAGTCCTGG CCAGGGAACC GGACCGCCTG GCCCTGCAGA CGAACCTGGG CCTGAAGATC
GAAGTGATCA TGGTTCCCCC GGAGGATTTC CCGGCCACCC TCTTTTATGC CACCGGGTCA
AAGGCGCATC GCCGGGCCCT GCTTCGCCTG GCCGCCGAAA GAGGCCTTGG GGCGGCCGAC
CTGGGCCTGG TTACCCCGCG CTGGCTGGCC GAGGAGGAGG ACGTGCTGGC CGGGGGAACT
ACGGAAGCCC CGGGAAAGGG CGGCGGGTCC CATGGGGAAG CAGCTGCCGC CTTTGCAACA
TCCGGGGCGA CGGCTAAGGA GGATACCCCC GGGGTCGCCG GCGGTGCTCC TGGTACCGGC
GTCCCCCCTG CACACGCCGG CGCACCCCTT ACACATGCCG GTACCGGGAC CAATGCACGG
GAAGAACACG CCGGGGTGAG GGAGCCGGTT GAAGCTGCCT TTTACCAGCG CCTGGGTTTA
CCTTACATCG TCCCCGAACT CCGGGAAGAC CGGGGAGAGC TCGCAGCCGC CCGGCGGGGG
GAACTGCCCC ACCTTGTTAC CCTCGCCGAT ATCCGTGGCG ACCTGCATAT GCACAGCCGC
TACAGCGACG GAGTGGAGAC CATTGCCGCC ATGGCCGCGG CAGCCAGGGC CAGGGGCTAC
CAGTATATCG CCATCACCGA CCACTCCCGC TCCCTGACGG TGGCCCGGGG CCTGAGCCTT
GAACAGTTAA AGGCCCAGCG GGAAGAGATT GCCCGCCTGA ATGAGGAACT GGAAGGCATC
ACCATCCTGG CCGGGATCGA AGTGGACATC CTGGCCGACG GCCGCCTGGA CTACGAAGAT
GAGGTTTTAA AGGAATTCGA TCTGGTTATC GCCTCCATCC ATTCCGGCTT CCGCCAGGAG
AGGGAGCAAA TCATGGCCCG CCTGGAAGCG GCCCTGCGCA ACCCTTATGT GGATATCCTG
GGACACCCCA CCGGCCGCAT GCTGGGCCGG CGGCAGCCCT ACGCCGTAGA TGTCAAGAGG
GTTATAGAAC TGGCGGCGGA GACGGGGACC ATCCTGGAGA TCAACGCCAG CCCCGAACGG
CTGGATCTAA ACGATACCTC GGCCCGCCTG GCCAAAGAAT ACGGCGTACC CATCGCCATT
GATACCGATG CCCATGACCC TCACCGTCTC GCGGACATGG AGTACGGCGT CCTCACCGCC
CGGCGCGGTT GGCTGGAACC CGCGGACGTA GTCAACACCT GGGAACTGGA ACGGCTGCTG
GCCGGGTTGA AGCGGAACAG GCACGGGGCG TAA
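
A GC content figure like the one in the Gene Information section can be recomputed from a nucleotide sequence. The sketch below assumes GC content means the percentage of G and C among the unambiguous bases; how the database derived or rounded its own value is not stated here. The name `gc_content` is illustrative, and only the first 60 bases of the gene sequence are used as input.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among A/C/G/T characters; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First block of the gene sequence above, copied with its spacing.
first_block = "TTGACCAACC TGGAGCTGGC CTGGGCCCTG GCGGAAATGG GCGACTTGCT GGAGTTAAAG"
print(round(gc_content(first_block), 1))
```

Passing the full 1953 bp sequence instead of `first_block` gives the value for the whole gene.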
 
Protein sequence
MTNLELAWAL AEMGDLLELK GEEPFKVRAY HRAARSLENL EEEAADLYAR GALEEIPGVG 
KNLAKKLAEL LTTGRSTFLD NLRREVPPGL REMLAIPGLG SRTVRQIHQG LGITTLAELE
QAARERRIRT LPGLGSKTEL AILRGLEMLR EVQDRVPLGV ARPLALLLRA QLLALPGVVR
AEIAGSVRRG KEMVGDIDLV AAVEPDNQVA AVLVRHPQVK EVLAREPDRL ALQTNLGLKI
EVIMVPPEDF PATLFYATGS KAHRRALLRL AAERGLGAAD LGLVTPRWLA EEEDVLAGGT
TEAPGKGGGS HGEAAAAFAT SGATAKEDTP GVAGGAPGTG VPPAHAGAPL THAGTGTNAR
EEHAGVREPV EAAFYQRLGL PYIVPELRED RGELAAARRG ELPHLVTLAD IRGDLHMHSR
YSDGVETIAA MAAAARARGY QYIAITDHSR SLTVARGLSL EQLKAQREEI ARLNEELEGI
TILAGIEVDI LADGRLDYED EVLKEFDLVI ASIHSGFRQE REQIMARLEA ALRNPYVDIL
GHPTGRMLGR RQPYAVDVKR VIELAAETGT ILEINASPER LDLNDTSARL AKEYGVPIAI
DTDAHDPHRL ADMEYGVLTA RRGWLEPADV VNTWELERLL AGLKRNRHGA
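
The gene sequence begins with TTG, while the protein sequence begins with M. This fits translation table 11, where TTG is an accepted alternative start codon and is read as methionine when it initiates translation. The sketch below translates the first 60 bases of the gene sequence this way. The codon-to-residue string is the standard genetic code, which table 11 shares; the start-codon set is the one listed for NCBI table 11. The name `translate_cds` is illustrative, and this is not necessarily how the database produced its protein sequence.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_block = "TTGACCAACC TGGAGCTGGC CTGGGCCCTG GCGGAAATGG GCGACTTGCT GGAGTTAAAG"
print(translate_cds(first_block))  # MTNLELAWALAEMGDLLELK
```

The output matches the first 20 residues of the protein sequence above.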