Gene Moth_1841 details


Gene Information

Locus tag: Moth_1841
Symbol:
ID: 3831701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1897551
End bp: 1900208
Gene Length: 2658 bp
Protein Length: 885 aa
Translation table: 11
GC content: 62%
IMG OID: 637829772
Product: DNA polymerase I
Protein accession: YP_430684
Protein GI: 83590675
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
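
The start and end coordinates, gene length, and protein length in this record agree with one another, and a short check makes the arithmetic explicit. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1897551, 1900208)
print(length, protein_length(length))  # 2658 885
```

Both values match the Gene Length and Protein Length fields of the record.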


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0941024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCACCA AAACCGCTAG ACTTCTCCTG GTCGACGGCA ACAGCGTCAT TCACCGGGCT 
TTTCATGCCC TGCCCCCCCT GCAGACCAGG GAAGGTATTC GCACCAATGC CGTCTACGGC
TTTGCAACCA TGTTGCAAAA GGCCAGGGAG ATGTTTAAAC CCGATTATAT TATTGTCGCC
TTTGACCATA GTAAGGTTAC CTTTCGTAAC GAACTGTACG ATGAATATAA AGGAACCCGG
CCGGAGACCG ACCCTGAGCT CAGGCCCCAG TTTGCCCTGG TCAAACGCCT CCTGGCAGCC
TGGAACCTGG CCAGTTGTGA GGTTGAGGGC TACGAAGCCG ACGACCTTAT CGGCACCTTA
AGCCGCCAGG GCGCCGCCAC GGGCCTGGAA GTCCTCATCC TCACCGGGGA CCGCGACGCC
CTGCAACTAG TGGGAGAACG GGTGAAGGTC CTCCTTATGC GGCGCGGCCT TTCCCAGGTA
GAGGTCATCG ACCGGGAAGC AATCAAGAAA AACTATGGCC TGGAACCGGA GCAGCTCATT
GACGTCAAGG CCCTGATGGG CGACGCCTCG GATAATATTC CCGGCGTACC GGGGGTGGGG
GAGAAAACGG CCGTCCAGCT CGTCCGCCAG TACGGCGACC TGGAAGGAGT CCTGGCCCAC
AGCGGGGAGA TAAAAGGACG CCGGGTAGCG GAGAACCTGG TGACCTTCGC CGACCAGGCG
CGCCTGGCCC GGCGCCTGGC CACAATTGAC TGCCAGGCTC CTGTGACCCT CGACCTGGCA
GGGTGCTGCA ACCAGTCGCC GGACTACGAG GCCGTCCTGG CCCTTTATAA AGAACTGGAG
TTCCACAGTC TGGTCAAGGA CGTCCTCAGG GCCATGGAAC AGGAAGGCAA GAAGGCCTCG
CAAGAAACGA CTGCTGTCCG GGGACTGTCG CTGCCGGACC CACTCACCCT GGACAGCCTG
GAAGAACTGG CGGAACTGGT GGCCAGGTTG GTCGGGAAGA CGGATGTGGC CCTGGAATTG
ATTCTTAATA ATCCCAGCTA CCTGGAGGCC GCAGCCGTCG CCGTAGGGCT GGCCTGGGAG
GATGGGGTGG CGGTCCTGGG TACAGCCGGG ATAGAGCCTG CCGCCCTGGC CGGTACCCTG
GAACCATTGC TCCGGGTCAA CCCCATCTTC CACGACGCCA AAAGGGCCCT GGTCTGGTTC
AGCAATGCCG GGGCGGGGGT TGCCGACCCC GGCGGCGATA CTATGGTGGC CGGTTATCTC
TTAAACCCCT CGGCCTCGCG CCATGACCTG CCCGAACTCT GCCTGGAACA CTTGAACCTG
GCCCTGGTGG AGGGGGATTC CCCGCAACTG GCAGCGGCCC GGCGGGCCGC TGTCATCAGG
TTGCTCCACC GGGAACTGGC CAGTAAACTC CAGGTTGCCG GGATGGAGAA CCTCTACCGG
CGGGTGGAGT TGCCCCTGAC CCGGGTCCTA GGGGCCATGG AAAGCTACGG CGTGGCCGTC
AACATGGAAA CCCTGGATCT CATGGGAATA GAGCTGGAGG GTGGGCTGGC CGCCCTCACA
GAGGCCATCT ACGAGCTGGC CGGGGAAGAG TTTAACCTGA ATTCCCCCAA GCAGCTGGCC
GTTATACTCT TTGAAAAGCT CGGACTGCCG CCGGTAAAGC GTACCAAGAC CGGTTTTTCT
ACCGATGCCG CCGTCCTGGA GGAACTGGCC TGCAGGCACC CCATAGCCGC CAAACTGGTC
GAGTACCGCC AGCTGGCCAA ACTGAAATCG ACCTATGTGG ATGGTCTTAA ACCCCTGGTC
AATCCCCGCA CGGGGAGCCT GCATACCAGC TTTAACCAGA CGGTGACGGC CACCGGCCGC
CTCTCCAGCA GCGAGCCCAA TCTCCAGAAT ATCCCGGTGC GCCTGGAACT GGGCCGGCGC
CTGCGCAAGG CCTTTGTACC CCACGGTCCC GGCCGGTTGC TCCTGGCCGC CGACTACTCC
CAGATCGAGC TGCGCATCCT GGCCCATATT TCCGGCGATG AAGCCATGAT TGCAGCCTTT
CGCCGGGGGG AGGATATCCA CGCCCGGACT GCGGCCGAGG TCTTCGGCGT CCCCCTTGGT
GAAGTGACGC CGGCTATGCG CCGCAGCGCC AAGGCGGTGA ACTTCGGTAT TGTCTACGGC
ATCAGCGACT ACGGCCTGAG CCGGGATCTG GGGATAAGCC GCAGCGAAGC CCACGATTAT
ATCGAACGTT ACTTTCGGCG TTACCGGGGC GTCAAGGCCT ATCTGGAGGA GATCGTAGCC
CGGGCGCGGC AGGAGGGCTA TGTCACCACC CTCCTGGGCC GCCGCCGTTA TCTGCCGGAC
CTCTTTAGCT CCAACCGCAA TGTCCGCAGC TTTGGCGAGC GCACGGCCAT GAATACGCCT
ATCCAGGGCA CGGCCGCCGA CATCATCAAG ATGGCCATGG TGAAAATCTT TCGCCTCCTG
GAGGCACAAT ACCCGGCCGC GCGTATGATC CTCCAGGTCC ACGACGAACT CATCTTTGAT
GTCCCGGATG ACGACCTGCC GGCCGTGGCC GGCCTGGTCA AGGATACCAT GGAGCATACC
CTGGAACTCC AGGTCCCCCT CCAGGTAGAT TTAAAGGCCG GGCCCAACTG GTATGACCTG
GAGCCGTATA AGGAGTAA
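
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the spacing has to be removed before any computation such as GC content. The following is a minimal sketch of that step; the helper names are hypothetical, and only the first line of the sequence is used here for illustration, so the printed GC percentage describes that fragment rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing (illustration only).
first_line = "ATGCCCACCA AAACCGCTAG ACTTCTCCTG GTCGACGGCA ACAGCGTCAT TCACCGGGCT"

fragment = clean_sequence(first_line)
print(len(fragment))  # 60
print(f"GC content of fragment: {gc_content(fragment):.0%}")
```

Applying the same two functions to all lines of the listing, joined together, gives a value that can be compared against the GC content field of the record.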
 
Protein sequence
MPTKTARLLL VDGNSVIHRA FHALPPLQTR EGIRTNAVYG FATMLQKARE MFKPDYIIVA 
FDHSKVTFRN ELYDEYKGTR PETDPELRPQ FALVKRLLAA WNLASCEVEG YEADDLIGTL
SRQGAATGLE VLILTGDRDA LQLVGERVKV LLMRRGLSQV EVIDREAIKK NYGLEPEQLI
DVKALMGDAS DNIPGVPGVG EKTAVQLVRQ YGDLEGVLAH SGEIKGRRVA ENLVTFADQA
RLARRLATID CQAPVTLDLA GCCNQSPDYE AVLALYKELE FHSLVKDVLR AMEQEGKKAS
QETTAVRGLS LPDPLTLDSL EELAELVARL VGKTDVALEL ILNNPSYLEA AAVAVGLAWE
DGVAVLGTAG IEPAALAGTL EPLLRVNPIF HDAKRALVWF SNAGAGVADP GGDTMVAGYL
LNPSASRHDL PELCLEHLNL ALVEGDSPQL AAARRAAVIR LLHRELASKL QVAGMENLYR
RVELPLTRVL GAMESYGVAV NMETLDLMGI ELEGGLAALT EAIYELAGEE FNLNSPKQLA
VILFEKLGLP PVKRTKTGFS TDAAVLEELA CRHPIAAKLV EYRQLAKLKS TYVDGLKPLV
NPRTGSLHTS FNQTVTATGR LSSSEPNLQN IPVRLELGRR LRKAFVPHGP GRLLLAADYS
QIELRILAHI SGDEAMIAAF RRGEDIHART AAEVFGVPLG EVTPAMRRSA KAVNFGIVYG
ISDYGLSRDL GISRSEAHDY IERYFRRYRG VKAYLEEIVA RARQEGYVTT LLGRRRYLPD
LFSSNRNVRS FGERTAMNTP IQGTAADIIK MAMVKIFRLL EAQYPAARMI LQVHDELIFD
VPDDDLPAVA GLVKDTMEHT LELQVPLQVD LKAGPNWYDL EPYKE
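
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration under stated assumptions: it builds the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as starts), and it stops at the first stop codon. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # tolerate the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence; each full line holds 60 bases,
# so every line starts in frame.
print(translate("ATGCCCACCA AAACCGCTAG ACTTCTCCTG GTCGACGGCA ACAGCGTCAT TCACCGGGCT"))
# MPTKTARLLLVDGNSVIHRA
print(translate("GAGCCGTATA AGGAGTAA"))
# EPYKE
```

The two outputs match the first 20 residues and the last 5 residues of the protein sequence listed above.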