Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1229 |
Symbol | |
ID | 3833170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1268308 |
End bp | 1269510 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829164 |
Product | DNA polymerase IV |
Protein accession | YP_430086 |
Protein GI | 83590077 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000257934 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGTCCA GCGATTGTTC CATTCTCCTG TGCGATGCTA ACAGTTTCTT TGCCTCTGTG CATCAGGCCC TGGACCCGGG TTTACGCGGG CGCCCGGTTA TCGTCGCCGG CCGGGAGGCT ACCCGCCACG GCATTGTCCT GGCAGCCAGC TATGAGGCCA AGCTGGGTTA CGGTATCAAG ACCGGCATGA CGGTCCGGGA AGCCAGGGGC CTTTGCCCCC ACGGGGTCTT TATTCCTCCC CGTCATGACC TGTACATCGA GTTTTCTACC CGGATCTTAC GTATTATGCG GGACTTTACT CCCCTGGTAG AGCCCTTCTC TATAGACGAA GCCTGGCTGG ATGTCCGCGG CTGCCGGGAT CTCCACGGCT CGCCCCTGAC CGTAGCCCGG CGGCTGAAGC AAAGGATCAG GGAGGAAGTG GGGATTACCA CCAGCGTCGG CCTGGGGCCT TCTAAACTCC TGGCCAAGAT GGCCGCTGAG ATGCAGAAGC CTGATGGTCT GACCGTCCTG GATTACGCCG ATGTTCCCGG GAAGATGTGG CCCCTCCCCG TCCGGGAACT CTTCGGCATC GGCCCCCGTA TGGAGGCCCA CCTGGCCAAA CTCGGTATCC ATACCATCGG GGAGCTAGCC GGTTTCCCTG TTGAGGTGCT CATTAAGCGT TTTGGGGTTG TGGGCCGGAT TCTCCACCAG TGTGCCCGGG GCATCGACTA CAGTCCCGTG GACCCCCATT CCCTGGACAC AGTTAAATCC ATCGGCCACC AGATCACCCT GCCCCGGGAC TACCGGGGCT ACGAGGAAAT CGAGGTGGTC CTGCTGGAAC TGGCTGAACT GGTGGCCCGG CGGGTGCGCC TGGGAGGTTA TCTGGGCCGG ACGGTGGCTA TAAGCCTCAA GGATCCGGAG TTTCACTGGC TGGGGCGCTC CCGTACCCTG CCCCATTATA CCGATACCGC GGGGGATATT TACGCCGCGG CCCGGCATCT CCTGCACCGC CACTGGCCGG AATGGCGAGC CGTGCGGCTG GTCGGGGTCA GCCTGGCCGG CCTGGTGCCG GCGACGGTGC GCCAGGAAGA TCTTTTCGGC CGGGTGGAAA GGCAGGCCCG CCTTGATCGG GCCTGCGACC AGTTAAAAAA CCGCTACGGT GAAAGGGTTA TTCACCGGGC GGTATCTTTA ACCGGGGCGG GGGTGCTCTA TGGGGGGAGC TAA
|
Protein sequence | MVSSDCSILL CDANSFFASV HQALDPGLRG RPVIVAGREA TRHGIVLAAS YEAKLGYGIK TGMTVREARG LCPHGVFIPP RHDLYIEFST RILRIMRDFT PLVEPFSIDE AWLDVRGCRD LHGSPLTVAR RLKQRIREEV GITTSVGLGP SKLLAKMAAE MQKPDGLTVL DYADVPGKMW PLPVRELFGI GPRMEAHLAK LGIHTIGELA GFPVEVLIKR FGVVGRILHQ CARGIDYSPV DPHSLDTVKS IGHQITLPRD YRGYEEIEVV LLELAELVAR RVRLGGYLGR TVAISLKDPE FHWLGRSRTL PHYTDTAGDI YAAARHLLHR HWPEWRAVRL VGVSLAGLVP ATVRQEDLFG RVERQARLDR ACDQLKNRYG ERVIHRAVSL TGAGVLYGGS
|
| |