Gene Moth_1229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1229 
Symbol 
ID3833170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1268308 
End bp1269510 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content62% 
IMG OID637829164 
ProductDNA polymerase IV 
Protein accessionYP_430086 
Protein GI83590077 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000257934 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTCCA GCGATTGTTC CATTCTCCTG TGCGATGCTA ACAGTTTCTT TGCCTCTGTG 
CATCAGGCCC TGGACCCGGG TTTACGCGGG CGCCCGGTTA TCGTCGCCGG CCGGGAGGCT
ACCCGCCACG GCATTGTCCT GGCAGCCAGC TATGAGGCCA AGCTGGGTTA CGGTATCAAG
ACCGGCATGA CGGTCCGGGA AGCCAGGGGC CTTTGCCCCC ACGGGGTCTT TATTCCTCCC
CGTCATGACC TGTACATCGA GTTTTCTACC CGGATCTTAC GTATTATGCG GGACTTTACT
CCCCTGGTAG AGCCCTTCTC TATAGACGAA GCCTGGCTGG ATGTCCGCGG CTGCCGGGAT
CTCCACGGCT CGCCCCTGAC CGTAGCCCGG CGGCTGAAGC AAAGGATCAG GGAGGAAGTG
GGGATTACCA CCAGCGTCGG CCTGGGGCCT TCTAAACTCC TGGCCAAGAT GGCCGCTGAG
ATGCAGAAGC CTGATGGTCT GACCGTCCTG GATTACGCCG ATGTTCCCGG GAAGATGTGG
CCCCTCCCCG TCCGGGAACT CTTCGGCATC GGCCCCCGTA TGGAGGCCCA CCTGGCCAAA
CTCGGTATCC ATACCATCGG GGAGCTAGCC GGTTTCCCTG TTGAGGTGCT CATTAAGCGT
TTTGGGGTTG TGGGCCGGAT TCTCCACCAG TGTGCCCGGG GCATCGACTA CAGTCCCGTG
GACCCCCATT CCCTGGACAC AGTTAAATCC ATCGGCCACC AGATCACCCT GCCCCGGGAC
TACCGGGGCT ACGAGGAAAT CGAGGTGGTC CTGCTGGAAC TGGCTGAACT GGTGGCCCGG
CGGGTGCGCC TGGGAGGTTA TCTGGGCCGG ACGGTGGCTA TAAGCCTCAA GGATCCGGAG
TTTCACTGGC TGGGGCGCTC CCGTACCCTG CCCCATTATA CCGATACCGC GGGGGATATT
TACGCCGCGG CCCGGCATCT CCTGCACCGC CACTGGCCGG AATGGCGAGC CGTGCGGCTG
GTCGGGGTCA GCCTGGCCGG CCTGGTGCCG GCGACGGTGC GCCAGGAAGA TCTTTTCGGC
CGGGTGGAAA GGCAGGCCCG CCTTGATCGG GCCTGCGACC AGTTAAAAAA CCGCTACGGT
GAAAGGGTTA TTCACCGGGC GGTATCTTTA ACCGGGGCGG GGGTGCTCTA TGGGGGGAGC
TAA
 
Protein sequence
MVSSDCSILL CDANSFFASV HQALDPGLRG RPVIVAGREA TRHGIVLAAS YEAKLGYGIK 
TGMTVREARG LCPHGVFIPP RHDLYIEFST RILRIMRDFT PLVEPFSIDE AWLDVRGCRD
LHGSPLTVAR RLKQRIREEV GITTSVGLGP SKLLAKMAAE MQKPDGLTVL DYADVPGKMW
PLPVRELFGI GPRMEAHLAK LGIHTIGELA GFPVEVLIKR FGVVGRILHQ CARGIDYSPV
DPHSLDTVKS IGHQITLPRD YRGYEEIEVV LLELAELVAR RVRLGGYLGR TVAISLKDPE
FHWLGRSRTL PHYTDTAGDI YAAARHLLHR HWPEWRAVRL VGVSLAGLVP ATVRQEDLFG
RVERQARLDR ACDQLKNRYG ERVIHRAVSL TGAGVLYGGS