Gene Moth_1154 details

Gene Information

Locus tag: Moth_1154
Symbol:
ID: 3833122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1184542
End bp: 1187307
Gene Length: 2766 bp
Protein Length: 921 aa
Translation table: 11
GC content: 60%
IMG OID: 637829085
Product: DNA polymerase III, epsilon subunit
Protein accession: YP_430011
Protein GI: 83590002
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases; [COG1199] Rad3-related DNA helicases
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family; [TIGR01407] DnaQ family exonuclease/DinG family helicase, putative
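
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1184542
end_bp = 1187307
gene_length_bp = 2766
protein_length_aa = 921

# Assumption: coordinates are 1-based and inclusive on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # coordinate span matches gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop matches protein length
```

Under these assumptions all three checks hold: 1187307 - 1184542 + 1 = 2766, and 2766 / 3 - 1 = 921.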


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCCCC AGACTTATGT CGTCCTGGAT GTGGAAACCA CCGGCCTGGA TCCAGCCCGG 
GACAGAATAA TTGAGATTGC TGCCGTCCGC CTGGAAGGAG GCAACATTAC CCGCCAGTTT
CAAACCCTGG TGAACCCGGG CCGGCCTATA CCCCCGGCCA TCGAGAGGTT GACGGGTATT
AGTGACGCCA TGGTCCGCGA GGCGCCCCCC CTTCCTGAAG TTTTGCCCGG GCTGCTGGAT
TTATTCAGGG ATGCCATCCC AGTAGGCCAT AACGGCACCT TTGACCTGGC CTTCCTAAAC
CAAGCCCTGG GCCACGGCTG GCATTCCCCG CTGCTGGATA CCCTTGCCCT GAGCCGGATT
CTCTTTCCCT GCCTGGCTTC TCACCGCCTG GATTATATGA GCAAGTACCT GACCCTCGAA
GCTACCGGCC ATCATCGCGC CCTGGATGAT GTGTTAACTA CCGCCCGCCT CCTGGAGAAC
CTCTGGCAGG CCACCCTGGA ACTGGACAAA AACCTGCTGA CAAAACTCCT GAACCTAGCC
CCGGTCGGCC TCCAGTCCTG GTTCCGGGCT GCTCTGGTTC AGGGAACGCC AGCCAGCCAT
TTTGAAGTGG CCGCTACCGG GTTATTTGCC CCGACCAGGG TACCCTCCCC GCAGTCTGGA
ACCTTGCCGG CCTTTAATGT CGACGACCTG GTAGCCATGC TGGACCACAG GGGTCTGCTG
GCCGAGCAGA TACCAGGTTA TGAATACCGT CCCCAGCAGG TGGAAATGCT CAGGGCTGTA
GCCTCGGCCC TGGCCGGCAA TCACTACCTG ACAGTAGAAG CCGGCACAGG GACCGGCAAA
TCTCTGGCCT ACCTTTTGCC GGCGATTTAC TGGGCCTGTA GCCAGAGAAA GAGGGTAGCC
ATTGCCACCC ATACCATCAG CCTGCAGGAA CAACTCTGGC AGAAGGACCT GCCCCAGCTA
CGGGAACTCC TGCCCTTTTC CTTTAAGGCG GCCCTGGTTA AAGGACGGAG CAACTATATC
TGCAGGCGGA AGTTGCGGGA CTACCTGGCT AACCCGCCGG CCGGGGAGGC AGAGCGTCTC
TTCGCCATGC GGGTTTTACG CTGGCTGGAG GTAACAACCA GCGGCGACTG GAGCGAAATG
AAACTTACCC CGGAAGAAGA AGGCTTCAAA TTTGCCCTGG CGGCCGATAC CGAGACCTGC
ACCGGTAGCG CCTGCCCCTT CAACGATGAA TGTTTTGTCA ATGCCGCCCG CCGGGAGGCT
GAGGCTGCCA ATATCCTGAT CTTGAACCAT TCCCTTTTAC TGAGCGATAT CCGCTTAAAC
AACCAGGTCC TGCCTGACTA CCCCTACCTG ATTATCGATG AAGCCCATCA CCTGGAGGAG
GCGGCTACCG AACACCTGGG GAGCAGTGTC AGCCAGGCCA GCTGTGAGCT TTTCTTTCGC
CGCCTGGGCC GGGGTGAGCA GGCCTATAGC TTCCTGGGTC GGGTCCGCAA TCTCGCCCGC
CGTTTACCTC CGGAAGGGAA CCTTGAATTG GCAGACTTCC TGGAGGATAT GGAACTAACA
GTAACAGCAA CCCTGGCCGG TTGGCAGGAG TTCTGGGAGA GCTTGGGCAG GTTAAGTGAT
GCCGCCCGCT GGGAAGAGGC GGGTTATACC CTGCGTTTCA CCAGCCGTTT AAAGGAAACC
CCTGCTTGGG ACAACCTGCT GTCAGTCTTC GGGAGCCTGG AGGAAAACCT CAGCGGCCTG
GCCAGTCGCC TGGAGCGCCT CTCGGAGTTA TTGAGCGCTG CCGGGGCCGG CGAGTTTGCT
GCCGATGCCG GTAATTTCGC CGCCGTTGTT GCCCAGTACA GCTATGATCT GGGGCAGATC
CTTGATGCCG ACCCGGCCAC CAGCGTCAGC TGGCTGGAAA AGAATAACCA TGGCCAGTAT
ATCTTACGCT CCGCCCCCCT GGATATTGGT CCCCTGCTGG CAGAACTTCT CTTTTCCCGC
AAGCAGGCTG TGATCCTGAC CTCGGCCACC CTGACAGTCA ACAATAGCTT TGATTATTAC
CACCAGCAGA CCGGCCTCCA GGAACTGCCA GCCGACAGGG TCGTCAGCTG CCAGGTATCC
TCACCCTTTG ACTACCGGTC GCAGGCCCTG GTTTGTTCTA TTAGAGGGCT GCCCAACCCG
GGCCAGTTAA AGGATGCCGA CTATGCCCGG GCTATTACCC CGGTCCTGAC AGCCATCTGC
CCGGCCGTCG GCGGCCGTTC CCTGGTTCTC TGTACCTCCC ACCGTTTTTT ACGGGAAGTC
TACGAACTTT TAAGCGCCGA CCTCAACGGC AGCGGTTACC GGGTCCTGGC CCAGGGAATA
GACGGCAGCC GTTCCCGCCT GCTGGAAGAA TTCATCCAGA CTCCCCGGGC TGTCCTTCTG
GGGGCCAACA GCTACTGGGA AGGTATCGAC CTGCCCGGGG ATCTGCTTCG CTGCGTCATC
ATCCCGCGTT TACCATTCCC CTCCCCGGGC ATACCGACCC TGGCGGCGCG GATGGAACAC
CTGGCCGCCC GGGGACAGAA TGCCTTTGCC ACCCTGAGTC TACCCCAGGC GATTATTCGT
TTCCGCCAGG GGTTTGGTCG CCTGATCCGG CGGGCCAGCG ACAGAGGAGT ACTGGTAATC
CTTGACCAGC GCCTCCTCTC CCAGCGGTAC GGCCGCCTTT TTATCCAGTC CCTGCCGCCA
GTAACCCTTG AGGAGGTTGA CCCCGCCGGG GCTCCCTCCC GGATAAAAAC ATGGTTTCAG
GGTTGA
 
Protein sequence
MLPQTYVVLD VETTGLDPAR DRIIEIAAVR LEGGNITRQF QTLVNPGRPI PPAIERLTGI 
SDAMVREAPP LPEVLPGLLD LFRDAIPVGH NGTFDLAFLN QALGHGWHSP LLDTLALSRI
LFPCLASHRL DYMSKYLTLE ATGHHRALDD VLTTARLLEN LWQATLELDK NLLTKLLNLA
PVGLQSWFRA ALVQGTPASH FEVAATGLFA PTRVPSPQSG TLPAFNVDDL VAMLDHRGLL
AEQIPGYEYR PQQVEMLRAV ASALAGNHYL TVEAGTGTGK SLAYLLPAIY WACSQRKRVA
IATHTISLQE QLWQKDLPQL RELLPFSFKA ALVKGRSNYI CRRKLRDYLA NPPAGEAERL
FAMRVLRWLE VTTSGDWSEM KLTPEEEGFK FALAADTETC TGSACPFNDE CFVNAARREA
EAANILILNH SLLLSDIRLN NQVLPDYPYL IIDEAHHLEE AATEHLGSSV SQASCELFFR
RLGRGEQAYS FLGRVRNLAR RLPPEGNLEL ADFLEDMELT VTATLAGWQE FWESLGRLSD
AARWEEAGYT LRFTSRLKET PAWDNLLSVF GSLEENLSGL ASRLERLSEL LSAAGAGEFA
ADAGNFAAVV AQYSYDLGQI LDADPATSVS WLEKNNHGQY ILRSAPLDIG PLLAELLFSR
KQAVILTSAT LTVNNSFDYY HQQTGLQELP ADRVVSCQVS SPFDYRSQAL VCSIRGLPNP
GQLKDADYAR AITPVLTAIC PAVGGRSLVL CTSHRFLREV YELLSADLNG SGYRVLAQGI
DGSRSRLLEE FIQTPRAVLL GANSYWEGID LPGDLLRCVI IPRLPFPSPG IPTLAARMEH
LAARGQNAFA TLSLPQAIIR FRQGFGRLIR RASDRGVLVI LDQRLLSQRY GRLFIQSLPP
VTLEEVDPAG APSRIKTWFQ G
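
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a minimal sketch, not the database's own pipeline: it assumes the codon assignments of NCBI translation table 11, which are the same as the standard genetic code for elongation codons (the tables differ only in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for the example.

```python
from itertools import product

# Codon table in NCBI base order T, C, A, G (third base varies fastest).
# Assumption: table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the 10-base block layout.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence above.
first_blocks = "ATGCTCCCCC AGACTTATGT CGTCCTGGAT"
last_line = "GGTTGA"

print(translate(first_blocks))  # MLPQTYVVLD
print(translate(last_line))     # G
```

The first ten codons give `MLPQTYVVLD`, the first block of the protein sequence, and the final six bases give a glycine followed by the TGA stop codon, matching the terminal `G`.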