Gene Moth_0173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0173 
Symbol 
ID3831113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp172158 
End bp174347 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content38% 
IMG OID637828110 
Producthypothetical protein 
Protein accessionYP_429052 
Protein GI83589043 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAAA CGATCAGGAT AAAAGAGATT ACACTGGAGG GCTTTAAGTG TTTTTTTAAA 
AAGCAGTCAA TTAATTGTGA TGCCGATGTC ATAATTCTTA CCGGAAATAA TGGTTTTGGT
AAAACTTCAT TTATTGAGGC CCTCGAACTT ATGGCTACAG GGAAAATTGA GCCGCGGATA
GGTGAAAATA GCACTAAGAG GCCTTTCGAG AACTATATCA ATAGGAATTT GTTATCAGGA
GAAGCCAAAG CCAGGATAGA GGTCAAATGT CAAGACGGGC GAGTTTTTCA AGCAACGCTA
TTACCGGAAA GATTAGATGA AGAAAGTACT CTATTTCCAC TCAATGGTTT TTCAACTTCA
TTTGTACGCA GTAGTTGTTT TTTTTATCAG GATGGGGTAG AGAGTACATT TGGCAGGTCC
CAGGAAGGAA TTCAAGGAAT AGCTGATGCT TTCGTAGGCG ATTTTCCTAA TAGAGAAATC
CTGCTGGATG CGCTGGTAAA AGCGGCCGCT GAGGTTGAGA ATATTCGAAG TAGCTTTCGA
GCCAATTTAG GCAATGAGGA TATTTTAAAA AATGAAGCGG CTGAGCAGGC CAGAAATTTA
CAAGCAATCA TAGAATCAAA TGAGTATCTA AAACAGCTAA TTCCATTCCC AGCAAAGTTG
GTTATCGCAA GTGGAAAATT GTCCATTCGG TATAAGGAAA ATATACAAAA ATGGGTTGCA
GCTTTAACCG GGCATGAACA GGGAGACCAG CCTGTATGTA AATTACTTGA GGAAGCAATT
AAAGCAATTG AAAGTATAAG GAAGCAAGCT TTGGCTGATA TACCACCACA AGCTGATATA
AATGAAGGAG TTGCCAGGTT AAAAGAATTA TTGGATAACC TTCCTGATTT TTTTCCTGTA
ACTGAAACTT CCCCTAAGGG AAATATTGCC TTAAAAGAGC TTACTGAGAA GGAAAAAGCC
CTTGAACTAG AGCTCGCCAG GATCAAGACG CAGATGGGGC TTCTTTTAAG TGAAGAACAA
CTACCTATTC CAGGCACAAA TATTTCAACC TGGATGGGGG AAATCCCCCT TATTGCTTCA
ATCGATTATA TGTTAAAGGT AAAGGCTAAT TCCTCGGAAG TAGTAGTTGA TGAGATATGG
TCGATGTTGG AGAAGCATGG TGGTGATTGG AGCGATAAAT TAAAAAGCAA GTGGCTTCAA
TATAAACAAC TCCGTGACGA AGCCACGGCA ATTACCCAAA AGATTTTTGA GATTAGGAAA
GAAATTACTG GTATAAAGAA AAAAGAGGTG GATTTTGATA ATCTTTTGCA AGCCCTCAAA
TTATGGAGTG CCATTTTTGG AGCTCCTTTA GAAAAAGTAA ACTTGGAAGA TGGTACTGGA
TATGATCTTA AAAAGGCCCG TATGATGCTA AAAGAAAAAT TCGTGAATAT AAATGCTGCT
GAAACAATGA CAGCTTGGAA TAAAGTATTG GAGTATCTAC ATAAATGGCA ACAGATTGAA
CAAAAGCTAG AAACAATTAA ATTAAATTGG ACCCCAGCAG CAAGAGAAGC CGATAAAAAG
CTTTCAGATT TTATTAGCTG GGCCACTAAG AGAAATTTAG ATAAATGGAT TGAGGAAATA
AAAGGCAATG CTTTAAGGGA AAATTATATT TCTAAATTAA ACCAAGCCCT TGAGTATATT
CTTTTTCAAT TCGGTATGGG TAAAGAACTG GCCAAAAATG TACATTTTAA ACGTAGTAAT
GGTAAGTTAT ATATACAGCG GTGGTCTGAA GATAATATAA TAAGAGAAAC AAATATTTTT
CCTACTTTCT CGACAGCCCA AATAAATCAA TTGGCAATTG CCTACATGCT TGCCCTTAAT
TGTGGGGTTA AAAATCATCC ATTAGGTTTT ATTTGCCTGG ATGATGTATC TAGCGCATTT
GATTTAAATA ATCTTGCAGC CGATGCCCAT CTAATTCGCA TGTTGGCTTA TGGTAATAAT
CAGAGGCGGC AGGTTTTTAT TACCAGCCAT CATGATATTA TTACTTCACG ACTTTTGCCC
TTACTATTGC CACCTAGTGG ATTTAGACTT AAAGTGATTG AATTTACTTC TTTCCGGCCA
GAAACAGGAC CAGAACTAAG GTTTTATGAA TGTAAGCAGG CAAGATCAGG GTACATAGCC
TTGCAGAATG TTTTTAAGGT GGAGGCATGA
 
Protein sequence
MEETIRIKEI TLEGFKCFFK KQSINCDADV IILTGNNGFG KTSFIEALEL MATGKIEPRI 
GENSTKRPFE NYINRNLLSG EAKARIEVKC QDGRVFQATL LPERLDEEST LFPLNGFSTS
FVRSSCFFYQ DGVESTFGRS QEGIQGIADA FVGDFPNREI LLDALVKAAA EVENIRSSFR
ANLGNEDILK NEAAEQARNL QAIIESNEYL KQLIPFPAKL VIASGKLSIR YKENIQKWVA
ALTGHEQGDQ PVCKLLEEAI KAIESIRKQA LADIPPQADI NEGVARLKEL LDNLPDFFPV
TETSPKGNIA LKELTEKEKA LELELARIKT QMGLLLSEEQ LPIPGTNIST WMGEIPLIAS
IDYMLKVKAN SSEVVVDEIW SMLEKHGGDW SDKLKSKWLQ YKQLRDEATA ITQKIFEIRK
EITGIKKKEV DFDNLLQALK LWSAIFGAPL EKVNLEDGTG YDLKKARMML KEKFVNINAA
ETMTAWNKVL EYLHKWQQIE QKLETIKLNW TPAAREADKK LSDFISWATK RNLDKWIEEI
KGNALRENYI SKLNQALEYI LFQFGMGKEL AKNVHFKRSN GKLYIQRWSE DNIIRETNIF
PTFSTAQINQ LAIAYMLALN CGVKNHPLGF ICLDDVSSAF DLNNLAADAH LIRMLAYGNN
QRRQVFITSH HDIITSRLLP LLLPPSGFRL KVIEFTSFRP ETGPELRFYE CKQARSGYIA
LQNVFKVEA