Gene Moth_1665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1665 
Symbol 
ID3831936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1698764 
End bp1701022 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content62% 
IMG OID637829590 
Producthypothetical protein 
Protein accessionYP_430510 
Protein GI83590501 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.590351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000463934 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCCGATC TGCGCCGTCC CAACCGTTTA ATCCATGAAA AAAGCCCCTA TCTGCTGCAG 
CACGCCTACA ACCCCGTCGA TTGGTATCCC TGGGGGGAAG AGGCCTTTGC CCGGGCCAAG
CGGGAAGATA AGCCGGTATT TTTATCCATC GGCTATTCCA CCTGTCACTG GTGCCACGTC
ATGGCCAGGG AATCCTTTAA CGATGAAGAG GTAGCCGCCC TCCTCAACGA TAGCTTTATC
GCTATCAAGG TCGACCGGGA AGAACGGCCG GACATCGACC AGGTCTATAT GGCCGCCTGC
CAGGCCCTGA CCGGCAGCGG CGGCTGGCCC CTGACTGTTT TCCTGACGCC GGAGAAAAGA
CCCTTTTATG CCGGCACCTA TTTCCCCAAG CACAACCGGT ACGGCAGGCC CGGCCTGGTG
GAGCTTTTAA AGTTGATCCG GGAAAAATGG GCGACCCACC GCGAGGAGCT GGAAGAATCC
GGCGCGGAAT TGATACAACA CGTGGCCGGC CAATTTGCCC CTACGCCGCC GGGAGAACCC
GGGGCTCAGG TCCTGGAAAA GGGCTGGCAA CAACTCCGGG CCGGTTTCGA CCCTCTCTAT
GGCGGTTTCA GTGAAGCTCC TAAATTTCCC AGCCCCCACC AGCTTTTGTT TTTACTGCGT
TACTGGAAAA GGTATGATGA AGCGGGCGCC CTGGCCATGG TGGAAAAAAC CCTGCAGGCT
ATGTACTGCG GTGGCATTTA CGACCATATC GGTTTTGGCT TCGCCCGCTA TTCTACCGAC
CGCCGCTGGC TGGTGCCTCA CTTTGAGAAA ATGCTCTACG ACAACGCCCT GCTGGCCCTG
GCCTACCTGG AAACCCGGCA AGCAACGGGC AAAGCTGTCT ACAGCCATGT CGCCCGGGAG
ATATTCACCT GGGTGTTGCG GGACATGACC AGCCCGGAGG GAGGATTTTA CTCCGCCCTG
GACGCCGATT CCGAAGGAGA AGAGGGCCGT TTTTACCTCT GGACACCCGA CCAGGTCCGG
GAGGTCCTCG GTGCTAAAGA GGGGGAGTTC TTTTGCCGTT ACTTTGATAT AACCGCCGGA
GGTAACTTCG AGGGGCGCAG TATTCCCAAT TTGATTGGCC GGGGAGAAGC CCTCTTTGCC
GCCGGTACCA GTGGCAATGA GAGCAATGAT ACCGCTGGCG ATCAGAGGCA GCCCCGGGAG
CAAGGCGGGA GAGCAGGCGG CATTTCCGGC GGCGGGGGCT GCGCTAAAGG TAGTCCGGAG
GAGGACCGGC TGCCCGGCCG GGGGCCAACC ACCCTGGCCG GTTTTGGCCC GGCAACGGCG
GCCCGATTGG CTGCGGCCCG CGAAAAACTC TTCGCCGCCC GGGAAAAGCG GGTCCATCCC
CACCGGGACG ATAAAATCCT CACTGCCTGG AACGGGCTGA TGATCGCCGC CCTGGCCCGG
GGCGCCTGGG TGCTCGATGA ACCCGCCTAT GCCGCGGCGG CCGCCAGGGC GGCCCGGTTT
ATCCTAACCC ACTTGCGCGA TGCGGAGGGG CGCCTGCAGG CCCGTTACCG GGAAGGCCAG
GCTGCTTTCC CGGCCTACCT TGACGATTAT GCCTTTCTCA CCTGGGGGCT CATCGAACTC
TACCAGGCCA CCTTTGAGAC AGGTTACCTC CGGGAGGCCC TGGCTTTGAC GCGGCAGATG
CAGGAACTCT TCCGGGACGA AGGGGGCGGC TACTTTTTTA CCCCTCACGG CGCCGGGGAA
CTACCGGTCC GCCCCCGGGA AGTCTATGAC GGCGCCATTC CCTCCGGCAA TTCGGTAGCG
GCCTTAAACC TGCTGCGCCT GGCCCGCATC ACCGGGGACA GCCGGCTGGA AGAAGAAGCC
GCAGCCCAGG TGCGTGCCCT GGCCGGAACG GTGGCCGAAT ACCCCCGGGG CTATTCCTTC
TACCTCTGTG CCCTGGACTT CTACCTGGGG CCGGTAACAG AAATAGTCCT GGCCGGGGAA
CGGGAAACAG AAGATACCCG TGCCCTGCTC CGCGTGCTAA GGGCGGCCTA CCTGCCCTCA
GCCGTCCTGG TGCTGCGTCC CGGCGGCCGG GAGGGTGAGG AAGTAACCAG GCTCATTCCC
TATACCGCCG GCCAGAAACC GGTAAACGGT AAAGCCACCC TATACCTGTG CCGCAACTTC
GCATGCCGGG CGCCGGTTAC GACGGCCGGA GAACTGGAGC AATGGCTAGC GTCTGCCGGA
CGGGAGGCTC ATGGCGCAGG CGACACTTTT AACGAATGA
 
Protein sequence
MPDLRRPNRL IHEKSPYLLQ HAYNPVDWYP WGEEAFARAK REDKPVFLSI GYSTCHWCHV 
MARESFNDEE VAALLNDSFI AIKVDREERP DIDQVYMAAC QALTGSGGWP LTVFLTPEKR
PFYAGTYFPK HNRYGRPGLV ELLKLIREKW ATHREELEES GAELIQHVAG QFAPTPPGEP
GAQVLEKGWQ QLRAGFDPLY GGFSEAPKFP SPHQLLFLLR YWKRYDEAGA LAMVEKTLQA
MYCGGIYDHI GFGFARYSTD RRWLVPHFEK MLYDNALLAL AYLETRQATG KAVYSHVARE
IFTWVLRDMT SPEGGFYSAL DADSEGEEGR FYLWTPDQVR EVLGAKEGEF FCRYFDITAG
GNFEGRSIPN LIGRGEALFA AGTSGNESND TAGDQRQPRE QGGRAGGISG GGGCAKGSPE
EDRLPGRGPT TLAGFGPATA ARLAAAREKL FAAREKRVHP HRDDKILTAW NGLMIAALAR
GAWVLDEPAY AAAAARAARF ILTHLRDAEG RLQARYREGQ AAFPAYLDDY AFLTWGLIEL
YQATFETGYL REALALTRQM QELFRDEGGG YFFTPHGAGE LPVRPREVYD GAIPSGNSVA
ALNLLRLARI TGDSRLEEEA AAQVRALAGT VAEYPRGYSF YLCALDFYLG PVTEIVLAGE
RETEDTRALL RVLRAAYLPS AVLVLRPGGR EGEEVTRLIP YTAGQKPVNG KATLYLCRNF
ACRAPVTTAG ELEQWLASAG REAHGAGDTF NE