Gene Moth_0527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0527 
Symbol 
ID3831766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp546155 
End bp547495 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content53% 
IMG OID637828468 
Producttrigger factor 
Protein accessionYP_429400 
Protein GI83589391 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0544] FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) 
TIGRFAM ID[TIGR00115] trigger factor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000228546 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.800279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTG CCGTAGAAAA ACTGGATAAA AACCAGGTAC TCCTGGAAGT AGAAGTTGAA 
GCCCCCAGAG TCCAGAAGGC TATAGACCAG GCATATCGTC GACTGGTAAA GCAGGTCAAT
ATTCCAGGAT TCAGGAAGGG CAAAGCCCCT CGCTTTATCC TGGAGCAATA TATTGGCAAG
GAGCCTATTT ATAACGAGGC AGCGGAGATT GTTATTCCTC CGGCTTATGA AGAAGCTGTG
GCCGAACACC AGCTGGAACC CATCGACCGG CCGGAGGTTG AGATAGTTAA AGTAGAAGAT
GGAGAACCCC TGGTTTTTAA AGCCAGGGTG GAGGTTAAAC CGGAAGTCCA GCTTGGTCCC
TACACCGGCC TGGAGGTGGA ACGCCAGGAA GTAGAGGTAA CAGAAGCCGA TATCGATGCT
TACCTGAAGC GGCTGCAGGA GCGTTATGCC GAGCTGGAGG TAATCGAGGA CGAACCGGCA
GCAGCCGGTG ATATTGTTAC CATTGACTTC AAGGGCACCG TGGACGGCCA GCCTTACCCG
GGAATGGAAG GGAACAACTA CCCCCTGGAG CTGGGTTCAG GCACCTTTAT AACCGGTTTT
GAGGAGCAAC TGGTTGGTGC CAGGGTAAAT GAAGAACGCA CCGTTAATGT CACTTTCCCT
TCCGACTACC ACGAAAAGGA TCTGGCCGGT AAAGAAGCCG TTTTCCAGGT CACCGTGCGG
GGGATAAAAC GTAAGAAACT GGCTCCCCTT GATGATGAGT TTGCCAAAGA TGTGAGCGAG
TGCGAGACCC TGGCGGACCT GCGCCAGGAT ATCCGCCGGC GCCTGGAGGA AAGCCAGAAA
CAGCGAGTTG AAGCCGCAGT TCGGCAGGCC GTGGTGGAAA AGGCTGTCGC TGCGGCCACG
GTGGAACTCC CGGAAGTTAT GGTGGAACGG CGTATCGATG CCCGGATCCG GGAACTTGAG
CGCAACCTAC AGGCCCAGAA GATGACCCTG GAGGAGTTCT TAAAGAACAC TGACAAGACT
ATTGGTGATT TAGAAAAAGA ATTCCGGCCC GGGGCCGAAA GGGACGTAAA AACGGAACTG
GTTCTGGAAG CTATTGCCAA AGCTGAAAAT ATCCAGCCCA GCCAGGAAGA GATTGATGCC
GAAATCGAGA GGATGGCCAG GATTTTCCGG CAAGATCCGG ATACTGTACG GAAGAATTTG
GGCGACTTGT CCGTCCTGAA ATACGATATA ATGATTAAAA AGACCATAGA TTTTCTGGTA
GAGCACAGCA AGCCCGTACC GCCCCGGGAA CAGGGCGCGG CAGGGGAAAC AGCAGAAACA
GCGGAAGCTA CGCCGGCTTA A
 
Protein sequence
MKAAVEKLDK NQVLLEVEVE APRVQKAIDQ AYRRLVKQVN IPGFRKGKAP RFILEQYIGK 
EPIYNEAAEI VIPPAYEEAV AEHQLEPIDR PEVEIVKVED GEPLVFKARV EVKPEVQLGP
YTGLEVERQE VEVTEADIDA YLKRLQERYA ELEVIEDEPA AAGDIVTIDF KGTVDGQPYP
GMEGNNYPLE LGSGTFITGF EEQLVGARVN EERTVNVTFP SDYHEKDLAG KEAVFQVTVR
GIKRKKLAPL DDEFAKDVSE CETLADLRQD IRRRLEESQK QRVEAAVRQA VVEKAVAAAT
VELPEVMVER RIDARIRELE RNLQAQKMTL EEFLKNTDKT IGDLEKEFRP GAERDVKTEL
VLEAIAKAEN IQPSQEEIDA EIERMARIFR QDPDTVRKNL GDLSVLKYDI MIKKTIDFLV
EHSKPVPPRE QGAAGETAET AEATPA