Gene Moth_0150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0150 
Symbol 
ID3832380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp144256 
End bp145338 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content61% 
IMG OID637828083 
Productshikimate/quinate 5-dehydrogenase 
Protein accessionYP_429031 
Protein GI83589022 
COG category[R] General function prediction only 
COG ID[COG5322] Predicted dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000131066 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATAAAT TCGCCTTCAT GATCCACCCC CTGGACATCC ACGACGTTAC CCGTAAATTC 
CCTGTTGCCC GCCACCTGCC TCCGGCCCTG CTGGAGAAAG CGGTACGCTA TCTGCCGCCC
ATCAAGGCTT CCCATATTAC CGGCGTCCGC TCCGCCTACG CTGAAACCGA GGGGTGGTTT
GTGGCCTGCT CCCTGACAAG CCGCCAGATG CTCTCTCTGC CCCAGGATCT GGTCATCAAA
AAGCTAATCC GTACCGGTCG CTTGGCCGAA AAACTGGGAG CCGAGATCCT GGGCCTGGGG
GCCATGACCT CCGTAGTGGG CGATGCCGGC ATCACCATTG CCCGCCACCT GAACATAGCC
GTCACCACCG GCAACAGCTA CACGGTAGCC ACTGCCCTCG AAGCCACCGC CAAGGCTGCT
GCAATGATGG ACATCGACCT GACCCGGGCC GAGATAGCGA TTATGGGGGC CACGGGCTCC
ATTGGCGCCG TCTGTGCCCG GATTCTGGCC CGCAACTGCC GGCACCTGAC CCTGATTGCC
CGTAATGAAG AAAAACTGGC CCGCCTGGCG CATCAGATTA AAGAAGAAAC CGGTCTTAAG
GCCCGGGTAA CCAATCATTC CCGGGAAGCC CTGCGCCGGG CCGATGTCAT TATTACCGTC
ACCTCGGCGG TAGATACCGT GATTGAACCC GAGGACCTGA AACCAGGTGC CGTAGTTTGC
GACGTCGCCC GGCCCCGGGA TGTCTCGCGG CGGGTAGCCG AGGTGCGCGA CGACGTCCTG
GTTATCGACG GCGGTGTCGT CCAGGTCCCC GGGGATGTCG ACTTCCATTT TAACTTTGGC
TATCCCCCGG GCCTCTCCTA CGCCTGTATG GCCGAAACCA TGATCCTGGC CCTGGAGGGC
AGGATTGAAA ACTTTACCCT GGGCCGGGAG TTGACGGTAG AACAAATCGA CACCATTAAC
CGGCTGGCCG CCAAGCACGG CTTTCAGGTT GCCGGCTTCC GCAGCTTTGA ACTACCTGTT
TCCGAGGAGC AGGTGGCGGC CATCAGGGAG CGAGCACGGC AGCGGGCCGC CCTGGCCCGT
TAA
 
Protein sequence
MHKFAFMIHP LDIHDVTRKF PVARHLPPAL LEKAVRYLPP IKASHITGVR SAYAETEGWF 
VACSLTSRQM LSLPQDLVIK KLIRTGRLAE KLGAEILGLG AMTSVVGDAG ITIARHLNIA
VTTGNSYTVA TALEATAKAA AMMDIDLTRA EIAIMGATGS IGAVCARILA RNCRHLTLIA
RNEEKLARLA HQIKEETGLK ARVTNHSREA LRRADVIITV TSAVDTVIEP EDLKPGAVVC
DVARPRDVSR RVAEVRDDVL VIDGGVVQVP GDVDFHFNFG YPPGLSYACM AETMILALEG
RIENFTLGRE LTVEQIDTIN RLAAKHGFQV AGFRSFELPV SEEQVAAIRE RARQRAALAR