Gene Moth_0444 details


Gene Information

Locus tag: Moth_0444
Symbol:
ID: 3830968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 444987
End bp: 446012
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 65%
IMG OID: 637828379
Product: geranylgeranyl reductase
Protein accession: YP_429318
Protein GI: 83589309
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID: [TIGR02032] geranylgeranyl reductase family
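
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with it) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 444987
END_BP = 446012
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```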


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTAAAT ACGATGCCGT CGTCGCCGGA GCCGGGCCGG CGGGGAGTAC GGCAGCCAGG 
GTAGTTGCCG CCGCCGGTGC CAGGGTACTG TTGATAGAAA AGCGGGCCCG GGTTGGTTAC
CCCGTCCAGT GTGCCGAATA CGTCCCGGCC CTGATTGCAA GTGAAGTTGA TTTTGGGGAA
AAAAGCATTG CCCTGGCTGT CGGCACCCTG GTAACTTTCT TTCCCGACGG TACCGTGACC
TCCACCCCCG CCCCGGGCTA TATCCTTAAC CGGGAGGTCT TTGATGCCTC CCTGGCTGAA
GGGGCCGTAA AGGCCGGGGC GGAGCTCTGG CTAAAGGCCA CGGTAGAAGA CCTGACTGAT
ACCAGCCTGA TCATCCGGCA AGCAAACGGC CGGCGGCAGG AAGTGGAGGC GGGGGTCATC
ATCGGCGCCG ACGGTCCCCT CTCCCTGGTG GCCCGTACCC GGGGCTGGCC GCGGGCTACC
CTGGCTGCGG CGGTCCAGGT GGAAATGGCC CTGCCGGAAC CCATGCAGGT TACCCGGGTC
TATTTCGACC CCCTTTACCG GGGCGGTTAC GCCTGGGTCT TTCCCAAGGG GAAAACGGCC
AACGTCGGGG TGGGACTGGT GCCGGGAGAA ATCACCCCGG CTGCAGCCCT GACCCATTTT
CTCAGACGCC TGGGCTGGCA GCAACAAAAT ATTGTCCGGC GCACCGGGGG ACTTATTCCC
GTCGCTGGCC CTTACCAGGA AGTCCATCGG GGCCGGGTGC TCCTCTGCGG CGATGCCGGC
GGCTTCACCC ATCCGGTTAC GGGAGCCGGC ATCCTGACGG CCATCCTGAG TGGCCGGCTG
GCCGGGGAAG CAGCCGCCGC CTATCTCGGC TCGGGAGCAC CCCTGGCGAC CTATGAAGAG
AGCTGGCGGG ACCTCCTTGG TCCTGCCCTG GCCCGGGGCC AGGCCGGCCG CCGCCGCTGG
CAGGAAGAAT GGGCCCGGGA CGGAGCCGCT TTAAGCAAGC TCCTGCGCCA GACGTGGATG
GTTTAA
 
Protein sequence
MLKYDAVVAG AGPAGSTAAR VVAAAGARVL LIEKRARVGY PVQCAEYVPA LIASEVDFGE 
KSIALAVGTL VTFFPDGTVT STPAPGYILN REVFDASLAE GAVKAGAELW LKATVEDLTD
TSLIIRQANG RRQEVEAGVI IGADGPLSLV ARTRGWPRAT LAAAVQVEMA LPEPMQVTRV
YFDPLYRGGY AWVFPKGKTA NVGVGLVPGE ITPAAALTHF LRRLGWQQQN IVRRTGGLIP
VAGPYQEVHR GRVLLCGDAG GFTHPVTGAG ILTAILSGRL AGEAAAAYLG SGAPLATYEE
SWRDLLGPAL ARGQAGRRRW QEEWARDGAA LSKLLRQTWM V
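
The relationship between the gene sequence and the protein sequence can be illustrated by translating the nucleotides codon by codon. The following is a minimal sketch, assuming the record's blocks of ten characters are joined by removing whitespace, and using the amino acid assignments of NCBI translation table 11 (identical to the standard code for elongation codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first line of the gene sequence is embedded to keep the example short.

```python
# Codon table built from the NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as shown in this record.
FIRST_LINE = "ATGCTTAAAT ACGATGCCGT CGTCGCCGGA GCCGGGCCGG CGGGGAGTAC GGCAGCCAGG"

# Matches the first 20 residues of the protein sequence above.
assert translate(FIRST_LINE) == "MLKYDAVVAGAGPAGSTAAR"
```

Passing the complete gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of the record.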