Gene Moth_0147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0147 
Symbol 
ID3832377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp141504 
End bp142508 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content63% 
IMG OID637828080 
Productdihydrouridine synthase TIM-barrel protein nifR3 
Protein accessionYP_429028 
Protein GI83589019 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAA CACTTAAAAT CGGCCCGGTC ACCCTGGCCG CCCCCCTGAT TATGGCTCCC 
ATGGCCGGTT ATACGGACCG CGTTTTCCGC CTCCTGGCCC GGGAGGCTGG GGCGGCCCTG
ACTTATACAG AAATGATCAG CGCCCAGGGA CTTATTTATA ACAACAAAAA CACCCATGCC
CTCCTAGACC TGAAGGGGGA ACCGGGTCCG GTGGCGGTCC AGCTCTTCGG CCGGGAACCG
GAGATCATGG CCGCAGCGAC CCGCATCGCC GTAGCCGCCG GTGCTGCTAT CATTGACCTG
AATATGGGCT GCCCCACCCC CAAGATCGTC AAAAATGGCG AGGGTTCGGC CCTGATGCGG
GACCTGCCCC GGGCGGCGGC CATTGTTGCC GCCATGGTCC GGGCCGCCGG CCCGGTGCCG
GTAACGGTAA AAATGCGCCT GGGCTGGGAC GAGGATTCCA TCAATGTGGT GGAGGCGGCC
CGGGCGGTGG TTTATGCCGG CGCGGCGGCA GTGGCCATCC ATGGCCGCAC CAGGAGCCAG
TTTTACAGCG GCCGCGCCGA CTGGAGCTAT TTTCGCCGGG TCAAGGAGGC CGTGGATGTG
CCGGTAATCG GCAACGGCGA CGTCAGAACG GCCCGGGACG CTGTCACCAT GCTAGCGGAA
ACAGGGTGCG ACGGGGTCAT GGTGGGTCGG GGAGCAGTCG GTAACCCCTG GCTGTTGACG
GCCATCCGCG CCGTCCTGGA AGGCCGACCG GAACCGCCGC CAGTAGATGT CAGGACCAGG
ATGACCATGG CCTGCCGGCA CTTAAAGCTC CTGGTAGAAC TCAAAGGGGA GACTACCGCC
GTCAAAGAGA TGCGCAAGCA CCTGGCTTGT TACTTCCGCG GTTTGCCAGG GGCCGCCCGC
CTGCGGCAGC AAATCAATAC CCTCACCACT GCTGCCGAAG TTATCGCCGC TATCAAAGCC
TACCTGCGTG ACTACCCTTG CCAGGACTAT AACTTTTTGC TATAA
 
Protein sequence
MSATLKIGPV TLAAPLIMAP MAGYTDRVFR LLAREAGAAL TYTEMISAQG LIYNNKNTHA 
LLDLKGEPGP VAVQLFGREP EIMAAATRIA VAAGAAIIDL NMGCPTPKIV KNGEGSALMR
DLPRAAAIVA AMVRAAGPVP VTVKMRLGWD EDSINVVEAA RAVVYAGAAA VAIHGRTRSQ
FYSGRADWSY FRRVKEAVDV PVIGNGDVRT ARDAVTMLAE TGCDGVMVGR GAVGNPWLLT
AIRAVLEGRP EPPPVDVRTR MTMACRHLKL LVELKGETTA VKEMRKHLAC YFRGLPGAAR
LRQQINTLTT AAEVIAAIKA YLRDYPCQDY NFLL