Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0147 |
Symbol | |
ID | 3832377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 141504 |
End bp | 142508 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637828080 |
Product | dihydrouridine synthase TIM-barrel protein nifR3 |
Protein accession | YP_429028 |
Protein GI | 83589019 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0042] tRNA-dihydrouridine synthase |
TIGRFAM ID | [TIGR00737] putative TIM-barrel protein, nifR3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAA CACTTAAAAT CGGCCCGGTC ACCCTGGCCG CCCCCCTGAT TATGGCTCCC ATGGCCGGTT ATACGGACCG CGTTTTCCGC CTCCTGGCCC GGGAGGCTGG GGCGGCCCTG ACTTATACAG AAATGATCAG CGCCCAGGGA CTTATTTATA ACAACAAAAA CACCCATGCC CTCCTAGACC TGAAGGGGGA ACCGGGTCCG GTGGCGGTCC AGCTCTTCGG CCGGGAACCG GAGATCATGG CCGCAGCGAC CCGCATCGCC GTAGCCGCCG GTGCTGCTAT CATTGACCTG AATATGGGCT GCCCCACCCC CAAGATCGTC AAAAATGGCG AGGGTTCGGC CCTGATGCGG GACCTGCCCC GGGCGGCGGC CATTGTTGCC GCCATGGTCC GGGCCGCCGG CCCGGTGCCG GTAACGGTAA AAATGCGCCT GGGCTGGGAC GAGGATTCCA TCAATGTGGT GGAGGCGGCC CGGGCGGTGG TTTATGCCGG CGCGGCGGCA GTGGCCATCC ATGGCCGCAC CAGGAGCCAG TTTTACAGCG GCCGCGCCGA CTGGAGCTAT TTTCGCCGGG TCAAGGAGGC CGTGGATGTG CCGGTAATCG GCAACGGCGA CGTCAGAACG GCCCGGGACG CTGTCACCAT GCTAGCGGAA ACAGGGTGCG ACGGGGTCAT GGTGGGTCGG GGAGCAGTCG GTAACCCCTG GCTGTTGACG GCCATCCGCG CCGTCCTGGA AGGCCGACCG GAACCGCCGC CAGTAGATGT CAGGACCAGG ATGACCATGG CCTGCCGGCA CTTAAAGCTC CTGGTAGAAC TCAAAGGGGA GACTACCGCC GTCAAAGAGA TGCGCAAGCA CCTGGCTTGT TACTTCCGCG GTTTGCCAGG GGCCGCCCGC CTGCGGCAGC AAATCAATAC CCTCACCACT GCTGCCGAAG TTATCGCCGC TATCAAAGCC TACCTGCGTG ACTACCCTTG CCAGGACTAT AACTTTTTGC TATAA
|
Protein sequence | MSATLKIGPV TLAAPLIMAP MAGYTDRVFR LLAREAGAAL TYTEMISAQG LIYNNKNTHA LLDLKGEPGP VAVQLFGREP EIMAAATRIA VAAGAAIIDL NMGCPTPKIV KNGEGSALMR DLPRAAAIVA AMVRAAGPVP VTVKMRLGWD EDSINVVEAA RAVVYAGAAA VAIHGRTRSQ FYSGRADWSY FRRVKEAVDV PVIGNGDVRT ARDAVTMLAE TGCDGVMVGR GAVGNPWLLT AIRAVLEGRP EPPPVDVRTR MTMACRHLKL LVELKGETTA VKEMRKHLAC YFRGLPGAAR LRQQINTLTT AAEVIAAIKA YLRDYPCQDY NFLL
|
| |