Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0038 |
Symbol | |
ID | 3830904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 37930 |
End bp | 39288 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637827970 |
Product | FAD linked oxidase-like |
Protein accession | YP_428920 |
Protein GI | 83588911 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.839008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000133751 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACGCG AAATTATCGC CAACCTGGAA AAAATCGTTG GCAGGGAGAA TTGCCAGGTG GAAGAGGCCG TCAGGCGGCA GCACGGTTTT AGCAAAACAG CCCTCCCGGA GGCCGTTCTT TACCCTCAGG ACAGCCAGCA GGTAGCCGCG ATATTAAAGG TGGCCGCCGC GGAGGGCATT CCGGTGGTAC CCTGGGGTGC CGGTACCATG GCCCGCAGGG GTTTACTACC CTTAAACGGG GGCCTGGCGA TTAACCTTAC CGCGATGAAC AAGATCCTCG AATATGATTA CGAAAATATG ACGGCCTTCG TAGAAGCGGG CGTCACCCTC AAGGATTTAC AGGCCACCCT GAAAACCCAC AACCTGTACT GGCCTGTGGA GCCGGTGGAT GGAGACACCT CAACAGTCGG CGGTTGCGTG GCCGCCGGCG CTTCCGGACC CAGCAAACTG GGCTACGGTG ACGCCAAATT TCACATCCTG GGACTCGAGG TGGTCCTGTC TACGGGAGAG ATTATCCGCA CCGGCGGCAA GACGGTGAAA AACGTCCAGG ACTATGATAA CACTCGCTTT ATAGCTGGCT CCTGGGGCAG CCTGGGGATT ATCACCAGGG TGATGCTGAA GTTAAGGCCG TTGCCGGAAA AGGAAATCAC CGTCTTCCTT TCCTTCAAGG AACTGGAAGC AGCCATTGAG GCGGCCCGGA TCATCAGGAG CGATACCCTG CCCACAGCCC TGGAACTCAT GGATGGCGTG GCCATGAAGA TCCTGGCCCG TGCCGGTTAC CGTCCAAACG GGGAAGGCCC TGGCATCCTG GCGAACTTTA ACGGTTTTAC CGAGCAGGTG GACGCCCAGG CGGACTACCT GCAAGGGAAG TTTAAAGGTA CCCTTATCCT GGAAGGAGAG GCTGCGGCCG GCGCCTGGCA GGCCCGGCGG CAGATCTGGC CGACCTTTGC CGGTGAGGGG GGAGCCATCC TGGCCAGCGC GGCGGTACCC TTCACCGCCT TGGGAGAGTT CCTCAAGGGG GCCAGGGCGG AACTGGACCG CAGCCGTAAA GGGGCGGCTA TGGTAGCCCA CTTTGGTAAC GGCCACATTC ATATTCTCCT GGACCAGGCC CCCGAAGCTT TTAACGGCGT CCGGGGGGTT GTCGACCAGC TGTCTGCCCG GGCGGAAAAC CTGGGCGGTT TCCTGGTAGT AGATAATATC GATGACCTCG AGTTTACCAG ACGCCGGGTT GAGGCCCGGG GAAGGGCTAT CTTCGAACTC CTGGGCCGGG TCAAGGCGGC CTTCGATCCG CGAGGGATCA TGGCCCCCAA CAGCAAGGTC CTGGCCTATG TATTAGTAGA TAATAGGGCA GCTTCTTGA
|
Protein sequence | MRREIIANLE KIVGRENCQV EEAVRRQHGF SKTALPEAVL YPQDSQQVAA ILKVAAAEGI PVVPWGAGTM ARRGLLPLNG GLAINLTAMN KILEYDYENM TAFVEAGVTL KDLQATLKTH NLYWPVEPVD GDTSTVGGCV AAGASGPSKL GYGDAKFHIL GLEVVLSTGE IIRTGGKTVK NVQDYDNTRF IAGSWGSLGI ITRVMLKLRP LPEKEITVFL SFKELEAAIE AARIIRSDTL PTALELMDGV AMKILARAGY RPNGEGPGIL ANFNGFTEQV DAQADYLQGK FKGTLILEGE AAAGAWQARR QIWPTFAGEG GAILASAAVP FTALGEFLKG ARAELDRSRK GAAMVAHFGN GHIHILLDQA PEAFNGVRGV VDQLSARAEN LGGFLVVDNI DDLEFTRRRV EARGRAIFEL LGRVKAAFDP RGIMAPNSKV LAYVLVDNRA AS
|
| |