Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0450 |
Symbol | |
ID | 3830878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 450976 |
End bp | 453144 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637828385 |
Product | formate dehydrogenase |
Protein accession | YP_429324 |
Protein GI | 83589315 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAGAAC AAAAAATAAC GCGGAGGACA TTCCTCAAGG GATCCCTGGC GGCCGGAGCT CTGGCAACCT TTGGCGGCAA GCTAATCCCC ATTGAACCTG CCAAAGCCGC GGCAGCAGGG CAGGCTGAGA CCAGGGTGGT TCCCACCCTT TGCGAAATGT GTGGTGTTAA ATGCGGCGTC CTGGCCCATG TTCGGGACGG CCGGGTCTGG CGCCTGACAG GGAACCCCAG AGACCCCCAG AGCGGCGGTC GCCTTTGCGC CCGGGGCAAC GCCGGTACCA AAACCCTCTA TGACCCCGAC CGCCTCAAGG GCCCGATGAA AAGGGTAGGG GAAGGCCAGT TCCAGCCCAT TAGCTGGGAG CAGGCTTTCC AGGAGATCGG CAGCAAATTA AAGGAACTGA AGGAGCAGTA CGGGCCCCAG TCCCTGGTCT GGCTCGCCCA CCCGGAACTG ATCTCACCCC TGGAAAAACA CTTCATGGCT GCCTTCGGCT CCCCCAATTA TACCGGTCAC GGCCCCACCT GTTACAGCAG CCGTAACGTG GCCTTCGAGC AGATGTACGG CGGCGTACCC GGGGTGGACT ACCGCAATGT CAGGTACTAT ATCGCCTTCG GCCGTAATCT CACCGGTGGG ATTAAGAACC CGGATGTGCA AAAGATCGTG GCCGCAAAGG CAGAAGGTGC CCACCTGGTG GCCGTGGACC CACGCCTGAG TGACTTTGCC TACTTTGCCG ACGAGTGGCT GCCCATACGG CCGGGAACCG ACCTGGCCAT GGTCCTGGCC ATGATTAATG TGCTTATCAA CGAAAACCTC TATGACGCCG CCTTTGTGGC CGCGTATACT ACAGGCTTTG AAGAGCTAAA AAAGGGGGTT AGCGGTTATA CCCCTGCCTG GGCGGCCGGG ATTACGGGCA TCGAGGCCGG GACCATCAGC CGTATCGCCC GGGAACTGGC GGCGGCCAAA CCGGCGGCAG CCGTTGACCC CGGCTGGCAC GCCGTCACCG GTTCCCAGTA CGGCAACAGC GTCCAGGCCG GCCGGGCCAT AGCTGCCCTC AATGCCCTCC TGGGTAACCT GGGTGCCAGG GGCGGCCTGT CCCTGCCGCC GACCATAAAG TTGGGCAGTC CCGCCGGGAT TATGGGTCCC AAGCCTCCGG CGGCAACGGC ACCCCGCTGG GACGGGGCGG GCAGCGAAAA ATGGCCCCTC AACAAAGATC ATGGTATGAT CCAGACTTTC CCCGAGAGGG TGAAACAGGA CCAGCCCTAT CCGGTTAAAG CGGTTATAAT CCAGCACTTA AACCCGGTGC GCTCCAGCAC CGATTCCCTC GCCTTCATCG AGGCCTTGAA GAAACTCGAC CTGGTGGTGG CCATTGACAT CCAGATGAAT GACACCGCCT ATTACGCCCA TTACATCCTC CCCGAGGCCA CCTACCTGGA GCGCTACGAC CCCCTGATGA CGGTGGGCAA TAAAGTTCTC CTGCGCCAGC CGGCCATCAA GCCACTCTTT GATAATAAAG GCGCCGAGGA GATTATCGCC GGCATCGGCA GGGCTGCGGG CTTAAGTGAG TATTTTAACT TTACCCTGGA GCAGTATAAC GACGCCCTGC TCGGCCCCCT GGGCCTCACC CAGGCCCAGC TGGCCCTGAC GGGAGTGGCC GAGGTGGAGG CGAGCAAACC GGATTACAGC AAGCTGAAAA CCCCCTCCGG GAAGATTGAG CTGGCCTGCC CGGCCTTCGT CAAGGCCGGC AGTACCCTGA CCCCGGCCTG GGAACCGCCC CTGGTTGAGC CCCGAGATGA TAGCTTCCGT TTAATTCAGG GCCACGTACC CATGCATACC CATACCACCA CTGACAATAA CAGTTACCTC CACGCCATCA TGCCGGAAAA CGAGCTCTGG ATCCATACCA GCCGTGCCGG TAAACTGGGC ATTAAGACCG GGGACCTGGT TGAGGTGGCT TCAAAGGTTG GTAAGGTCAG GGTAAAGGCC AGGGTGACGG AGGCCATCCA CCCGGAGGCC GTTTTCCTGG CCCACGGTTT TGGCTGCCGG GTACCCCTGC GGCACCTGGC TTATAACCGC GGCGCCAACG GCGGTGACCT GATACCCATT ATGACGGCGC CGGTCTCGGG GGCGGCGGCC CAGTGTGAGA CCCTGGTAAC GGTACGGAAG GCGGGATGA
|
Protein sequence | MLEQKITRRT FLKGSLAAGA LATFGGKLIP IEPAKAAAAG QAETRVVPTL CEMCGVKCGV LAHVRDGRVW RLTGNPRDPQ SGGRLCARGN AGTKTLYDPD RLKGPMKRVG EGQFQPISWE QAFQEIGSKL KELKEQYGPQ SLVWLAHPEL ISPLEKHFMA AFGSPNYTGH GPTCYSSRNV AFEQMYGGVP GVDYRNVRYY IAFGRNLTGG IKNPDVQKIV AAKAEGAHLV AVDPRLSDFA YFADEWLPIR PGTDLAMVLA MINVLINENL YDAAFVAAYT TGFEELKKGV SGYTPAWAAG ITGIEAGTIS RIARELAAAK PAAAVDPGWH AVTGSQYGNS VQAGRAIAAL NALLGNLGAR GGLSLPPTIK LGSPAGIMGP KPPAATAPRW DGAGSEKWPL NKDHGMIQTF PERVKQDQPY PVKAVIIQHL NPVRSSTDSL AFIEALKKLD LVVAIDIQMN DTAYYAHYIL PEATYLERYD PLMTVGNKVL LRQPAIKPLF DNKGAEEIIA GIGRAAGLSE YFNFTLEQYN DALLGPLGLT QAQLALTGVA EVEASKPDYS KLKTPSGKIE LACPAFVKAG STLTPAWEPP LVEPRDDSFR LIQGHVPMHT HTTTDNNSYL HAIMPENELW IHTSRAGKLG IKTGDLVEVA SKVGKVRVKA RVTEAIHPEA VFLAHGFGCR VPLRHLAYNR GANGGDLIPI MTAPVSGAAA QCETLVTVRK AG
|
| |