Gene Moth_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0450 
Symbol 
ID3830878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp450976 
End bp453144 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content61% 
IMG OID637828385 
Productformate dehydrogenase 
Protein accessionYP_429324 
Protein GI83589315 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGAAC AAAAAATAAC GCGGAGGACA TTCCTCAAGG GATCCCTGGC GGCCGGAGCT 
CTGGCAACCT TTGGCGGCAA GCTAATCCCC ATTGAACCTG CCAAAGCCGC GGCAGCAGGG
CAGGCTGAGA CCAGGGTGGT TCCCACCCTT TGCGAAATGT GTGGTGTTAA ATGCGGCGTC
CTGGCCCATG TTCGGGACGG CCGGGTCTGG CGCCTGACAG GGAACCCCAG AGACCCCCAG
AGCGGCGGTC GCCTTTGCGC CCGGGGCAAC GCCGGTACCA AAACCCTCTA TGACCCCGAC
CGCCTCAAGG GCCCGATGAA AAGGGTAGGG GAAGGCCAGT TCCAGCCCAT TAGCTGGGAG
CAGGCTTTCC AGGAGATCGG CAGCAAATTA AAGGAACTGA AGGAGCAGTA CGGGCCCCAG
TCCCTGGTCT GGCTCGCCCA CCCGGAACTG ATCTCACCCC TGGAAAAACA CTTCATGGCT
GCCTTCGGCT CCCCCAATTA TACCGGTCAC GGCCCCACCT GTTACAGCAG CCGTAACGTG
GCCTTCGAGC AGATGTACGG CGGCGTACCC GGGGTGGACT ACCGCAATGT CAGGTACTAT
ATCGCCTTCG GCCGTAATCT CACCGGTGGG ATTAAGAACC CGGATGTGCA AAAGATCGTG
GCCGCAAAGG CAGAAGGTGC CCACCTGGTG GCCGTGGACC CACGCCTGAG TGACTTTGCC
TACTTTGCCG ACGAGTGGCT GCCCATACGG CCGGGAACCG ACCTGGCCAT GGTCCTGGCC
ATGATTAATG TGCTTATCAA CGAAAACCTC TATGACGCCG CCTTTGTGGC CGCGTATACT
ACAGGCTTTG AAGAGCTAAA AAAGGGGGTT AGCGGTTATA CCCCTGCCTG GGCGGCCGGG
ATTACGGGCA TCGAGGCCGG GACCATCAGC CGTATCGCCC GGGAACTGGC GGCGGCCAAA
CCGGCGGCAG CCGTTGACCC CGGCTGGCAC GCCGTCACCG GTTCCCAGTA CGGCAACAGC
GTCCAGGCCG GCCGGGCCAT AGCTGCCCTC AATGCCCTCC TGGGTAACCT GGGTGCCAGG
GGCGGCCTGT CCCTGCCGCC GACCATAAAG TTGGGCAGTC CCGCCGGGAT TATGGGTCCC
AAGCCTCCGG CGGCAACGGC ACCCCGCTGG GACGGGGCGG GCAGCGAAAA ATGGCCCCTC
AACAAAGATC ATGGTATGAT CCAGACTTTC CCCGAGAGGG TGAAACAGGA CCAGCCCTAT
CCGGTTAAAG CGGTTATAAT CCAGCACTTA AACCCGGTGC GCTCCAGCAC CGATTCCCTC
GCCTTCATCG AGGCCTTGAA GAAACTCGAC CTGGTGGTGG CCATTGACAT CCAGATGAAT
GACACCGCCT ATTACGCCCA TTACATCCTC CCCGAGGCCA CCTACCTGGA GCGCTACGAC
CCCCTGATGA CGGTGGGCAA TAAAGTTCTC CTGCGCCAGC CGGCCATCAA GCCACTCTTT
GATAATAAAG GCGCCGAGGA GATTATCGCC GGCATCGGCA GGGCTGCGGG CTTAAGTGAG
TATTTTAACT TTACCCTGGA GCAGTATAAC GACGCCCTGC TCGGCCCCCT GGGCCTCACC
CAGGCCCAGC TGGCCCTGAC GGGAGTGGCC GAGGTGGAGG CGAGCAAACC GGATTACAGC
AAGCTGAAAA CCCCCTCCGG GAAGATTGAG CTGGCCTGCC CGGCCTTCGT CAAGGCCGGC
AGTACCCTGA CCCCGGCCTG GGAACCGCCC CTGGTTGAGC CCCGAGATGA TAGCTTCCGT
TTAATTCAGG GCCACGTACC CATGCATACC CATACCACCA CTGACAATAA CAGTTACCTC
CACGCCATCA TGCCGGAAAA CGAGCTCTGG ATCCATACCA GCCGTGCCGG TAAACTGGGC
ATTAAGACCG GGGACCTGGT TGAGGTGGCT TCAAAGGTTG GTAAGGTCAG GGTAAAGGCC
AGGGTGACGG AGGCCATCCA CCCGGAGGCC GTTTTCCTGG CCCACGGTTT TGGCTGCCGG
GTACCCCTGC GGCACCTGGC TTATAACCGC GGCGCCAACG GCGGTGACCT GATACCCATT
ATGACGGCGC CGGTCTCGGG GGCGGCGGCC CAGTGTGAGA CCCTGGTAAC GGTACGGAAG
GCGGGATGA
 
Protein sequence
MLEQKITRRT FLKGSLAAGA LATFGGKLIP IEPAKAAAAG QAETRVVPTL CEMCGVKCGV 
LAHVRDGRVW RLTGNPRDPQ SGGRLCARGN AGTKTLYDPD RLKGPMKRVG EGQFQPISWE
QAFQEIGSKL KELKEQYGPQ SLVWLAHPEL ISPLEKHFMA AFGSPNYTGH GPTCYSSRNV
AFEQMYGGVP GVDYRNVRYY IAFGRNLTGG IKNPDVQKIV AAKAEGAHLV AVDPRLSDFA
YFADEWLPIR PGTDLAMVLA MINVLINENL YDAAFVAAYT TGFEELKKGV SGYTPAWAAG
ITGIEAGTIS RIARELAAAK PAAAVDPGWH AVTGSQYGNS VQAGRAIAAL NALLGNLGAR
GGLSLPPTIK LGSPAGIMGP KPPAATAPRW DGAGSEKWPL NKDHGMIQTF PERVKQDQPY
PVKAVIIQHL NPVRSSTDSL AFIEALKKLD LVVAIDIQMN DTAYYAHYIL PEATYLERYD
PLMTVGNKVL LRQPAIKPLF DNKGAEEIIA GIGRAAGLSE YFNFTLEQYN DALLGPLGLT
QAQLALTGVA EVEASKPDYS KLKTPSGKIE LACPAFVKAG STLTPAWEPP LVEPRDDSFR
LIQGHVPMHT HTTTDNNSYL HAIMPENELW IHTSRAGKLG IKTGDLVEVA SKVGKVRVKA
RVTEAIHPEA VFLAHGFGCR VPLRHLAYNR GANGGDLIPI MTAPVSGAAA QCETLVTVRK
AG