Gene Moth_1382 details


Gene Information

Locus tag: Moth_1382
Symbol:
ID: 3831629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1427048
End bp: 1429084
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 59%
IMG OID: 637829318
Product: molybdopterin oxidoreductase
Protein accession: YP_430238
Protein GI: 83590229
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
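
The start, end, and length fields above are mutually consistent, which the short sketch below checks. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record, and the function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the terminal stop codon (assumptions, not from the record).
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(1427048, 1429084))  # (2037, 678)
```

Under those assumptions the result matches the Gene Length (2037 bp) and Protein Length (678 aa) fields.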


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0219675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCAA AGTTTTATCG CCATATCTGC CCGCGGAATT GTTACAGCAC CTGTGGCCTT 
ATTTCCATGG TAGAAGGGGG AAAAATCAAG GAACTAGCCG GCGATCCGGC CCATGGTTAC
AGCCGGGGAC ACCTCTGCCG TTTCGGCTAC AGCTACCTGG ATATTTTCCA TCATCCAGCC
CGGGTACTTT ACCCTTTACG GCAGGAGCCC CGGGGTTCAG GCAACTGGCG CCGGATAGGC
TGGGACGAAG CCATGACCCT CATTGCCGGC AAAATGCTGG AGTTAAAGGG TCGTTGGGGT
TCCTTTTTGC CAGTCTTCTT TTACAGCAAT TCGGGTAACA TAGGTCTCCT GCACCAGGCA
TGGAACTGGC TGGCCCGCAG CCTGGGAGAG GTGACTGTAG CGTCAGGGTC CCTGTGCTGG
AGCGCCGGGC TGGACGCCAT GGTCTATGGT TACGGCGCCC ACAACCACCC CGACCCGGCA
GCCATGGCCC GGGCGGGTTA CCTCTTGCTC TGGGGGGCCA ACCCGGCCTG GACGGCAGTC
CACCAGATGG AATATATCTA CGAGGCCCGG GAAAGGGGTG CCCGCCTGGT TGTTATTGAT
CCTATTTTTA CAGCCACGGC GGCCCGGGCC GACTTCTATG TTCAGATAAA ACCCGGCAGC
GACGGCGCCC TGGCTTTAGG ATTGGCCCAC CAACTATGGC AGAAAGGGTT AGTAGATAAT
TACTACCTGG AAAATTATGT GCGGGGTTGG CCGGAGTGGC GGGAATATCT GGCTGGCCTG
GACCCCCGGG AGCTGGTGGC AGCTACTGGT GTACCGCTAG AGCTAATGGC CCGCCTGGCG
GAGGAATATG CTGCCGGCAA CCCTGCAGCC ATCTGGATAG GCATAGGCCT GCAGCGCCAC
ATCAACGGTG GCCAGAACAT CCGGGCCATC AACGCCCTGG CGGCCATGAC CGGAAACCTG
GGCCGTGAAG GCGGCGGTGT TTATTATGCC AGCCCGGTAG CAAGCGAGCT TTTTACTACC
TCCTGGCCGG AGTGGATGCG CCCTGCCGGC AGTGAGAGGC AAATACCAGT TTATGACCTG
GCCCGGGGCC TGGAGGAAGC CGACAGCCCG CCAGTAAAAA TGGCCCTTCT GGCAAACGCC
AATCCCCTGG CGCAAAACGC CACAACGCAA GAATTGCAGC GGGCCCTAGC GAATCTGGAT
CTGGTGGTCT TAAGTGGCCA GTTCCTGACC GAAACGGCCC GGGCGGCCGA CGTCTTCCTG
CCAGTGACGA CCTTTTTGGA AAGCTGGGAC GTTGTTCCCA GCTACTGGCA TCGCTGGATA
GGAATCAATG AACCGGCCGT TTCTCCGGGG GGAGAATGCC GTTCCGATAT CCAGGTGGTG
AGTGACCTGG CCAGGGTTCT AAATCAAATC AGCCCGGGAT GTTGTCCTTT TCCCTCCGGT
TGGACGGAAG AAGAGTGGCT GGAGCAGGTT TTTAACCCGG AGGTTTACCG GTTGCTGGGA
ATAAACCATT ACCGTGATCT CCTGGACGGT CCCAGGGAAT TAAAGCTACC GGCCAATCCC
TGGGCCGGGG GACGGTTCGC CACTCCTTCC GGGCGGTATG AAATCTATTC AGACCGGGCG
GCGGCAGCCG GCCTGCCATC ACTGCCGGTT TACCAGCCGG CAGCCGCGGG TACAGAGGCT
TACCCTTACC GTCTCCTGAC CCCCCACACC AGCGCCGGCC TGAACTCCCA GTTTTATAAT
CTTGGTGATG TACCGGAACC CCTGGCCCTG GTTAATCCAC GGCTGGCCCG GGAGCAGGGC
CTGCATAGCG GCAGCCAGGC CCGCCTTTAC AATGAGTGGG GCGAGATTAT AATCCCGGTG
GTCATAAGTG AACTGGTGCC ACCGCAAACT ATCCTCTGCC ACCAGCGTCC CCTGCCCGGA
GGCCAGGTTA TCAACGACCT TACCCCGCCC CTGGCCACTG ATATGGGAAC TATCACCAGC
AGCGGCCCTG GCTTGGCCTA TTATGACACC TTTGTCAATA TCGCCCCGGT GGCGTAA
 
Protein sequence
MAAKFYRHIC PRNCYSTCGL ISMVEGGKIK ELAGDPAHGY SRGHLCRFGY SYLDIFHHPA 
RVLYPLRQEP RGSGNWRRIG WDEAMTLIAG KMLELKGRWG SFLPVFFYSN SGNIGLLHQA
WNWLARSLGE VTVASGSLCW SAGLDAMVYG YGAHNHPDPA AMARAGYLLL WGANPAWTAV
HQMEYIYEAR ERGARLVVID PIFTATAARA DFYVQIKPGS DGALALGLAH QLWQKGLVDN
YYLENYVRGW PEWREYLAGL DPRELVAATG VPLELMARLA EEYAAGNPAA IWIGIGLQRH
INGGQNIRAI NALAAMTGNL GREGGGVYYA SPVASELFTT SWPEWMRPAG SERQIPVYDL
ARGLEEADSP PVKMALLANA NPLAQNATTQ ELQRALANLD LVVLSGQFLT ETARAADVFL
PVTTFLESWD VVPSYWHRWI GINEPAVSPG GECRSDIQVV SDLARVLNQI SPGCCPFPSG
WTEEEWLEQV FNPEVYRLLG INHYRDLLDG PRELKLPANP WAGGRFATPS GRYEIYSDRA
AAAGLPSLPV YQPAAAGTEA YPYRLLTPHT SAGLNSQFYN LGDVPEPLAL VNPRLAREQG
LHSGSQARLY NEWGEIIIPV VISELVPPQT ILCHQRPLPG GQVINDLTPP LATDMGTITS
SGPGLAYYDT FVNIAPVA
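
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and scored for GC content using only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the amino-acid assignments of NCBI translation table 11, which match the standard code; the alternative start codons of table 11 are not handled here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Amino-acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the gene sequence in this record.
print(translate("ATGGCGGCAA AGTTTTATCG CCATATCTGC"))  # MAAKFYRHIC
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.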