Gene Moth_0981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0981 
Symbol 
ID3830857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1008293 
End bp1009339 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content54% 
IMG OID637828910 
Productrespiratory-chain NADH dehydrogenase, subunit 1 
Protein accessionYP_429839 
Protein GI83589830 
COG category[C] Energy production and conversion 
COG ID[COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTGG AAGATATCTT TACCGGCATA GCCGCTTACC TGCGGGGGTT CCTTGCCGGA 
GCACCGCCCT GGGTGGTGAC CCTTGCCATA GGGGTCTTAT ACCTGATCGG GGTACTGGCC
TTTATCTTCC TGAATGCCCT TTATTTAATC TATCTGGAAC GGAAGATAAG CGCTTATATG
CAACAGCGCA TCGGGCCCAA CCGTTTGGGG CCCCATGGCC TCCTGCAGTC AGTGGCCGAT
GCGGTTAAAC TCCTGGGGAA AGAGGATATC ATACCCCGGG GGGCTGACCG GTGGGTCTTT
ATCATCGCCC CGGTGCTAAT CTTTATCCCG GCGACGATGC TTTATGCCGT CATTCCCTTT
GGCAAAGGAA TGGTTCCCGC TGATTTAAAT ATTGGTGTCT TTTATTTCCT GGCGGTCGCT
TCAACTACAA CCATCGCCAT CTTGATGGGC GGCTGGGGTG CCAACAACAA ATATGCCCTG
CTGGGCAGCA TGCGTTGTGT AGCCCAGATG GTCAGTTACG AAATCCCCCT GACCTTTTCC
ATCCTGGGGG TAATAATGCT GGCCGGGTCC CTCCAGACCT CCCAGATCGT GGCCGCCCAG
GGGAAGATCT GGTATATCCT TCTTCAGCCC CTGGCCTTTA TTATCTACTT TATTGCTGCC
ACGGCCGAGG TCAACCGTGC TCCCTTTGAC CTGGTGGAAG GGGAACAGGA GATTATTGCC
GGACCTTATA CAGAATACAC CGGCATGCGT TACGCCCTCT TTTATCTTTC AGAGTATGCC
AACCTGGTCA GCGTTTCCGC CCTTGCGGTA ACCCTGTTCC TGGGCGGCTG GCAGGGGCCG
TGGTTGCCGT CATGGCTATG GTTTCTAATT AAGGTTTATA TTATGATTTT TATCTTCATG
TGGGTACGCT GGACCTTCCC TCGTATCCGT ATTGACCATC TGCTCAGCTT TAACTGGAAG
GTGCTCCTGC CCCTGTCCCT GGCCAATATC CTGGTGACCG GGGTGGGCAT TAAGATCTAC
CAGTTGTTAA CCCTGGGGAG GTGGTAG
 
Protein sequence
MTVEDIFTGI AAYLRGFLAG APPWVVTLAI GVLYLIGVLA FIFLNALYLI YLERKISAYM 
QQRIGPNRLG PHGLLQSVAD AVKLLGKEDI IPRGADRWVF IIAPVLIFIP ATMLYAVIPF
GKGMVPADLN IGVFYFLAVA STTTIAILMG GWGANNKYAL LGSMRCVAQM VSYEIPLTFS
ILGVIMLAGS LQTSQIVAAQ GKIWYILLQP LAFIIYFIAA TAEVNRAPFD LVEGEQEIIA
GPYTEYTGMR YALFYLSEYA NLVSVSALAV TLFLGGWQGP WLPSWLWFLI KVYIMIFIFM
WVRWTFPRIR IDHLLSFNWK VLLPLSLANI LVTGVGIKIY QLLTLGRW