Gene Moth_0993 details

Gene Information

Locus tag: Moth_0993
Symbol:
ID: 3830869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1020920
End bp: 1021930
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 59%
IMG OID: 637828922
Product: isocitrate dehydrogenase (NADP)
Protein accession: YP_429851
Protein GI: 83589842
COG category: [C] Energy production and conversion
              [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00175] isocitrate dehydrogenase, NAD-dependent, mitochondrial type
            [TIGR00183] isocitrate dehydrogenase, NADP-dependent, prokaryotic type
            [TIGR02088] isopropylmalate/isohomocitrate dehydrogenases
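
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1020920
end_bp = 1021930
gene_length_bp = 1011
protein_length_aa = 336

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be
# the stop codon, which does not contribute a residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.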


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000014677
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0984652
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGTTGAAAC ATGTAGTTAC TCTGATTCCC GGGGATGGTA CCGGCCCGGA GTTAATCGCC 
GCCGCCAGGC GGGTCCTGGA AGCCAGCGGT GCGGAACTGG AATGGGAGGT TATGGCAGCC
GGGGAGGGCG CCCAGGAGAA ATACGGCAGC GTGTTGCCAG AAGAGACCCT GGCTTCAATC
CGTAAAAATG GCGTCGCCCT TAAAGGCCCT ATCACCACCC CGGTGGGCAC CGGCTTCCGG
AGCGTCAATG TGGCCCTGCG GAAAGAGCTG GATCTCTATG CCAATGTCCG GCCCTTCCGC
AACTTGCCCA ATGTCCCCTC ACGCTATCAG GGTGTTGACC TGGTGATCTA CCGGGAGAAC
ACCGAGGACC TCTATGCCGG GGTTGAACAT ATGGTGGGTG AAGATGCGGC TGAAAGCATT
AAGATTATTA CCAGGAAGGG CTCCGAACGT ATCGCCCGGG CAGCCTTTGA ATACGCCCGG
CGCCAGGGCC GGAAACGGGT GACAGCCGGC CACAAGGCCA ATATTATGAA GTTCAGCGAC
GGTCTTTTCC TGCGGACCTT CTACGACGTA GCCAGGGATT ATCCGGAAAT AACGGCTGAT
GACCGTATTG TGGACAACCT GAGCATGCAG CTGGTCCAGA AGCCGGAGCA ATATGATGTC
CTGGTACTGC CCAACCTTTA CGGCGATATC CTCTCCGACC TCTGCGCCGG CCTGGTGGGC
GGCCTGGGAG TGGCCCCTGG AGCCAATATC GGGGAGAAGG CAGCCGTCTT TGAACCAATC
CACGGCAGCG CACCCAAGTA TGCCGGCCAG AATAAGGTAA ATCCCCTGGC CACTATCCTC
TCCGGGGTTA TGATGCTGGA ACACCTGGGC GAGAAGGAAG CAGCAGCCAG GATCCAGCGC
GCTATCCTGG CGGTCCTGGC AGAAGGCAAG TACTTGACCT ACGATCTGGG CGGCAGTGCC
GGTACGAGCG ATATGGCCGA CGCCATCGTC AGGCGACTGG AAGTAGAATA A
 
Protein sequence
MLKHVVTLIP GDGTGPELIA AARRVLEASG AELEWEVMAA GEGAQEKYGS VLPEETLASI 
RKNGVALKGP ITTPVGTGFR SVNVALRKEL DLYANVRPFR NLPNVPSRYQ GVDLVIYREN
TEDLYAGVEH MVGEDAAESI KIITRKGSER IARAAFEYAR RQGRKRVTAG HKANIMKFSD
GLFLRTFYDV ARDYPEITAD DRIVDNLSMQ LVQKPEQYDV LVLPNLYGDI LSDLCAGLVG
GLGVAPGANI GEKAAVFEPI HGSAPKYAGQ NKVNPLATIL SGVMMLEHLG EKEAAARIQR
AILAVLAEGK YLTYDLGGSA GTSDMADAIV RRLEVE
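
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which TTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The sketch below is a minimal, hand-rolled translator meant only to illustrate that rule on the first 30 nucleotides of the gene sequence. The function name `translate_cds` and the constants are hypothetical, and the start codon set is the one listed for NCBI table 11.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino acid
# assignments for table 11 are the same as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, reading a table 11 start codon as M."""
    # Drop the spaces and line breaks used to group the sequence in tens.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # The initiator codon always encodes methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            # Stop codon: end of the protein.
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence shown above.
first_30_nt = "TTGTTGAAAC ATGTAGTTAC TCTGATTCCC"
print(translate_cds(first_30_nt))  # prints MLKHVVTLIP
```

The output matches the first ten residues of the protein sequence. For real work, an established library such as Biopython is a better choice than a hand-written table.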