Gene Athe_0504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0504 
Symbol 
ID7408628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp573650 
End bp574651 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content39% 
IMG OID643714886 
Productketol-acid reductoisomerase 
Protein accessionYP_002572403 
Protein GI222528521 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0059] Ketol-acid reductoisomerase 
TIGRFAM ID[TIGR00465] ketol-acid reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.450267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGA TATTCTATGA TAGTGATTGC AATTTAGATT TGCTTAAAGA CAAAACGGTT 
GCAGTAATTG GTTTTGGCAG CCAAGGTCAT GCACATGCAC TGAACTTGAG AGATTCTGGT
ATAAATGTTG TTGTTGGACT TTATCATGGA AGCAAGTCTT GGGCAAAAGC AGAAAGTCAT
GGTCTTAAAG TTATGACAGC TGATGAGGCA ACAAAAGTTG CAGATGTTAT TATGATTCTT
GTAAATGATG AAAAACAGCC AAAGCTATTT AAAGAGAGTA TAGAACCTAA CTTAAAAGAA
GGAAAGGCAA TAGCATTTGC GCACGGGTTT AACATTCACT TTGGTCAGAT AGTTCCACCA
CCATACGTTG ATGTTATTAT GATAGCTCCA AAAGGACCAG GGCACACAGT CAGAAGCCAG
TATGAGGAAG GAAAAGGTGT ACCAGCTTTA GTCGCTGTAC ATCAGGACTA TACAGGAAAA
GCCTTAGATG TTGCCTTAGC TTATGCAAAA GGTATTGGTG CATCAAGGGC AGGGATAATT
CTTACAACAT TTAAAGAGGA GACAGAGACA GACCTTTTTG GTGAACAGGC AGTTTTGTGT
GGCGGTCTTA CAGAGCTTAT CAAAGCCGGG TTTGATACAC TGGTTGAAGC AGGGTATCAG
CCAGAAATTG CGTATTTTGA GTGTTTGCAT GAGATGAAGC TTATAGTTGA TTTGATTTGG
CAGGGTGGAC TTTCACTTAT GCGCTATTCA ATTTCAGACA CAGCTGAGTA TGGCGACTAC
ATGACAGGTA AGAGAATTAT AACAGAAGAG ACAAGAAAAG AGATGAAAAA AGTATTAGAG
GAGATTCAAA ATGGTACATT TGCGAAGAAG TGGATTTTGG AGAACATGGC AGGAAGACCT
GAGTTCAATA GCATAAGAAA AAGAGAACAA AATCTTTTGA TTGAACAGGT TGGTAAAGAG
CTCAGAAAGA TGATGCCTTG GATAAAGCCA ATAAAAGAAT AA
 
Protein sequence
MAKIFYDSDC NLDLLKDKTV AVIGFGSQGH AHALNLRDSG INVVVGLYHG SKSWAKAESH 
GLKVMTADEA TKVADVIMIL VNDEKQPKLF KESIEPNLKE GKAIAFAHGF NIHFGQIVPP
PYVDVIMIAP KGPGHTVRSQ YEEGKGVPAL VAVHQDYTGK ALDVALAYAK GIGASRAGII
LTTFKEETET DLFGEQAVLC GGLTELIKAG FDTLVEAGYQ PEIAYFECLH EMKLIVDLIW
QGGLSLMRYS ISDTAEYGDY MTGKRIITEE TRKEMKKVLE EIQNGTFAKK WILENMAGRP
EFNSIRKREQ NLLIEQVGKE LRKMMPWIKP IKE