Gene Athe_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2101 
Symbol 
ID7408810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2226557 
End bp2227846 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content39% 
IMG OID643716467 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_002573950 
Protein GI222530068 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02083] 3-isopropylmalate dehydratase, large subunit
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0493977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAC CGATGACAAT GTCACAAAAG ATTTTGGCAT ACCATGCAGG AAAAGAATAT 
GTTGAACCTG GAGACTTGAT TTTTGCAAAT GTTGACCTTG TTTTGGGGAA TGACGTTACA
ACACCTGTTG CAATAAAGGA GTTTGAAAAG ATAGGGATTG ACAGGGTTTT TGACAAAGAT
AAAATTGCGA TAGTTCCCGA CCATTTTACT CCAAACAAAG ACATAAAGTC TGCTCAGCAG
TGCAAGATGG TTCGAGAGTT TGCTAAAAAG TATGAGATTA CAAATTATTT TGAAGTTGGC
GAGATGGGTA TTGAACATGC ACTCTTGCCA GAAAAAGGAC TTGTTGTGCC GGGTGATTTG
GTAATTGGTG CGGATTCGCA TACTTGCACA TATGGTGCAC TTGGTGCTTT TTCAACAGGA
ATTGGTTCTA CTGACATGGC ATGTGCAATG GCAACAGGAA AGTGCTGGTT CAAAGTTCCA
GAGGCCATTA AATTTATTCT CTACGGCAAA AAAACTGGCT GGACATCGGG AAAAGATATC
ATCCTTCACA TTATTGGTAT GATAGGTGTT GATGGTGCAC TTTACAAGTC AATGGAATAC
ACGGGAGAAG GTTTAAAATC ACTTTCAATG GATGACAGGT TCACCATTGC TAACATGGCA
ATTGAAGCAG GTGCGAAAAA TGGCATATTT GAGGTTGATG AAAAGACAAT AGAGTATGTA
AAACAGCACT CTACAAAGCC TTATAAGATA TTCAAGGCAG ACGAGGATGC AGAGTATTCA
CAGGTCTTTG AGATTGATAT TTCAAAAATT AGACCCACAG TTGCCTTTCC ACATCTTCCA
GAGAATACAA AGACGATTGA TGAGATAACA GAAAAGATTT ATATTGACCA GGTTGTGATT
GGTTCTTGCA CAAATGGCAG AATTGAAGAC TTAAGGATTG CAGCAAAGAT CTTAAAAGGA
AGAAAGGTTA AAAAAGGGCT CAGATGTATT ATATTCCCTG CAACACAGAA TATATACAAA
CAGGCATTAA AAGAGGGATT CATTGAGATA TTCATAGACG CTGGATGTGT TGTTTCAACA
CCAACTTGTG GTCCATGCCT TGGTGGACAC ATGGGAATTT TAGCAGATGG TGAGAAGGCT
CTTGCTACAA CAAATAGGAA CTTTGTTGGC AGAATGGGTC ATCCAAATAG TGAGGTTTAT
CTTTCATCGC CTGCAATTGC AGCAGCATCA GCAGTTTTAG GTTACATTGG CTCACCTGAA
GAGCTTGGAA TGAAAGGAGA TGAAGAATAG
 
Protein sequence
MTKPMTMSQK ILAYHAGKEY VEPGDLIFAN VDLVLGNDVT TPVAIKEFEK IGIDRVFDKD 
KIAIVPDHFT PNKDIKSAQQ CKMVREFAKK YEITNYFEVG EMGIEHALLP EKGLVVPGDL
VIGADSHTCT YGALGAFSTG IGSTDMACAM ATGKCWFKVP EAIKFILYGK KTGWTSGKDI
ILHIIGMIGV DGALYKSMEY TGEGLKSLSM DDRFTIANMA IEAGAKNGIF EVDEKTIEYV
KQHSTKPYKI FKADEDAEYS QVFEIDISKI RPTVAFPHLP ENTKTIDEIT EKIYIDQVVI
GSCTNGRIED LRIAAKILKG RKVKKGLRCI IFPATQNIYK QALKEGFIEI FIDAGCVVST
PTCGPCLGGH MGILADGEKA LATTNRNFVG RMGHPNSEVY LSSPAIAAAS AVLGYIGSPE
ELGMKGDEE