Gene Moth_2255 details

Gene Information

Locus tag: Moth_2255
Symbol:
ID: 3830750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2359255
End bp: 2360808
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 60%
IMG OID: 637830175
Product: 2-isopropylmalate synthase
Protein accession: YP_431085
Protein GI: 83591076
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00973] 2-isopropylmalate synthase, bacterial type
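
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, but both are consistent with its numbers. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2359255
end_bp = 2360808

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1554
print(protein_length)  # 517
```

Both results agree with the Gene Length and Protein Length fields of the record.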


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCAGATA CCGGCAAGCG GATTTATATT TTCGACACGA CACTGCGTGA CGGCGAACAG 
TCGCCGGGGG TAAGCCTCAA TATCCAGGAA AAGCTGGAGA TTGCCCGCCA GCTGGCCCGC
CTGGGGGTCG ATGTTATTGA AGCCGGCTTT CCCATCGCCT CCCCCGGGGA CTTTGAGGCC
GTCCGGGCCG TGGCTCGAGA GGTGGAGGGA CCGGTCATCG CCGGCCTGGC CCGGATTAAC
GAACGGGATA TCGACCGCGC CTGGGAGGCC CTCCAGGAGG CCCAAAAGCC GCGGATCCAC
GTCTTTATTG CCACCTCCGA TATCCACCTG AAATACAAAC TGCGGATGAG CCGGCAGGAG
GTCCTGGAGG CTGTCCGGCG GGGAGTGGCC CGGGCCAAGG GTTACTGCCA GGATATTGAG
TTTTCTCCGG AGGATGCTGT CCGGAGCGAC CTGGACTTCC TCTGCCAGGT CCTGGCCACG
GCCATTGAGG CCGGGGCTAC CGTTCTTAAT ATCCCGGATA CCGTGGGCTA CGCTACCCCG
GAGGAGTTCG GCCGTCTGAT CAGCCAGATC CGGCAGCGGG TGCCGGGTAT TGATAAAGTC
CGGATCAGCG TTCACTGCCA TAACGATCTG GGCCTGGCCG TGGCCAACTC CCTGGCGGCC
ATTGAGAACG GCGCCCTGCA GGTCGAGGGG GCCATCAATG GCATTGGCGA AAGGGCTGGA
AATGCCGCCC TGGAGGAAGT GATTATGGCC CTTTATACCC GGCGGGATTA CTACGGCTGC
CGGACGGGCA TCGTCACCGA GGAGATCTAC CGCACCAGCA AACTGGTCAG TAGCTTGACG
GGTATGCCCG TCCAGTACAA TAAAGCCATT GTCGGCAAGA ACGCCTTCAC CCATGAGGCC
GGCATTCACC AGGACGGGGT CTTAAAAGAG CGGACTACCT ACGAAATCAT GAATCCGGCC
ATGATCGGCC TGGTCCAGAA TAATATTTTC CTGGGCAAGC ACTCCGGCCG CCACGCCCTG
CGTAGCCGCC TGGAGGAACT GGGCTTTAAG CTCACGGAGG CCGAGCTGGA TAAGGCCTTC
GCCCGCTTCA AAGAACTGGC CGACCGCAAG AAAGAAATCA GCGACCGCGA CCTGGAGGCC
ATTGTCGAGC ACGAAGTCAA GCGCATCCCG GAGAAGTTTG TTCTAGAGCA TATTCATATC
TCCACCGGCA ACCGGGTTGT ACCCACAGCC ACCATCGGTA TCCGGGTGGG TGAAGAATTA
AAGGAAGAGG CCGCCTGCGG TGAAGGCCCG GTAGACGCCG CCTTCAAGGC CATAGATAAG
CTGACCCGCA TCCCGGTTTG CCTCAAATCT TACAATCTCA ATGCCGTCAC CGGCGGCAAG
GACGCCGTCG GTGAGGTAAC GGTAAAGATT GAGTACGATG GGCGGGTCTT CATAGGTCGC
GGTATCAGCA CCGATGTCCT GGAAGCCAGT GCCCGCGCTT ACTTGAACGC CATCAATAAG
GTGGTTTACG AGGTAGGGGA AGAAAACCTG CAGCAGGCCA GTACGGCCCA GTAG
 
Protein sequence
MADTGKRIYI FDTTLRDGEQ SPGVSLNIQE KLEIARQLAR LGVDVIEAGF PIASPGDFEA 
VRAVAREVEG PVIAGLARIN ERDIDRAWEA LQEAQKPRIH VFIATSDIHL KYKLRMSRQE
VLEAVRRGVA RAKGYCQDIE FSPEDAVRSD LDFLCQVLAT AIEAGATVLN IPDTVGYATP
EEFGRLISQI RQRVPGIDKV RISVHCHNDL GLAVANSLAA IENGALQVEG AINGIGERAG
NAALEEVIMA LYTRRDYYGC RTGIVTEEIY RTSKLVSSLT GMPVQYNKAI VGKNAFTHEA
GIHQDGVLKE RTTYEIMNPA MIGLVQNNIF LGKHSGRHAL RSRLEELGFK LTEAELDKAF
ARFKELADRK KEISDRDLEA IVEHEVKRIP EKFVLEHIHI STGNRVVPTA TIGIRVGEEL
KEEAACGEGP VDAAFKAIDK LTRIPVCLKS YNLNAVTGGK DAVGEVTVKI EYDGRVFIGR
GISTDVLEAS ARAYLNAINK VVYEVGEENL QQASTAQ
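
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. Under translation table 11, TTG is an accepted alternative start codon and is read as methionine when it initiates translation. The following is a minimal sketch of that relationship, not the database's own procedure: `translate_cds` is a hypothetical helper, and it simply assumes the first codon of the input is the initiator. The amino-acid assignments are the standard ones, which table 11 shares.

```python
# Standard amino-acid assignments in NCBI "TCAG" codon order; translation
# table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS; the first codon is read as Met, and translation
    stops at the first stop codon (hypothetical helper, see assumptions)."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is the initiator, so it becomes Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "TTGGCAGATA CCGGCAAGCG GATTTATATT TTCGACACGA CACTGCGTGA CGGCGAACAG"
print(translate_cds(first_line))  # MADTGKRIYIFDTTLRDGEQ
```

The output matches the first twenty residues of the protein sequence listed above.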