Gene Cthe_2519 details


Gene Information

Locus tag: Cthe_2519
Symbol:
ID: 4809275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2987849
End bp: 2989468
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 44%
IMG OID: 640107935
Product: putative alpha-isopropylmalate/homocitrate synthase family transferase
Protein accession: YP_001038914
Protein GI: 125975004
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein
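
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2987849, 2989468)
print(length_bp)                  # 1620
print(protein_length(length_bp))  # 539
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of the record.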


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00968739
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGAATTG TAAAGCATTT TTTCAGGAAG GATGAAAAGA TGAAGAAGAT TTTGATATAT 
GACTCTACCT TAAGGGATGG TGCCCAGGCA CAGGGTATTT CGTTCAGTGT TGAAGATAAA
CTGAAAATTG TGGAGAGACT TGACCGGCTC GGTATAAGCT ATATTGAAGC AGGTAACCCG
GGCTCCAACC CAAAGGATTT GGAGTTTTTT GACAGAATAG GGCGGGTCAA GTTAAGGCAT
GCAAAAATAA TCGCCTTTGG AAGTACCAGA AGAGTCAATG TAAGTGTTCA GGAGGATGCC
AATGTAAAAT CGCTTTTAAA GGCGGACACT CCGGCTGTGG CCATATTCGG CAAAAGCTGG
GACTTTCATG TTACCGATAT TTTAAAGACA ACACTGGATG AAAATTTGAG AATGATTTTT
GACACCATAT CCTTCTTTAA GAACAAAAAT AAAGAAGTGG TTTTTGATGC TGAGCATTTC
TTTGACGGAT ACAAGGCCAA CCCGGATTAT GCAATGAAAA CCCTCAAAAC TGCTGTTGAG
GCCGGAGCGG ACTGTATTTG CCTTTGCGAT ACCAATGGAG GAACATTCCC GAATGAAATC
AAGGATATTA CCGCCAGGGT TGTGAGCGAG TTTAACGTGA ATGTCGGTAT TCATTGCCAC
AATGACACGG GCATGGCGGT TGCCAACTCC ATTATGGCGG TGCTGGCCGG TGCCGTGCAG
GTTCAGGGGA CAATGAACGG GTTTGGAGAG AGAAGCGGTA ATGCCAATCT CTGCACAATA
ATACCCAATT TGCAGCTTAA AGCAGGCTAT GATTGCATAC CGCAGGAAAA CATGGCGGAC
CTTACGGCTA CTGCAAGGTC CATAAGTGAA ATTGCCAATG TTATACATGA TGAAAGGGCT
CCGTATGTAG GGAAATATGC ATTTGCCCAC AAGGCGGGAA TGCATGCGGA TGCGGTAACC
AAAAACTCCA TAGCTTACGA ACACATCAAC CCTGAAGTTG TCGGAAACGA AAGGCTTTTT
CTCATGTCGG AAGTTGCGGG AAGAAGCGCT GTGCTTCATT TAATCAAAAA TATTGACAGC
ACTATTACAA AGGATTCTCC CGAGACAAAA TTGATACTGG ACAAGCTCAA GGAGCTGGAA
TTTGAAGGCT ATCAGTACGA AGGTGCGGAG AGTTCTTTTG AAATTGTGAT TCGGAAAATC
CTTGGAAAAT ACCGTCCTTC CTTTGAACTT GGAGAGTTTA AGGTTGTGGT TAACGAACCG
TCTATTAGCG GTGCGAATTC TTCCGCCATG ATAAAGATTA ATGTGGACGG ACAGTATGAG
ATAACCGCGG ATGAGGGACA GGGTCCGGTA AATGCGCTGG ACAAGGCGCT AAGAAAGGCT
TTGGAGAAAT TTTATCCTCA GATTGCGGAA ATGAAGCTTA CCGACTACAA AGTTAGGGTT
CTTGATTCCA ACTCGGCTAC GGCTGCAAAG GTAAGGGTTT TAATTGAGTC AACCGACGGT
AAAGAAGTCT GGACAACCAT TGGAGTTTCA ACGGACATTA TTGAAGCCAG CTGGAAGGCG
TTGGTGGATT CTATAGAATA CAAGCTTATC AAGGACAAGG AAGCAAAACA AAAGTCTTAA
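
The gene sequence above is printed in groups of 10 bases, so it has to be joined into one string before it can be analysed. The following is a minimal sketch of that step together with a GC-fraction calculation of the kind that could be compared against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, used here only as sample input.
first_row = "TTGGGAATTG TAAAGCATTT TTTCAGGAAG GATGAAAAGA TGAAGAAGAT TTTGATATAT"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

Passing the full 27-row block through `clean_sequence` should give a 1620-base string, and `gc_fraction` on that string can then be compared with the reported value.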
 
Protein sequence
MGIVKHFFRK DEKMKKILIY DSTLRDGAQA QGISFSVEDK LKIVERLDRL GISYIEAGNP 
GSNPKDLEFF DRIGRVKLRH AKIIAFGSTR RVNVSVQEDA NVKSLLKADT PAVAIFGKSW
DFHVTDILKT TLDENLRMIF DTISFFKNKN KEVVFDAEHF FDGYKANPDY AMKTLKTAVE
AGADCICLCD TNGGTFPNEI KDITARVVSE FNVNVGIHCH NDTGMAVANS IMAVLAGAVQ
VQGTMNGFGE RSGNANLCTI IPNLQLKAGY DCIPQENMAD LTATARSISE IANVIHDERA
PYVGKYAFAH KAGMHADAVT KNSIAYEHIN PEVVGNERLF LMSEVAGRSA VLHLIKNIDS
TITKDSPETK LILDKLKELE FEGYQYEGAE SSFEIVIRKI LGKYRPSFEL GEFKVVVNEP
SISGANSSAM IKINVDGQYE ITADEGQGPV NALDKALRKA LEKFYPQIAE MKLTDYKVRV
LDSNSATAAK VRVLIESTDG KEVWTTIGVS TDIIEASWKA LVDSIEYKLI KDKEAKQKS
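
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of such a translation follows. It assumes the NCBI convention that table 11 uses the standard codon assignments, with ATG, GTG, TTG, CTG, ATT, ATC and ATA accepted as start codons that are read as M in the first position. It is not the pipeline that produced this record, and `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(cds: str) -> str:
    """Translate a CDS, reading a leading start codon as M and dropping a final stop."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # an initiator codon is read as methionine
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 18 bases of the gene sequence, starting with the TTG start codon.
print(translate_cds("TTGGGAATTG TAAAGCAT"))  # MGIVKH
```

The output matches the first six residues of the protein sequence, where the initial TTG codon appears as M rather than L.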