Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2519 |
Symbol | |
ID | 4809275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2987849 |
End bp | 2989468 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107935 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_001038914 |
Protein GI | 125975004 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00968739 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGAATTG TAAAGCATTT TTTCAGGAAG GATGAAAAGA TGAAGAAGAT TTTGATATAT GACTCTACCT TAAGGGATGG TGCCCAGGCA CAGGGTATTT CGTTCAGTGT TGAAGATAAA CTGAAAATTG TGGAGAGACT TGACCGGCTC GGTATAAGCT ATATTGAAGC AGGTAACCCG GGCTCCAACC CAAAGGATTT GGAGTTTTTT GACAGAATAG GGCGGGTCAA GTTAAGGCAT GCAAAAATAA TCGCCTTTGG AAGTACCAGA AGAGTCAATG TAAGTGTTCA GGAGGATGCC AATGTAAAAT CGCTTTTAAA GGCGGACACT CCGGCTGTGG CCATATTCGG CAAAAGCTGG GACTTTCATG TTACCGATAT TTTAAAGACA ACACTGGATG AAAATTTGAG AATGATTTTT GACACCATAT CCTTCTTTAA GAACAAAAAT AAAGAAGTGG TTTTTGATGC TGAGCATTTC TTTGACGGAT ACAAGGCCAA CCCGGATTAT GCAATGAAAA CCCTCAAAAC TGCTGTTGAG GCCGGAGCGG ACTGTATTTG CCTTTGCGAT ACCAATGGAG GAACATTCCC GAATGAAATC AAGGATATTA CCGCCAGGGT TGTGAGCGAG TTTAACGTGA ATGTCGGTAT TCATTGCCAC AATGACACGG GCATGGCGGT TGCCAACTCC ATTATGGCGG TGCTGGCCGG TGCCGTGCAG GTTCAGGGGA CAATGAACGG GTTTGGAGAG AGAAGCGGTA ATGCCAATCT CTGCACAATA ATACCCAATT TGCAGCTTAA AGCAGGCTAT GATTGCATAC CGCAGGAAAA CATGGCGGAC CTTACGGCTA CTGCAAGGTC CATAAGTGAA ATTGCCAATG TTATACATGA TGAAAGGGCT CCGTATGTAG GGAAATATGC ATTTGCCCAC AAGGCGGGAA TGCATGCGGA TGCGGTAACC AAAAACTCCA TAGCTTACGA ACACATCAAC CCTGAAGTTG TCGGAAACGA AAGGCTTTTT CTCATGTCGG AAGTTGCGGG AAGAAGCGCT GTGCTTCATT TAATCAAAAA TATTGACAGC ACTATTACAA AGGATTCTCC CGAGACAAAA TTGATACTGG ACAAGCTCAA GGAGCTGGAA TTTGAAGGCT ATCAGTACGA AGGTGCGGAG AGTTCTTTTG AAATTGTGAT TCGGAAAATC CTTGGAAAAT ACCGTCCTTC CTTTGAACTT GGAGAGTTTA AGGTTGTGGT TAACGAACCG TCTATTAGCG GTGCGAATTC TTCCGCCATG ATAAAGATTA ATGTGGACGG ACAGTATGAG ATAACCGCGG ATGAGGGACA GGGTCCGGTA AATGCGCTGG ACAAGGCGCT AAGAAAGGCT TTGGAGAAAT TTTATCCTCA GATTGCGGAA ATGAAGCTTA CCGACTACAA AGTTAGGGTT CTTGATTCCA ACTCGGCTAC GGCTGCAAAG GTAAGGGTTT TAATTGAGTC AACCGACGGT AAAGAAGTCT GGACAACCAT TGGAGTTTCA ACGGACATTA TTGAAGCCAG CTGGAAGGCG TTGGTGGATT CTATAGAATA CAAGCTTATC AAGGACAAGG AAGCAAAACA AAAGTCTTAA
|
Protein sequence | MGIVKHFFRK DEKMKKILIY DSTLRDGAQA QGISFSVEDK LKIVERLDRL GISYIEAGNP GSNPKDLEFF DRIGRVKLRH AKIIAFGSTR RVNVSVQEDA NVKSLLKADT PAVAIFGKSW DFHVTDILKT TLDENLRMIF DTISFFKNKN KEVVFDAEHF FDGYKANPDY AMKTLKTAVE AGADCICLCD TNGGTFPNEI KDITARVVSE FNVNVGIHCH NDTGMAVANS IMAVLAGAVQ VQGTMNGFGE RSGNANLCTI IPNLQLKAGY DCIPQENMAD LTATARSISE IANVIHDERA PYVGKYAFAH KAGMHADAVT KNSIAYEHIN PEVVGNERLF LMSEVAGRSA VLHLIKNIDS TITKDSPETK LILDKLKELE FEGYQYEGAE SSFEIVIRKI LGKYRPSFEL GEFKVVVNEP SISGANSSAM IKINVDGQYE ITADEGQGPV NALDKALRKA LEKFYPQIAE MKLTDYKVRV LDSNSATAAK VRVLIESTDG KEVWTTIGVS TDIIEASWKA LVDSIEYKLI KDKEAKQKS
|
| |