Gene Information |
Locus tag | Cthe_0786 |
Symbol | |
ID | 4810404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 949649 |
End bp | 950731 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106203 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001037214 |
Protein GI | 125973304 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
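The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though the listed gene length is consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 949649
end_bp = 950731

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last codon is the stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1083 360
```

Both results agree with the Gene Length (1083 bp) and Protein Length (360 aa) fields.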
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0410044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAC TAAACGTCAA TTTACAGGAC AGAAGTTACC CAATTTATAT AAGTACGGAT TATTCCCAAA TAGGTAAATG CATTCAAAGT GCAAAACTTA CAGGCAAGAT GGTTTTAATA ACCGATACCA ATGTAGACAA ATACCAGGCG GAAGAATGTG TAAAAGCTTT TTCGGATGCG GGATATGAAG TAAGTAAGTT TGTTATTCCC GCAGGAGAGG AAAACAAGAA TTTGGATACC ACCAGGGATA TTTACAAATA CCTGCTTGGT CTGAAACTGG ACAGAAGTGC TACGCTGATG GCGTTGGGCG GTGGAGTTGT CGGAGATATA ACCGGTTTTG CCGCTGCCAC TTTTCTTCGG GGAATAAATT TTGTCCAGAT ACCCACGACT CTTCTTGCCC AGTCGGACAG CAGTGTGGGC GGAAAAGTAG GAGTTGACTT TGAAGGAACC AAAAATATTA TTGGCGCTTT TTACCAGCCG AAATTTGTAT ATATAAATGT CAATACATTA AAAACCCTGC CCGAAAGGGA ACTTAAGGCA GGACTTGCAG AAGTGGTCAA GCATGGCGTA ATTATGGATG AAGAGTTTTA TGAATATATA GACTATAATG TTCACAAAAT ATTAAACCAT GATGAAGCTG TGCTCCAATA TATTGCCAAA AGGAATTGCT CCATAAAAGC TTCGGTAGTT GAAAAGGACG AAAAAGAAGG GGGCCTTAGG GCAATCCTGA ACTTTGGCCA CACGATAGGC CATGCAATCG AGACGGTAAT GAATTTTGAG CTTTTGCACG GAGAATGTGT TTCATTGGGA ATGGTAGGCG CCATGAGGAT GGCCCTGTAT CTTGAGATGA TTGATGAGCA AAGCGTTAAC CGTGTAAAGA ACACTTTGGA TAAAATCGGG CTTCCGACAA GGCTTGAAGG CATTGATGTG GACAAGGTTT ACAATCAAAT GTTTTATGAC AAGAAAATTA AAGGGAGCAA GCTTACCTTT GTACTTCCAA GGAAGAGAAT CGGAGAAGTA ATACAGTGCA CTATCGATGA TGAAGATTTG ATAAAGAGGG TAATAGCCAG CCTTGGTGAA TGA
|
Protein sequence | MIKLNVNLQD RSYPIYISTD YSQIGKCIQS AKLTGKMVLI TDTNVDKYQA EECVKAFSDA GYEVSKFVIP AGEENKNLDT TRDIYKYLLG LKLDRSATLM ALGGGVVGDI TGFAAATFLR GINFVQIPTT LLAQSDSSVG GKVGVDFEGT KNIIGAFYQP KFVYINVNTL KTLPERELKA GLAEVVKHGV IMDEEFYEYI DYNVHKILNH DEAVLQYIAK RNCSIKASVV EKDEKEGGLR AILNFGHTIG HAIETVMNFE LLHGECVSLG MVGAMRMALY LEMIDEQSVN RVKNTLDKIG LPTRLEGIDV DKVYNQMFYD KKIKGSKLTF VLPRKRIGEV IQCTIDDEDL IKRVIASLGE
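The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline. It assumes the gene sequence is already listed in coding orientation (it begins with ATG and ends with TGA, even though the strand is "-"), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# shares these assignments with the standard genetic code; it differs in the
# set of permitted start codons, which this sketch does not handle.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGATAAAAC TAAACGTCAA TTTACAGGAC"
print(translate(prefix))  # MIKLNVNLQD
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.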