Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1795 |
Symbol | |
ID | 4810040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2119801 |
End bp | 2120826 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107209 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_001038209 |
Protein GI | 125974299 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTATTG TAATGAAACC CAACTCAACG GAAAACGACA TCAACGAGGT AGCAAAGGTA TTGACTTCTT TGGGACTTGG GGTGCATATT TCAAAGGGTT CCGAAAGGAC TATTATCGGT GTAATCGGCG ACAAAAGGAA GCTTTCCGAC GTACCTTTAG AGCTTATGAA CGGCGTCGAA AAGCTGATTC CCATTGTGGA GTCATACAAG CTCGCAAGCA AAACTTTTAA GCCGGAACCC AGTATAATCG ACGTCGGCGG TGTAAAAATC GGAGGCAAGG AAATTGTTGT CATGGCAGGT CCCTGTGCCG TTGAAAGCAG GGAGCAGATT ATGGCTGCCG CCCAGGCAGT AAAAAAAGCC GGTGCGCAGT TTTTAAGGGG CGGAGCTTTC AAGCCGAGGA CTTCCCCTTA TTCATTCCAG GGGCTTGAAG AAGAAGGATT AAAACTTTTA AAAGAAGCAA AAGAAGCAAC CGGACTTCTG ATTATAACCG AGGTTACCAG CGAAAGAGCC ATAGAAATAG CCGACAGTTA TGTTGACATG TTCCAGGTGG GAGCGAGAAA TGTTCAGAAT TTCCAGCTTC TGCGCGAGAT TGGTCGCTCC AAGAAACCTG TTTTGCTGAA AAGAGGTCCT TCAACCACTA TAGACGAATG GTTGAATGCG GCTGAATATA TAATGAGTGA AGGCAATTAC AACGTTGTTC TTTGTGAAAG AGGCATAAGA ACCTTTGAAA CGGCTACCAG AAACACACTG GATATCAGTG CGGTGCCTGT TGTAAAAAGC TTGAGTCATC TTCCGATAAT TGTCGACCCA AGCCATGCGG CAGGAAAAGC CCAGTATATT CTTCCTCTTT CAAAGGCGGC AATTGCCGCG GGCGCAGACG GACTTATCGT AGAAGTCCAT CCGAATCCAA AATGTGCATT GTCCGACGCT GCCCAACAAC TTCCGCCGGA AGATTTCTGT GAACTGTGTA AAGATATAAG TAAAATTGCC GAAATATTAG GAAGAGAGTT TCACTATGCA GGTTGA
|
Protein sequence | MVIVMKPNST ENDINEVAKV LTSLGLGVHI SKGSERTIIG VIGDKRKLSD VPLELMNGVE KLIPIVESYK LASKTFKPEP SIIDVGGVKI GGKEIVVMAG PCAVESREQI MAAAQAVKKA GAQFLRGGAF KPRTSPYSFQ GLEEEGLKLL KEAKEATGLL IITEVTSERA IEIADSYVDM FQVGARNVQN FQLLREIGRS KKPVLLKRGP STTIDEWLNA AEYIMSEGNY NVVLCERGIR TFETATRNTL DISAVPVVKS LSHLPIIVDP SHAAGKAQYI LPLSKAAIAA GADGLIVEVH PNPKCALSDA AQQLPPEDFC ELCKDISKIA EILGREFHYA G
|
| |