Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0732 |
Symbol | |
ID | 4810350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 889176 |
End bp | 890360 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640106149 |
Product | chorismate synthase |
Protein accession | YP_001037160 |
Protein GI | 125973250 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0075518 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGGAA ATACATTCGG CAGAATATTC AGGGTTACAA CCTGTGGAGA ATCTTATGCA GGTGCTTTTC GCAAAAATCT TCAAATACCA AAAGAGTTGT TTGGGGGACT AATAGCAATC GTGGACGGTG TTCCGCCCGG AATAAAGCTC ACTGCTGATT TCGTGCAGGA GGAGCTTGAT AAAAGAAGAC CGGGAAAAAC TCCTTTGGAT ACACCAAGAA AAGAAAGGGA CAAAGTATAT ATTTTTTCCG GAGTAATGGA AGATGATATT ACAACCGGTG CTCCTGTCGG GATGATTATA CCCAATGACG TTATTGAGGA TGAGCACATT AACAAGCATA AGAGCTACAA AGAGGTTGTA AGACCGGGAC AGGCAGGATA TACCTTTTTT AAGAAGTACG GACAATTTGC AGACAATATA GGTGCAGGAA GAGCTTCCGG AAGAGAAACG GCTGCCCGTG TTGCCGCCGG AGCCGTGGCA AAGGCGGTGT TAGATACCAT GGGTATAGAT GTAATTGCTT TTGTAACTGA AATACACGGA ATTAAAGCCC AGGAAAATAT TACTTATGAA ATGGCCAAAG CCAATTATCG CAAAAATGAA ATAAACTGCC CGGACCTTGA AAAAGCAAAA GAAATGATTG AAGAACTTAA AAGGATAAAG GAAGAAGGAG ATTCTGTAGG CGGAGTGGTG GAAATAATTG CAAGAGGTGT TCCGGCAGGT TTGGGAGAAC CCGTGTTTGA CAAGCTTCAG GCCACACTTG CCCACGCCTT AATGTCCATT GGAGCCATAA AAGGGATAGA GTTTGGCGAA GGATTCGGTC ATACAAAATT AAAGGGTTCG GAATCAAACG ATGTTCCTTA TTACGATGAA GCCTCAGGCC GTGTAAGATT TAAAACCAAC AGAGCGGGAG GTATACTGGG CGGAATTTCC AACGGCGAGG ATATCAGAAT CAGAGTTGCG GTTAAGCCGA CGCCTACTAT TTCAATACCT CAGAAAACAG TAAACATGTA CACTCTTGAG AATGTTGAAG TAGAGTTTAA CACAAGAAAC GATCCTTCAA TATGTCCAAG AATTTATCCT GTATGTGAAG CTATGGTCAG AATTGCTCTT TTGGATGCTT TATATATTGC AAAAGGCTAT AGGGCAATCA GCAGCAACAT AGATCCCCGT TGGGACCGTT TATAA
|
Protein sequence | MVGNTFGRIF RVTTCGESYA GAFRKNLQIP KELFGGLIAI VDGVPPGIKL TADFVQEELD KRRPGKTPLD TPRKERDKVY IFSGVMEDDI TTGAPVGMII PNDVIEDEHI NKHKSYKEVV RPGQAGYTFF KKYGQFADNI GAGRASGRET AARVAAGAVA KAVLDTMGID VIAFVTEIHG IKAQENITYE MAKANYRKNE INCPDLEKAK EMIEELKRIK EEGDSVGGVV EIIARGVPAG LGEPVFDKLQ ATLAHALMSI GAIKGIEFGE GFGHTKLKGS ESNDVPYYDE ASGRVRFKTN RAGGILGGIS NGEDIRIRVA VKPTPTISIP QKTVNMYTLE NVEVEFNTRN DPSICPRIYP VCEAMVRIAL LDALYIAKGY RAISSNIDPR WDRL
|
| |