Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_1423 |
Symbol | aroB |
ID | 5877814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1458219 |
End bp | 1459289 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 641541772 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001663048 |
Protein GI | 167040063 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000905299 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGATTTTA TAACAATTGA TTTAAAAGAA AGGTCTTATC CTATTTATTT TGCCTATGAT TCTTTTGATA AATTAGGAGA AATAGTAAAG AAACATGTAA GAAGTAGTAA AACATTTATA ATAACTGATT TTAATGTTTA TCCTTTGTAT TTTGAAAAAC TCAATGAAAG TCTTAAAAAA AGTCGTTTTG ATGTGTCATA TGAAGTTATT CCTGCAGGGG AAACAAGTAA AACAATGGAA ATGGCACAAA GACTTCTTGA AAAAGCTTAT GATAGTGGAC TTTTAAGAGA CAGTTCAGTT ATAGCTCTTG GAGGAGGCGT TGTAGGAGAC ATAGCTGGAT TTGTCGCAGC AACTTACATG AGGGGAATAG ACTTTGTGCA AATTCCCACA ACCTTATTGG CTCAGGTTGA TAGTAGTGTT GGTGGCAAAG TAGCAGTCAA TCTAAAAAAA GGCAAAAACA TAGTAGGAGC TTTTCATCAG CCTAAAATGG TATATATAGA CGCTGCTGTT TTAAACACGT TAGATAAAAG AGAGATACTT GGAGGATTAG CTGAGATAAT CAAATATGGA ATTATATGGG ATTTTGATTT ATTTGAATAC ATCGAAAATA ACCTGCATGA AATTTTAGAT TTAAAAGAAG ACAAATTAAA ACATATAGTC AAAAAATCTT GTGAAATAAA AGGGAAAATC GTATCCCTTG ATGAAAAAGA GGAAAACCTA CGTTCAATAT TAAATTTTGG CCATACCATA GGGCATGCCA TTGAAGCTTT GACCGGTTAT GAGTGGTATA TTCATGGAGA AGCAGTCGCT ATAGGGATGG TATACGCTTG TAAACTTGCT CTAAATTTAG GATATATTGA TGAAAAATAT TTTGAAAGGA TTTTTTCTTT AATACAAAGG ACAGGATTAC CCACAGATTA TGAGGATTTG CATAAAGAAG ACATTATAAA AGCTATAAAA CTTGACAAAA AAAATAGAAG CAGCAAAATA AATTTTGTTC TTCCTTGTGG TTTTGGAAAA GTTGAAGTCA TAAGTGTTAG AGAAGAAGAA ATTTTAAAGG TTTTAAAATA A
|
Protein sequence | MDFITIDLKE RSYPIYFAYD SFDKLGEIVK KHVRSSKTFI ITDFNVYPLY FEKLNESLKK SRFDVSYEVI PAGETSKTME MAQRLLEKAY DSGLLRDSSV IALGGGVVGD IAGFVAATYM RGIDFVQIPT TLLAQVDSSV GGKVAVNLKK GKNIVGAFHQ PKMVYIDAAV LNTLDKREIL GGLAEIIKYG IIWDFDLFEY IENNLHEILD LKEDKLKHIV KKSCEIKGKI VSLDEKEENL RSILNFGHTI GHAIEALTGY EWYIHGEAVA IGMVYACKLA LNLGYIDEKY FERIFSLIQR TGLPTDYEDL HKEDIIKAIK LDKKNRSSKI NFVLPCGFGK VEVISVREEE ILKVLK
|
| |