Gene Information |
Locus tag | Dgeo_2474 |
Symbol | aroB |
ID | 4073705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008010 |
Strand | + |
Start bp | 2611 |
End bp | 3828 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641228476 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_593982 |
Protein GI | 94971942 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTGA ACCGGGTGAT CGCGCAGACT GTGCCGGTCA CGTTTCGTTA CACCGTTCAG TTCACCCACG GCCTGTTTCA GCTGGAGAAT CCGGTGCTGC GCACCACGCT GGCCGGGGAC ACCCAGGACC CTGTCAAGGT GATGTGCGTC ATCGACGCCG GGGTTCTGGC AGCTTTTCCC GAGCTGCGAG ACCAGATCGG CCGCTATTTC GCCCGGCACG CTGGCGCCCT GCATCTGGTC GCGCCGCCCC TGATCGTACC AGGTGGTGAG CGCGTCAAGC AGGAAGAAGG CTGGGTGCGG CAGGTGCAGG ACCAGATTCA CTGCTGCGGG ATCGACCGTC ATTCCTTTGT GATGGTGGTG GGCGGCGGCG CGGTGATCGA CATGGTGGGC TTTGCAGCTG CCACGGCCCA CCGAGGGGTC CGGCTGGTTC GGGTGCCCAC CACCGTCCTC GCGCAGAACG ACTCGGGCGT GGGCGTCAAG AACAGCGTGA ATGCCTACGG CAAGAAGAAT TGGCTGGGGA CCTTCGCGCC GCCCTACGCC GTGTTGAACG ACTTGGGCTT CTTGCCGGCG CTGGCGGACC GCGATTGGCT GGGCGGGCTG GCTGAGGCAG TCAAGGTCGC GCTGCTGAAA GACGCTGCCT TCTTCGCCTG GCTTGAGGAC CACGCCGGAG CGCTGGTGAA CCGCGACCTG ACGGCGATGG AGGACGCGGT TTACCGCTGT GCGGAGTTGC ATCTCGCGCA CATTGCCGGA AGCGGTGATC CCTTTGAGAT GGGTTCCTCG CGGCCCCTCG ACTTCGGGCA CTGGGCGGCG CACAAGCTTG AGAGCCTGAC CGCCTACACG CTGCGCCACG GTGAGGCGGT GGCGGTAGGC TTGGCCCTCG ACTGCACCTA TGCCGTGCTC AAGGGCCTGC TCCCCGAAGC CGACTGGCGG CGCGTCCTCG ACCTGCTGCT GGCGCTGCGG CTCCCGGTCT ATGTACCGGA ACTGGGTACC TCGGCCCAGG ACCCCGCAGA CCCCATGAGC GTGTTGAGCG GCTTGAACGA GTTCCGCGAG CACCTGGGCG GGCGCCTCAC CGTCCCGCTG CTGACCGCCG TCGGCCAGAT GACCGAGGTG CATGAGCTGG ATCTCGGCCT GCTGCGCCGC AGTGTTGGCC TGCTCAAAGA CCTGCACGAC CACCACTCAG GAGCGATATG TCTGCCTACG CCAATCCCTG CCTGCTAA
|
Protein sequence | MPLNRVIAQT VPVTFRYTVQ FTHGLFQLEN PVLRTTLAGD TQDPVKVMCV IDAGVLAAFP ELRDQIGRYF ARHAGALHLV APPLIVPGGE RVKQEEGWVR QVQDQIHCCG IDRHSFVMVV GGGAVIDMVG FAAATAHRGV RLVRVPTTVL AQNDSGVGVK NSVNAYGKKN WLGTFAPPYA VLNDLGFLPA LADRDWLGGL AEAVKVALLK DAAFFAWLED HAGALVNRDL TAMEDAVYRC AELHLAHIAG SGDPFEMGSS RPLDFGHWAA HKLESLTAYT LRHGEAVAVG LALDCTYAVL KGLLPEADWR RVLDLLLALR LPVYVPELGT SAQDPADPMS VLSGLNEFRE HLGGRLTVPL LTAVGQMTEV HELDLGLLRR SVGLLKDLHD HHSGAICLPT PIPAC