Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2111 |
Symbol | aroB |
ID | 5054492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1886503 |
End bp | 1887522 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640469663 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001154309 |
Protein GI | 145592307 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.71054 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTCT ACTACAGACA CAGCAGAGGC GTCACCGAAG TGGTGGTTGG GAAGGGACTG CCCTACGGCG ACTACGCCGA GAGGGCTGTG GTTCTCATTG AGGAGGGCTT AGAGAACCCT CTTCCAGGCG CGCCCGCCCT AGTCCTCAAG GGAGGGGAGG GGGTGAAGAG CCTTGATGCG CTTACCCAAG TGTACAAGTT CCTCTACGAA GTGGGGGCCG ACCGGTCCAC AACATTGGTG GCGGTGGGGG GAGGGGCGCT TTTAGACTTG GCCACCTTCG CCGCGGGGAC GTTCATGAGG GGCATCCGCC TCGTCCAGGT GCCCACCACT CTGCTGTCCA TGGTGGATGC AGCGCTGGGT GGGAAGGGGG CGGTTGATTG GGGCCACGTG AAGAACTTGG TGGGGGTCTT CTATCAGCCC TCGGCTATCC TCTGTGACTT GAGATGGGTG GAGACGTTGC CCGAGAGGGT GTACAGATCG GCCTTCGCCG AGGTGGTGAA ATACGGAGTG GCTCTCGACG GCGAGTTCTA CAACTGGCTT CGTGAAAACG TACCTCGTCT GCTGAGGAGG GAAGAGGAGG CCCTCGAGCA GGCGGTGTAC CGCTCTCTTA GAATTAAGGC CTCTGTGGTT GAGGCGGATG AGTTCGAGGA GAGGGGGATT AGAAATGTCC TCAACGTGGG CCACACGGTG GGCCACGCCG TTGAGCGGGT GCTGGGGCTT CTCCACGGAG AGGCCGTTGC CGTGGGCATC GTAGCTGAGG CTTACCTCTC GGCGGAAATG GGGTACCTCA AAGATGGAGT GGTTGAGGAG ATTAAGGCGC TCATCTCCTC TTTCGGCCTC CCCACGGCGG TTAAGCCCGG CGACTCTGAA TTGGAGGAGG CGAGGAGGCT ACTCCTCTAC GACAAGAAGA GGAGGGGCGA CTACATATAC ATGCCCCTTG TGGTGAGGGT GGGGAGGTGG GTGTTGGAGA GGGTTAGGCC GGAGGAGGCG GCTAAGGCGC TTCGCTATGT TGTGTATTGA
|
Protein sequence | MRFYYRHSRG VTEVVVGKGL PYGDYAERAV VLIEEGLENP LPGAPALVLK GGEGVKSLDA LTQVYKFLYE VGADRSTTLV AVGGGALLDL ATFAAGTFMR GIRLVQVPTT LLSMVDAALG GKGAVDWGHV KNLVGVFYQP SAILCDLRWV ETLPERVYRS AFAEVVKYGV ALDGEFYNWL RENVPRLLRR EEEALEQAVY RSLRIKASVV EADEFEERGI RNVLNVGHTV GHAVERVLGL LHGEAVAVGI VAEAYLSAEM GYLKDGVVEE IKALISSFGL PTAVKPGDSE LEEARRLLLY DKKRRGDYIY MPLVVRVGRW VLERVRPEEA AKALRYVVY
|
| |