Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1553 |
Symbol | aroB |
ID | 5733440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1804427 |
End bp | 1805584 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278692 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001544324 |
Protein GI | 159898077 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTATGC GTTCGATCCT GCAAGCGTTT GAAGTACGCT ATTCCTATCC GGTGCATTGT ACGCACCGAT TATTTGGCTT GGATAACCCA ATTCTCCATG AATTATTTGC GCCAAGCGCT AGTCTACCGA AGCTTTGGGT TGTGCTTGAT CAGGCGGTGG CTGAGCATCA CCCCAATCTA TTAACTGAAA TTGCAGCCTA TGCCCAAGCC AGCCAAGCCT TCAGCTTGGT TGAGCCAAGC CTGATTTTGG CTGGTGGCGA AGCAATTAAG CAGACTACTG AGCCATTGCA AGCAGTCTAC GATGGAATTA ATCGCTATGC GATCGATCGC CATTCGTATC TGATGGCAAT TGGCGGCGGG GCGTTGATCG ATATGGTTGG CTATGCAGCG GCGACGGCGC ATCGCGGGGT GCGCTTAATT CGCGTGCCAA CCACAGTTTT GGCCCAAAAC GATGCAGCGG TTGGGGTTAA AAATAGCATC AATGCCTTTG GCAAAAAGAA TTTCTTGGGC ACATTTGCTC CGCCTTATGC TGTGCTCAAC GATAGCCATT TTCTCACGAC GCTGAGTGAG CGCGATTGGC GCAGTGGCAT CGCCGAGGCA ATCAAGGTAG CTTTGCTCAA AGATCCCGCC TTTTTTGCCA CGATTGAGCG TACTGCTGCG GCCTTGCGTC AGCGTGATTT AGCCGTAATG GAAGATCAGG TTTTTCGCTG TGCCGAGCTG CATTTGGCCC ACATCGCTGG TGGCGACCCG TTTGAGCGAG GCTCAGCGCG GCCCTTGGAT TTTGGCCATT GGGCGGCGCA TAAACTTGAA CAGCTCAGCA ATTATAGTTT GCGCCATGGC GAGGCGGTGG CAATTGGCAT CGCCTTGGAT TGCACCTACA GCTATTTAAA CGCTGATTTA GCCGAGGCCG ATTGGCAACG GGTCTTGACT TGTCTAACCG CAGTTGGCTT CGAACTCTAT CATCCTGCCC TGAGCAACCA GCTTGAACTG CCCGAACATC CCCAAAGTTT GCTGAGCGGC TTGGCCGAAT TTCGCGAGCA CCTTGGTGGC CAATTAACCA TCACCTTAAT GCGCGGCATC GGCCAACCCT ACGATGTTCA CACAATTGAT CTACCGATGA TGCAACAAGC CATTCGATAT TTAGCCGAAC GGGCTTAA
|
Protein sequence | MVMRSILQAF EVRYSYPVHC THRLFGLDNP ILHELFAPSA SLPKLWVVLD QAVAEHHPNL LTEIAAYAQA SQAFSLVEPS LILAGGEAIK QTTEPLQAVY DGINRYAIDR HSYLMAIGGG ALIDMVGYAA ATAHRGVRLI RVPTTVLAQN DAAVGVKNSI NAFGKKNFLG TFAPPYAVLN DSHFLTTLSE RDWRSGIAEA IKVALLKDPA FFATIERTAA ALRQRDLAVM EDQVFRCAEL HLAHIAGGDP FERGSARPLD FGHWAAHKLE QLSNYSLRHG EAVAIGIALD CTYSYLNADL AEADWQRVLT CLTAVGFELY HPALSNQLEL PEHPQSLLSG LAEFREHLGG QLTITLMRGI GQPYDVHTID LPMMQQAIRY LAERA
|
| |