Gene Information |
Locus tag | Haur_0266 |
Symbol | |
ID | 5732161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 313141 |
End bp | 314760 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277390 |
Product | shikimate kinase, 3-dehydroquinate synthase |
Protein accession | YP_001543046 |
Protein GI | 159896799 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase; [COG0703] Shikimate kinase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
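The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS length includes one terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon because the stop codon encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(313141, 314760)
print(length)                  # 1620
print(protein_length(length))  # 539
```

Both values agree with the Gene Length (1620 bp) and Protein Length (539 aa) fields of the record.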
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCAT CTGGGCAATC GATTGCATTA ATTGGCCCAA GCGGCGCAGG CAAAAGTACG GTTGGCGTAG GTTTAGCCCA AGCGCTCGGC TGGCGTTTTA TCGATTTAGA TCAACTGATT ATCGAACGCG CTGAAAAAAG TATTAGCGAC ATTTTTAGCC AAGAGGGCGA AGCAGGCTTT CGTGAGCGCG AAACCGCCGC TTTGGTCCAA GCACTACAAA CAGATCAAGC CGTCATCGCC TGTGGTGGCG GCATCGTTTT GCGCGAAATT AATCGGCAAT TGCTGCGTGA ACAGGCTTGG TGTGTCTATC TCACCAGCGC CATCAGCACC TTGGTTAAAC GTTTAACTGC CGATCAAGCC AATCCTCGGC CATTGCTGGC GAGCGATATT AATGAACAAC TGGTGATTCA GCTCGCCGAG CGTTTGCCAC TTTACAGCAC CCTTGCCAAC TGGACGATCC AGACCGATGG CTTGGCTCCA CAAATTGTGG TCGAACAACT CATTCGGGCT TGGGGTTTAG TGGGCAAACC GCAGGTCAGC GAAGTGTTTA GCTCATTCAA CAGTCGTTAT TATGTCAAAA GCGGCGCACT GGCAGCCCTA CCCAATGAAT TAGCCAATCT TGGCCTCACT GGCACATGCT GGCTAATCAG CGACAGTCAT GTTGGGCCGA TGTATGCGCC AACCTTGATT CAAGCCTTAG AAAACTCCGG TTTCAGCTGC CAACGCTACG ATATTCCGGC GCAAGAGGCT AGCAAATCAT GGCAGCAAGC AGGCGAATTA TATACATGGC TGCTCGAAAA TGGCGTGCAG CGTGGCGATA CGGTGTTGGC GTTGGGCGGC GGCGTGGTTG GCGATTTAGC GGGCTTTATC GCGGCCTCAA TTTTGCGCGG CATCGCGGTC GTGCAATTGC CAACCACAAT TTTGGCGATG ATCGATAGCA GCATTGGTGG CAAAACAGGC ATTAATCATC CACGGGGCAA AAATTTGATC GGTGCGTTTC ATCCGCCACG TTTGGTGTTA ATTGATGATG CAGTATTGAG CAGTTTGCCG CGTCGCGAGC GGGCAGCAGG CTGGGCCGAA GCCGTCAAAC ATGGCGTGAT TGCTGATGCG CAGTTGTTCG CTGATTTGGA GCAAGCAGGG GCAAGCCTCA ATGATGTGCC CGCCAAAATA ACCAGCGATC TGTTGGTACG CTCGGCGGCG GTCAAAATTG GCGTGGTCAA TCGCGACGAA CGCGAAACTG GCGAACGCAT GTTGCTCAAT TATGGCCATA CCTTGGGCCA AGCCGTCGAG GCCGCGACCC AATACAAACG CTATGTGCAT GGCGAAGCAG TAGCGATTGG CATGACCTTT GCTGCCAATC TGGCAGTGCA ACTTGGCATG TGGTCTAGCG CCGAGGCCGA GCGTCAACGG GCTTTGTTGC AAGCACTCGA ATTACCAACC GCCCTGCCGC GTGATTTGGA TATCGAGGCG ACTTTGGCAG CTTTGAACTT AGATAAGAAA CGTGCCAAAG GCAGCGTGCG TTGGGTATTG CCAACCCGCA TCGGCCATGC GCAAGTTGAA TCACACGTTG ACCCAGAATT GGTGCGCAAA TTGGTGCACG AATTGGTCGA GCAAACCTAA
|
Protein sequence | MQPSGQSIAL IGPSGAGKST VGVGLAQALG WRFIDLDQLI IERAEKSISD IFSQEGEAGF RERETAALVQ ALQTDQAVIA CGGGIVLREI NRQLLREQAW CVYLTSAIST LVKRLTADQA NPRPLLASDI NEQLVIQLAE RLPLYSTLAN WTIQTDGLAP QIVVEQLIRA WGLVGKPQVS EVFSSFNSRY YVKSGALAAL PNELANLGLT GTCWLISDSH VGPMYAPTLI QALENSGFSC QRYDIPAQEA SKSWQQAGEL YTWLLENGVQ RGDTVLALGG GVVGDLAGFI AASILRGIAV VQLPTTILAM IDSSIGGKTG INHPRGKNLI GAFHPPRLVL IDDAVLSSLP RRERAAGWAE AVKHGVIADA QLFADLEQAG ASLNDVPAKI TSDLLVRSAA VKIGVVNRDE RETGERMLLN YGHTLGQAVE AATQYKRYVH GEAVAIGMTF AANLAVQLGM WSSAEAERQR ALLQALELPT ALPRDLDIEA TLAALNLDKK RAKGSVRWVL PTRIGHAQVE SHVDPELVRK LVHELVEQT
|
| |
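The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the nucleotide sequence. It rests on a few stated assumptions: the gene sequence is already in coding orientation even though the Strand field is "-" (it starts with ATG and ends with TAA), and translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. Because this gene starts with ATG, alternative start codons are not handled. The helper names `clean_sequence`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
head = clean_sequence("ATGCAACCAT CTGGGCAATC GATTGCATTA")
print(translate(head))  # MQPSGQSIAL
```

The printed residues match the first block of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_content` gives a fraction that can be compared with the reported GC content of 53%.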