Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1114 |
Symbol | aroB |
ID | 4240615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1250334 |
End bp | 1251422 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104677 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_719326 |
Protein GI | 113461257 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0407056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTGTG TAAATGTAGA ACTAAGGGAA CGCAGTTATC CTATCCATAT CGGAATGGGG TTGTTATCCG AAGCACAAGT TTATCCGCTA AAAAAAGGCG ATAAAGTGAT GATTGTAACT AACCCTACAA TTGCACAGTA TTATTTATCC TCTGTAACAG ACACCTTAGA AAAAATTGGT TGCTCGGTAG AGAATGTGCA ACTCCCAGAA GGTGAACAAT ACAAAACTTT AGAATCTCTA GACTTAATTT TTACCGCACT TTTAAAAGCT AATCATGGAC GAGATACCTC TATTATTGCA CTGGGTGGCG GTGTGATCGG TGATATTGCC GGATATGCAG CGGCAAGCTA TCAACGTGGT GTCCGTTTTA TTCAAATTCC AACCACATTA CTTGCTCAAG TAGATTCTTC CGTTGGGGGA AAAACGGCTG TCAATCACAA ATTAGGAAAA AACATGATCG GTGCTTTTTA TCAACCTTGT GCCGTTATTA TTGATACGCT AACGCTAACT ACTTTACCTA AAAGGGAAAT TCACGCAGGT TTAGCTGAAG TCATTAAATA TGGTGCTATT TTAGATGATG AATTTTTTAC ATGGCTAGAA AAACATATAA CTAATTTAGT TGCTTTAGAA CAACAATATT TACAGCAGTG CATTGCTCGC TGTTGTCAAA TTAAAGCAGA TGTGGTTACT CGTGATGAAA CTGAAAAAGG AGAGCGTGCG TTATTAAATT TAGGTCATAC TTTCGGGCAT GCTATCGAAA CTCACCTTGG ATATGGAAAT TGGTTACATG GAGAAGCAGT CGCAACCGGA ATGATGATAG CTGCGATCTT GTCTAATAAA TTAGGTGATT TATCACTTAA CGATGTAACG AGACTGGAGA AACTTTTAAT TCAAGCAGAT TTACCTACAG CCTCACCTGA TACAATGAAA GCTGAAGATT ATCTACCACA TATGATGCGT GATAAAAAGG TTCTTGCTGG AAAATTACGC CTAGTCTTAC TAAAATCACT TGGTCAAGCC TACGTTGCAA CAGATACAGA CAAAGAATAC GTGCTTGATG CAATTCGTAC CTGTTCAAAA AAAAGTTAA
|
Protein sequence | MLCVNVELRE RSYPIHIGMG LLSEAQVYPL KKGDKVMIVT NPTIAQYYLS SVTDTLEKIG CSVENVQLPE GEQYKTLESL DLIFTALLKA NHGRDTSIIA LGGGVIGDIA GYAAASYQRG VRFIQIPTTL LAQVDSSVGG KTAVNHKLGK NMIGAFYQPC AVIIDTLTLT TLPKREIHAG LAEVIKYGAI LDDEFFTWLE KHITNLVALE QQYLQQCIAR CCQIKADVVT RDETEKGERA LLNLGHTFGH AIETHLGYGN WLHGEAVATG MMIAAILSNK LGDLSLNDVT RLEKLLIQAD LPTASPDTMK AEDYLPHMMR DKKVLAGKLR LVLLKSLGQA YVATDTDKEY VLDAIRTCSK KS
|
| |