Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0941 |
Symbol | aroG |
ID | 4240434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1041668 |
End bp | 1042747 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104497 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_719152 |
Protein GI | 113461084 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAGA ATAAAAATAA TATTCGTATT GCAAATGATG ACACCAGAAT TGCAAAAGTT GAGCAAGTTT TGCCCCCGAT TGCATTATTA GAAAAATTTC CTGCAAGTGA TGTAGCCATT AAAACAGTAA AAAAAGCCCG CTTGGCAGCA CATAACATTA TTCATCAAAA AGATGATCGT TTACTCGTTA TTATTGGACC TTGTTCTATT CATGATCCTG CTTCTGCGTT GGAATATGCT AAACGCATTA AAGAAATGCG TATAAAATAT CATGATACGT TGGAAATTAT AATGCGTGTG TATTTTGAAA AACCACGTAC AACTGTGGGT TGGAAAGGAT TAGTGAATGA TCCTCATTTA GATGGTAGTT ATGCTTTAAA TGATGGTTTG CGTATTGCAC GTAAATTGTT ATCTGATATT AACGATATGG AAATGCCTAC TGCCGGTGAA TTTTTAGATA TGATTTCACC GCAATACCTT GCGGACTTTA TGAGTTGGGG GGCTATTGGA GCAAGAACAA CGGAATCGCA AGTTCATCGT GAATTGGCAT CCGGTTTGTC ATGTGCGGTG GGCTTTAAGA ATGGAACTAA CGGTGGAGTT CGCATTGCTC TTGATGCAAT AGGTACGGCA GAAGCACCAC ATTATTTTCT ATCTGTCACA AAATTCGGTC ATTCAGCAAT TGTGTCTACA AAAGGAAATG AGGATTGTCA TATTATTTTG CGTGGAGGTG AGAAAGGTCC TAATTATAGT GCGGAAGATG TTCGGACAGT TTGTGCGGAT ATAGAAAAAA CCGGTCGCAT TCCGCATGTT ATGGTGGATT TTAGTCATGC GAACAGTAGT AAACAATTTA AAAAACAGTT AGATGTTTGC ACTGATGTCT GTACTCAAAT TGCCAATGGT TCAAAACAAA TTTTTGGTGT CATGGTTGAA AGCCATTTAG TCGAAGGTCG TCAAGATTTA GTTGAAGCAA AAGCACTTAC CTATGGACAA AGTATTACCG ATGCTTGTAT TGGTTGGTCA GATACAGAAA TCGTATTACA ACAATTGTCT GAGGCATTAA TTGAACGACG TAAGAAGTAA
|
Protein sequence | MAKNKNNIRI ANDDTRIAKV EQVLPPIALL EKFPASDVAI KTVKKARLAA HNIIHQKDDR LLVIIGPCSI HDPASALEYA KRIKEMRIKY HDTLEIIMRV YFEKPRTTVG WKGLVNDPHL DGSYALNDGL RIARKLLSDI NDMEMPTAGE FLDMISPQYL ADFMSWGAIG ARTTESQVHR ELASGLSCAV GFKNGTNGGV RIALDAIGTA EAPHYFLSVT KFGHSAIVST KGNEDCHIIL RGGEKGPNYS AEDVRTVCAD IEKTGRIPHV MVDFSHANSS KQFKKQLDVC TDVCTQIANG SKQIFGVMVE SHLVEGRQDL VEAKALTYGQ SITDACIGWS DTEIVLQQLS EALIERRKK
|
| |