Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1205 |
Symbol | aroC |
ID | 4240706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1359929 |
End bp | 1361011 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638104768 |
Product | chorismate synthase |
Protein accession | YP_719417 |
Protein GI | 113461348 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0521105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGGAA ATAGTATTGG GCAACTTTTC AGGGTAACGA CATTTGGAGA ATCCCATGGT ATTGCGTTGG GCTGCATTGT AGACGGTATG CCGCCCGGAC TTGCTTTATC CGAAGATGAT ATTCAGCCGG ATTTGGATCG TCGTAAACCG GGAACTTCAA AATATACTAC GCCACGCCGT GAAGAGGATA AAGTACAAAT TTTGTCCGGT GTGTTTGACG GAAAAACCAC AGGAACAAGT ATCGGTATGA TAATTAAAAA TACCGACCAA CGTTCGCAAG ACTATGGTGA AATAAAAGAT CGTTTTCGTC CGGGGCATGC TGATTTTACT TACCAGCAAA AATATGGTTT ACGTGATTAC CGTGGCGGAG GGCGATCTTC TGCACGTGAA ACAGTGATGC GTGTAGCAGC GGGGGCAATT GCTAAAAAAT ATTTGCGTGA GTATTTTGGT ATTGAAGTAC GAGGTTATTT ATCACAAATC GGTGAGATAA AAATTGATCC GCAAGTGGTA GCCGATGTGA GTAAAATTGA TTGGGCGAAA GTAAATAGCA ATCCATTTTT TTGTCCCGAT GAAAGTGCGG TGGAAAAATT TGATGAATTG ATTCGAGAAT TGAAAAAGGC AGGAAATTCT ATTGGGGCGA AATTAACCAT TGTTGCTGAA CATGTGCCTG TCGGTTTAGG CGAGCCTGTT TTTGATCGTT TAGATGCGGA TTTAGCTCAT GCTCTCATGG GAATTAATGC GGTAAAGGCG GTAGAAATTG GTGACGGTTT TGCAGTTGTT GAACAAAAAG GAACGGAACA TCGTGATGAA ATGACACCGG AGGGGTTTTG TTCCAACCAT GCCGGCGGTA TTTTGGGCGG GATTAGCTCT GGGCAACCGA TTATCGCTAC CATTGCGTTA AAGCCTACTT CGAGTATTAC TGTTGTAGGG CGTTCAGTGA ATTTAAATAA TGAGCCTGTT GACGTGATTA CTAAGGGACG GCATGATCCT TGTGTGGGAA TAAGAGCTGT TCCAATTGCT GAGGCGATGA TGGCTATTGT TTTATTGGAT CATTTATTAC GCTTTAAAGC ACAATGTAAA TAG
|
Protein sequence | MAGNSIGQLF RVTTFGESHG IALGCIVDGM PPGLALSEDD IQPDLDRRKP GTSKYTTPRR EEDKVQILSG VFDGKTTGTS IGMIIKNTDQ RSQDYGEIKD RFRPGHADFT YQQKYGLRDY RGGGRSSARE TVMRVAAGAI AKKYLREYFG IEVRGYLSQI GEIKIDPQVV ADVSKIDWAK VNSNPFFCPD ESAVEKFDEL IRELKKAGNS IGAKLTIVAE HVPVGLGEPV FDRLDADLAH ALMGINAVKA VEIGDGFAVV EQKGTEHRDE MTPEGFCSNH AGGILGGISS GQPIIATIAL KPTSSITVVG RSVNLNNEPV DVITKGRHDP CVGIRAVPIA EAMMAIVLLD HLLRFKAQCK
|
| |