Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1114 |
Symbol | aroH |
ID | 5137428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 1173863 |
End bp | 1175050 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640532572 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001217060 |
Protein GI | 147675263 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTCAAAT GCTCGGTTTT TACTTCAATT TACACCCCTT TCCGCAGCTG TTTTCGGCGT AATGGTTTAT TCTCGTTGAC AAATTCGTTA TCTTTAGCGC TTCGCAATAA AAGGACAGGC GCTCCTATGC CACTGAAAAC CGATGAATTA AGAACCCAAG CATTGGGACC TATGCCTACT CCGGCAGAAT TAAGTCACGC ACATCCTATC ACTGATGAAG TTGCGCTACG GATCGCGCAG TCTCGTCGCC AAATTGAAAG CATTCTTACC GGTGAAGATG ATCGCCTATT AGTGATAGTG GGGCCTTGTT CGGTGCATGA TACCGATGCC GCACTCGACT ACGCTCGCCG CCTTGCTGCG CTACAAGAAA ACTATACTGA TGAGCTTTTT GTGGTGATGC GGACCTATTT CGAAAAACCA CGTACTGTGG TGGGTTGGAA AGGACTGATC ACCGATCCAA ACTTAGATGG CTCTTACGCT TTAGAAACCG GTCTCAATAA AGCGCGAAAG TTGCTGCTTG ATGTAAACAA GCTCGGATTG GCTACCGCGA CCGAGTTTCT TGATATGATC ACAGGCCAAT ACATCGCGGA CCTTATCACG TGGGGCGCAA TTGGTGCGCG TACCACTGAG TCGCAAATTC ACCGTGAGAT GGCCTCTGCG CTCTCCTGCC CTGTGGGTTT TAAAAATGGC ACTAACGGTA ATGTGAAAAT CGCGATTGAT GCGATCCGCG CGGCCAAAGC GTCACATTAC TTCTATTCAC CAGATAAGAA TGGTCGTATG ACGGTTTACC GTACCAGTGG TAACCCATTT GGTCATATTA TTCTGCGTGG TGGTGATAGC GGACCAAACT TTGATGCGGC TTCGATTAAT GAAGCTTGCC AGCAGTTGGC GCAATTCAAC TTACCAGAGC GTTTAGTGGT GGATTTCAGC CACGCGAACT GTCAAAAACA ACACCGTAAA CAAGTGGATG TCGCGCGCGA TATTTGCCAG CAAATTGAAG CTGGCAGCCA CAAAATTGCG GGCATCATGG CGGAAAGCTT CCTTGTGGAA GGCAATCAGC CAATGCACGA TCTCAATAAT CTGACTTATG GTCTGTCGAT CACCGATCCT TGTTTAGGAT GGAAAGATAC CGCCACCATG CTTGATATGC TGGCTCAATC GATCAAAGTC CGTCGTTCTC GTCATTAA
|
Protein sequence | MVKCSVFTSI YTPFRSCFRR NGLFSLTNSL SLALRNKRTG APMPLKTDEL RTQALGPMPT PAELSHAHPI TDEVALRIAQ SRRQIESILT GEDDRLLVIV GPCSVHDTDA ALDYARRLAA LQENYTDELF VVMRTYFEKP RTVVGWKGLI TDPNLDGSYA LETGLNKARK LLLDVNKLGL ATATEFLDMI TGQYIADLIT WGAIGARTTE SQIHREMASA LSCPVGFKNG TNGNVKIAID AIRAAKASHY FYSPDKNGRM TVYRTSGNPF GHIILRGGDS GPNFDAASIN EACQQLAQFN LPERLVVDFS HANCQKQHRK QVDVARDICQ QIEAGSHKIA GIMAESFLVE GNQPMHDLNN LTYGLSITDP CLGWKDTATM LDMLAQSIKV RRSRH
|
| |