Gene Information |
Locus tag | VC0395_A2205 |
Symbol | aroB |
ID | 5135068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2353222 |
End bp | 2354322 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640533661 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001218121 |
Protein GI | 147673423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
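The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive (an assumption, though it is the reading that reproduces the listed 1101 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon contributes one residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(2353222, 2354322)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1101 366"
```

Both values match the Gene Length and Protein Length rows of the record.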
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00000150156 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAGTGCA AAACCATGGA GCGGATTACG GTCAACTTAG GTGAACGTAG CTACCCAATC TCAATCGGTG CCGGATTGTT TGCCAATCCG GCTCTTCTTT CCCTCTCTGC TAAGCAGAAG GTGGTGATAG TCACCAACCA CACTGTCGCT CCGCTGTATG CGCCTGCGAT CATTTCATTA CTTGATCACA TCGGTTGCCA GCATGCTTTG CTGGAACTGC CCGATGGCGA ACAGTACAAA ACTCTCGAAA CCTTCAACAC TGTGATGAGC TTTTTGCTTG AGCATAACTA CAGCCGTGAT GTGGTTGTCA TTGCTTTAGG TGGGGGAGTG ATTGGTGATC TGGTCGGTTT CGCGGCGGCT TGTTATCAGC GTGGCGTTGA TTTCATTCAA ATACCGACCA CACTGCTGTC GCAGGTGGAT TCTTCGGTCG GTGGGAAAAC CGCGGTCAAT CATCCGCTCG GTAAAAACAT GATTGGTGCT TTTTATCAAC CCAAAGCGGT AGTGATTGAT ACGGACTGTT TGACTACACT GCCCGCCCGT GAATTTGCCG CTGGCATGGC GGAAGTCATC AAGTACGGCA TCATTTACGA CTCAGCCTTC TTCGACTGGC TAGAAGCACA GATGGAGGCC TTGTACGCAT TGGACGAACA AGCGCTCACT TACGCAATTG CGCGCTGCTG CCAAATCAAA GCCGAGGTGG TCGCGCAAGA TGAGAAAGAG TCGGGCATTC GTGCGTTACT CAATCTTGGT CACACCTTCG GCCATGCAAT TGAAGCACAC ATGGGCTATG GCAATTGGCT GCACGGTGAA GCCGTATCTG CGGGTACAGT AATGGCGGCG AAAACGGCTC AATTACAAGG CCTTATCGAT GCATCTCAAT TTGAACGTAT CCTAGCTATA CTGAAAAAAG CGCATTTGCC TGTGCGAACC CCAGAGAACA TGACCTTTGC CGATTTCATG CAGCACATGA TGCGCGATAA AAAAGTGTTG GCAGGGGAAC TGCGTTTGGT GTTACCGACC AGTATCGGCA CGTCAGCGGT CGTAAAAGGG GTACCTGAAG CCGTGATTGC CCAAGCGATA GAGTATTGTC GCACGGTGTA A
Protein sequence | MECKTMERIT VNLGERSYPI SIGAGLFANP ALLSLSAKQK VVIVTNHTVA PLYAPAIISL LDHIGCQHAL LELPDGEQYK TLETFNTVMS FLLEHNYSRD VVVIALGGGV IGDLVGFAAA CYQRGVDFIQ IPTTLLSQVD SSVGGKTAVN HPLGKNMIGA FYQPKAVVID TDCLTTLPAR EFAAGMAEVI KYGIIYDSAF FDWLEAQMEA LYALDEQALT YAIARCCQIK AEVVAQDEKE SGIRALLNLG HTFGHAIEAH MGYGNWLHGE AVSAGTVMAA KTAQLQGLID ASQFERILAI LKKAHLPVRT PENMTFADFM QHMMRDKKVL AGELRLVLPT SIGTSAVVKG VPEAVIAQAI EYCRTV
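The relationship between the gene sequence and the protein sequence can be sketched as follows. The gene is on the minus strand, but the sequence as listed already reads from an initiation codon (TTG) to a stop codon (TAA), so the sketch translates it directly. It assumes translation table 11 uses the standard codon assignments plus the alternative initiation codons given in the NCBI genetic code listing, with an initiation codon in first position read as M. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 bases of the record are used here for brevity.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons for table 11 (assumed from the NCBI listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiation codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "TTGGAGTGCA AAACCATGGA GCGGATTACG"
print(translate(prefix))  # prints "MECKTMERIT"
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content row.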