Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0226 |
Symbol | aroF |
ID | 5135273 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 234311 |
End bp | 235384 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640531686 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001216189 |
Protein GI | 147673285 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GTGAATTGAG TGATGTGAAT ATCAGTGATG AACAGATATT GATCACCCCT GATGCCTTAA AAGCCAAGAT TCCACTCAGT GACAAGGCGC GTAAGTTTAT TCGTGAGTCA CGTCAGACGG TGGCGGACAT TATCCACAAA CGCGATCCTC GCTTGCTGAT CGTCTGTGGT CCTTGCTCAA TTCATGATGT GGATGCGGCG AAAGAGTACG CCAAAAAACT CAAAGCACTC TCGGCGCAGC TGAGCGACCA GCTCTACATC GTAATGCGCG TGTACTTTGA AAAACCGCGT ACTACGGTCG GTTGGAAAGG GCTGATTAAC GATCCGCATC TTGATGGCAG TTTTGATATC GAACACGGTC TGCATGTTGG GCGTCAACTC TTGGTCGATT TGGCCGAAAT GGAAATTCCA CTGGCAACCG AAGCGTTGGA CCCGATTAGC CCGCAATACT TGGCGGATAC TTTTAGTTGG GCGGCGATTG GTGCGCGCAC CACGGAATCA CAAACTCACC GCGAGATGGC AAGTGGCCTT TCCATGCCAA TCGGCTTTAA AAATGGCACG GATGGTAACT TGGCAACGGC GATTAATGCC ATGCAGGCCG CTTCATCCAG TCACCGCTTT ATGGGAATTA ACCGCGAAGG ACAAGTGGCA CTGCTCACGA CACAAGGCAA CCCGAATGGG CATGTGATTT TACGTGGTGG CAAACAGACC AATTACGATT CGGTTTCTGT CGCGGAATGT GAAGAAGAGA TGCAAAAAGC GCGTTTGGAA CCCTCTTTGA TGGTGGATTG CAGCCATGCG AACTCACGTA AAGATTACCG CCGTCAGCCG TTGGTGGCAG AAGATGTGAT TCATCAAATC CGTGAAGGCA ATCGCTCGAT CATTGGCCTG ATGATTGAAA GCCATCTCAA TGAAGGAAAT CAGTCTTCGG GTCTGCCTCG TGAAAAGATG CAATACGGAG TCTCGATCAC TGACGGCTGT ATAAATTGGT CAACGACTGA GGCATTATTA CGTCGTGCGC ATCAGGAACT TATTCCCTTT TTGCACAATC GTTTGCAAGG ATAA
|
Protein sequence | MKKSELSDVN ISDEQILITP DALKAKIPLS DKARKFIRES RQTVADIIHK RDPRLLIVCG PCSIHDVDAA KEYAKKLKAL SAQLSDQLYI VMRVYFEKPR TTVGWKGLIN DPHLDGSFDI EHGLHVGRQL LVDLAEMEIP LATEALDPIS PQYLADTFSW AAIGARTTES QTHREMASGL SMPIGFKNGT DGNLATAINA MQAASSSHRF MGINREGQVA LLTTQGNPNG HVILRGGKQT NYDSVSVAEC EEEMQKARLE PSLMVDCSHA NSRKDYRRQP LVAEDVIHQI REGNRSIIGL MIESHLNEGN QSSGLPREKM QYGVSITDGC INWSTTEALL RRAHQELIPF LHNRLQG
|
| |