Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A1334 |
Symbol | aroA |
ID | 5136275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 1423637 |
End bp | 1424917 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640532792 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001217278 |
Protein GI | 147673856 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00000221059 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGCT TGACTCTACA ACCGATTGAA CTCATCTCTG GGGAAGTGAA TCTTCCCGGT TCCAAAAGCG TTTCAAACCG TGCGCTCTTG CTGGCTGCGC TAGCCTCAGG CACGACTCGT CTTACTAACT TGCTCGATAG CGACGATATT CGCCATATGC TCAATGCTTT GACCAAGCTG GGTGTGAACT ATCGCCTCTC GGCCGATAAA ACCACCTGTG AAGTAGAAGG TTTGGGCCAA GCCTTTCACA CGACTCAGCC ATTAGAGCTG TTTCTAGGTA ACGCAGGTAC TGCAATGCGT CCGCTGGCGG CGGCGTTGTG TCTTGGACAA GGCGACTATG TACTGACTGG CGAACCGCGC ATGAAAGAGC GCCCGATTGG CCACTTAGTG GATGCTCTTC GTCAAGCCAG CGCACAGATT GAGTATCTGG AGCAGGAAAA CTTTCCTCCA CTGCGTATTC AAGGGACGGG CTTACAAGCA GGAACGGTGA CTATCGATGG TTCTATCTCT AGTCAGTTTT TGACCGCCTT TCTTATGTCG GCACCGTTGG CGCAGGGCAA AGTGACCATC AAGATCGTCG GTGAGCTGGT TTCTAAGCCT TACATCGACA TTACACTGCA CATCATGGAG CAGTTTGGTG TTCAGGTGAT CAACCACGAT TATCAAGAAT TTGTGATCCC AGCGGGGCAA TCTTATGTGT CTCCGGGGCA GTTCCTCGTC GAAGGTGATG CCTCTTCTGC TTCCTATTTC CTTGCTGCGG CTGCCATTAA AGGCGGTGAG GTAAAAGTGA CCGGTATTGG TAAAAACAGC ATCCAAGGGG ATATTCAATT TGCGGATGCA TTAGAAAAGA TGGGCGCGCA AATTGAGTGG GGCGATGATT ATGTGATTGC TCGCCGTGGT GAACTGAATG CGGTGGATCT CGATTTTAAC CATATCCCAG ATGCGGCGAT GACGATTGCG ACGACGGCAC TTTTTGCCAA AGGTACCACG GCCATTCGTA ACGTTTACAA CTGGCGTGTA AAAGAGACGG ATCGCTTGGC AGCAATGGCC ACCGAACTGC GTAAAGTGGG CGCGACAGTC GAAGAGGGGG AAGATTTCAT TGTGATTACG CCTCCAACTA AGCTCATCCA TGCGGCAATC GATACCTATG ACGATCACCG GATGGCGATG TGTTTTTCTC TGGTTGCGTT GAGCGATACA CCAGTGACGA TCAATGACCC GAAATGCACG TCAAAAACGT TCCCCGATTA CTTTGATAAG TTTGCGCAAT TAAGCCGCTA A
|
Protein sequence | MESLTLQPIE LISGEVNLPG SKSVSNRALL LAALASGTTR LTNLLDSDDI RHMLNALTKL GVNYRLSADK TTCEVEGLGQ AFHTTQPLEL FLGNAGTAMR PLAAALCLGQ GDYVLTGEPR MKERPIGHLV DALRQASAQI EYLEQENFPP LRIQGTGLQA GTVTIDGSIS SQFLTAFLMS APLAQGKVTI KIVGELVSKP YIDITLHIME QFGVQVINHD YQEFVIPAGQ SYVSPGQFLV EGDASSASYF LAAAAIKGGE VKVTGIGKNS IQGDIQFADA LEKMGAQIEW GDDYVIARRG ELNAVDLDFN HIPDAAMTIA TTALFAKGTT AIRNVYNWRV KETDRLAAMA TELRKVGATV EEGEDFIVIT PPTKLIHAAI DTYDDHRMAM CFSLVALSDT PVTINDPKCT SKTFPDYFDK FAQLSR
|
| |