Gene VC0395_A1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1334 
SymbolaroA 
ID5136275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1423637 
End bp1424917 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content51% 
IMG OID640532792 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001217278 
Protein GI147673856 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000221059 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGCT TGACTCTACA ACCGATTGAA CTCATCTCTG GGGAAGTGAA TCTTCCCGGT 
TCCAAAAGCG TTTCAAACCG TGCGCTCTTG CTGGCTGCGC TAGCCTCAGG CACGACTCGT
CTTACTAACT TGCTCGATAG CGACGATATT CGCCATATGC TCAATGCTTT GACCAAGCTG
GGTGTGAACT ATCGCCTCTC GGCCGATAAA ACCACCTGTG AAGTAGAAGG TTTGGGCCAA
GCCTTTCACA CGACTCAGCC ATTAGAGCTG TTTCTAGGTA ACGCAGGTAC TGCAATGCGT
CCGCTGGCGG CGGCGTTGTG TCTTGGACAA GGCGACTATG TACTGACTGG CGAACCGCGC
ATGAAAGAGC GCCCGATTGG CCACTTAGTG GATGCTCTTC GTCAAGCCAG CGCACAGATT
GAGTATCTGG AGCAGGAAAA CTTTCCTCCA CTGCGTATTC AAGGGACGGG CTTACAAGCA
GGAACGGTGA CTATCGATGG TTCTATCTCT AGTCAGTTTT TGACCGCCTT TCTTATGTCG
GCACCGTTGG CGCAGGGCAA AGTGACCATC AAGATCGTCG GTGAGCTGGT TTCTAAGCCT
TACATCGACA TTACACTGCA CATCATGGAG CAGTTTGGTG TTCAGGTGAT CAACCACGAT
TATCAAGAAT TTGTGATCCC AGCGGGGCAA TCTTATGTGT CTCCGGGGCA GTTCCTCGTC
GAAGGTGATG CCTCTTCTGC TTCCTATTTC CTTGCTGCGG CTGCCATTAA AGGCGGTGAG
GTAAAAGTGA CCGGTATTGG TAAAAACAGC ATCCAAGGGG ATATTCAATT TGCGGATGCA
TTAGAAAAGA TGGGCGCGCA AATTGAGTGG GGCGATGATT ATGTGATTGC TCGCCGTGGT
GAACTGAATG CGGTGGATCT CGATTTTAAC CATATCCCAG ATGCGGCGAT GACGATTGCG
ACGACGGCAC TTTTTGCCAA AGGTACCACG GCCATTCGTA ACGTTTACAA CTGGCGTGTA
AAAGAGACGG ATCGCTTGGC AGCAATGGCC ACCGAACTGC GTAAAGTGGG CGCGACAGTC
GAAGAGGGGG AAGATTTCAT TGTGATTACG CCTCCAACTA AGCTCATCCA TGCGGCAATC
GATACCTATG ACGATCACCG GATGGCGATG TGTTTTTCTC TGGTTGCGTT GAGCGATACA
CCAGTGACGA TCAATGACCC GAAATGCACG TCAAAAACGT TCCCCGATTA CTTTGATAAG
TTTGCGCAAT TAAGCCGCTA A
 
Protein sequence
MESLTLQPIE LISGEVNLPG SKSVSNRALL LAALASGTTR LTNLLDSDDI RHMLNALTKL 
GVNYRLSADK TTCEVEGLGQ AFHTTQPLEL FLGNAGTAMR PLAAALCLGQ GDYVLTGEPR
MKERPIGHLV DALRQASAQI EYLEQENFPP LRIQGTGLQA GTVTIDGSIS SQFLTAFLMS
APLAQGKVTI KIVGELVSKP YIDITLHIME QFGVQVINHD YQEFVIPAGQ SYVSPGQFLV
EGDASSASYF LAAAAIKGGE VKVTGIGKNS IQGDIQFADA LEKMGAQIEW GDDYVIARRG
ELNAVDLDFN HIPDAAMTIA TTALFAKGTT AIRNVYNWRV KETDRLAAMA TELRKVGATV
EEGEDFIVIT PPTKLIHAAI DTYDDHRMAM CFSLVALSDT PVTINDPKCT SKTFPDYFDK
FAQLSR