Gene VC0395_A1114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A1114 
SymbolaroH 
ID5137428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp1173863 
End bp1175050 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content48% 
IMG OID640532572 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001217060 
Protein GI147675263 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTCAAAT GCTCGGTTTT TACTTCAATT TACACCCCTT TCCGCAGCTG TTTTCGGCGT 
AATGGTTTAT TCTCGTTGAC AAATTCGTTA TCTTTAGCGC TTCGCAATAA AAGGACAGGC
GCTCCTATGC CACTGAAAAC CGATGAATTA AGAACCCAAG CATTGGGACC TATGCCTACT
CCGGCAGAAT TAAGTCACGC ACATCCTATC ACTGATGAAG TTGCGCTACG GATCGCGCAG
TCTCGTCGCC AAATTGAAAG CATTCTTACC GGTGAAGATG ATCGCCTATT AGTGATAGTG
GGGCCTTGTT CGGTGCATGA TACCGATGCC GCACTCGACT ACGCTCGCCG CCTTGCTGCG
CTACAAGAAA ACTATACTGA TGAGCTTTTT GTGGTGATGC GGACCTATTT CGAAAAACCA
CGTACTGTGG TGGGTTGGAA AGGACTGATC ACCGATCCAA ACTTAGATGG CTCTTACGCT
TTAGAAACCG GTCTCAATAA AGCGCGAAAG TTGCTGCTTG ATGTAAACAA GCTCGGATTG
GCTACCGCGA CCGAGTTTCT TGATATGATC ACAGGCCAAT ACATCGCGGA CCTTATCACG
TGGGGCGCAA TTGGTGCGCG TACCACTGAG TCGCAAATTC ACCGTGAGAT GGCCTCTGCG
CTCTCCTGCC CTGTGGGTTT TAAAAATGGC ACTAACGGTA ATGTGAAAAT CGCGATTGAT
GCGATCCGCG CGGCCAAAGC GTCACATTAC TTCTATTCAC CAGATAAGAA TGGTCGTATG
ACGGTTTACC GTACCAGTGG TAACCCATTT GGTCATATTA TTCTGCGTGG TGGTGATAGC
GGACCAAACT TTGATGCGGC TTCGATTAAT GAAGCTTGCC AGCAGTTGGC GCAATTCAAC
TTACCAGAGC GTTTAGTGGT GGATTTCAGC CACGCGAACT GTCAAAAACA ACACCGTAAA
CAAGTGGATG TCGCGCGCGA TATTTGCCAG CAAATTGAAG CTGGCAGCCA CAAAATTGCG
GGCATCATGG CGGAAAGCTT CCTTGTGGAA GGCAATCAGC CAATGCACGA TCTCAATAAT
CTGACTTATG GTCTGTCGAT CACCGATCCT TGTTTAGGAT GGAAAGATAC CGCCACCATG
CTTGATATGC TGGCTCAATC GATCAAAGTC CGTCGTTCTC GTCATTAA
 
Protein sequence
MVKCSVFTSI YTPFRSCFRR NGLFSLTNSL SLALRNKRTG APMPLKTDEL RTQALGPMPT 
PAELSHAHPI TDEVALRIAQ SRRQIESILT GEDDRLLVIV GPCSVHDTDA ALDYARRLAA
LQENYTDELF VVMRTYFEKP RTVVGWKGLI TDPNLDGSYA LETGLNKARK LLLDVNKLGL
ATATEFLDMI TGQYIADLIT WGAIGARTTE SQIHREMASA LSCPVGFKNG TNGNVKIAID
AIRAAKASHY FYSPDKNGRM TVYRTSGNPF GHIILRGGDS GPNFDAASIN EACQQLAQFN
LPERLVVDFS HANCQKQHRK QVDVARDICQ QIEAGSHKIA GIMAESFLVE GNQPMHDLNN
LTYGLSITDP CLGWKDTATM LDMLAQSIKV RRSRH