Gene VC0395_A0029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0029 
Symbolepd 
ID5136919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp25031 
End bp26056 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content46% 
IMG OID640531489 
Producterythrose 4-phosphate dehydrogenase 
Protein accessionYP_001216003 
Protein GI229259769 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01532] D-erythrose-4-phosphate dehydrogenase
[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.000119037 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGGG TAGCAATTAA CGGCTTTGGG CGCATTGGGC GTAATGTGTT ACGCGCTGTG 
TATGAAAGTG GTAAACGTGA CCGCATTCAA GTGGTTGCGG TGAACGAGCT TGCCAAGCCA
GATGCGATGG CACACCTCCT GCAGTACGAT ACGAGCCACG GTCGCTTTGG CAAAAAAATC
AGCCATGATC AGCAGCATAT CTATGTTCAC CATCAAAATG GTGAGTATGA CTCAATCCGC
ATTCTGCATC TTTCGGAAAT TCCTCTCTTA CCTTGGCGTG ATTTAGGCGT GGATTTAGTC
CTCGACTGCA CAGGTGTGTA TGGTTGCCAA GAGGATGGTC AGCAACATAT TGATGCGGGC
GCGAAGTTAG TTTTATTTTC CCATCCTGGT GCTAGCGATC TCGACAACAC CATCATCTAT
GGCGTGAATC ATGAGACGCT CACTGCTGAG CATAAAATCG TCTCTAACGG TTCTTGTACC
ACCAACTGCA TTGTGCCCAT CATCAAGGTA TTGGATGACG CATTCGGCAT TGATTCTGGC
ACCATCACCA CCATCCATTC CTCGATGAAC GATCAACAAG TGATTGATGC GTATCATAAT
GACTTACGAC GCACACGAGC GGCCAGTCAA TCTATCATTC CGGTTGATAC TAAGTTGCAT
AAAGGTATTG AAAGAATATT TCCGAAATTT TCTAACAAAT TTGAAGCGAT TTCGGTGCGT
GTACCAACAG TAAATGTGAC CGCGATGGAT TTAAGTGTCA CAATTAAATC AAATGTGAAA
GTTAATGACG TAAATCAAAC CATTGTGAAC GCATCTCAGT GTACATTACG TGGCATTGTT
GACTATACTG AAGCGCCTTT GGTTTCCATC GATTTTAACC ACGATCCGCA CAGCGCGATT
GTGGATGGGA CGCAAACGAG AGTCAGTAAC GGGCAGTTAG TAAAAATGCT CGTTTGGTGT
GATAACGAAT GGGGCTTTGC CAACCGGATG TTAGATACCG CATTGGCAAT GCAGGCCACG
CAGTAG
 
Protein sequence
MLRVAINGFG RIGRNVLRAV YESGKRDRIQ VVAVNELAKP DAMAHLLQYD TSHGRFGKKI 
SHDQQHIYVH HQNGEYDSIR ILHLSEIPLL PWRDLGVDLV LDCTGVYGCQ EDGQQHIDAG
AKLVLFSHPG ASDLDNTIIY GVNHETLTAE HKIVSNGSCT TNCIVPIIKV LDDAFGIDSG
TITTIHSSMN DQQVIDAYHN DLRRTRAASQ SIIPVDTKLH KGIERIFPKF SNKFEAISVR
VPTVNVTAMD LSVTIKSNVK VNDVNQTIVN ASQCTLRGIV DYTEAPLVSI DFNHDPHSAI
VDGTQTRVSN GQLVKMLVWC DNEWGFANRM LDTALAMQAT Q