Gene Information |
Locus tag | SbBS512_E3352 |
Symbol | epd |
ID | 6271921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3121980 |
End bp | 3122999 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641727246 |
Product | erythrose 4-phosphate dehydrogenase |
Protein accession | YP_001881696 |
Protein GI | 187730160 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01532] D-erythrose-4-phosphate dehydrogenase; [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
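The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the "Start bp" and "End bp" fields of this record.
start_bp = 3121980
end_bp = 3122999


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                  # 1020, matching "Gene Length"
print(protein_length(length))  # 339, matching "Protein Length"
```

Because the strand is "-", the gene lies on the reverse strand of the replicon; the length calculation is the same for either strand.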
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.652674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACCGTAC GCGTAGCGAT AAATGGCTTC GGTCGCATCG GGCGTAATGT GGTTCGTGCT TTGTATGAAT CCGGACGCCG GGCGGAAATT ACCGTGGTGG CAATCAACGA ACTGGCGGAT GCTGCGGGCA TGGCGCATTT GTTGAAATAT GACACCAGCC ATGGCCGTTT TGCATGGGAA GTACGACAGG AACGCGATCA ACTTTTTGTT GGTGATGACG CCATCCGCGT ATTGCATGAA CGTTCACTGC AATCGCTCCC CTGGCGTGAA CTTGGCGTTG ATGTAGTCCT CGACTGCACC GGCGTATATG GCTCCCGCGA GCATGGCGAA GCGCATATTG CCGCCGGGGC CAAAAAAGTG CTCTTTTCAC ATCCTGGCAG TAACGATCTC GACGCGACCG TTGTTTACGG CGTCAATCAG GATCAACTTC GTGCGGAACA CCGCATCGTT TCTAACGCTT CCTGTACCAC GAATTGCATA ATTCCCGTCA TCAAATTGTT AGATGATGCG TACGGTATTG AGTCCGGCAC TGTGACCACA ATTCACTCCG CCATGCACGA TCAACAGGTT ATTGATGCAT ACCATCCTGA CCTGCGTCGC ACCCGGGCAG CCAGCCAGTC GATCATTCCG GTCGATACTA AACTGGCCGC CGGTATCACA CGATTTTTTC CGCAATTTAA CGATCGCTTT GAAGCGATTG CGGTACGTGT GCCAACCATA AATGTGACGG CAATCGATTT AAGCGTGACG GTGAAGAAAC CTGTAAAAGC CAATGAAGTC AACCTGTTGC TGCAAAAAGC AGCACAAGGT GCATTTCATG GTATAGTTGA CTATACGGAA TTGCCGTTGG TCTCTGTAGA TTTTAACCAC GATCCGCACA GTGCCATTGT CGATGGCACC CAAACCCGGG TCAGTGGCGC ACACCTGATC AAAACGTTGG TCTGGTGCGA TAACGAATGG GGCTTTGCTA ACCGAATGCT CGACACGACG TTAGCTATGG CTACTGTTGC TTTCAGGTAA
Protein sequence | MTVRVAINGF GRIGRNVVRA LYESGRRAEI TVVAINELAD AAGMAHLLKY DTSHGRFAWE VRQERDQLFV GDDAIRVLHE RSLQSLPWRE LGVDVVLDCT GVYGSREHGE AHIAAGAKKV LFSHPGSNDL DATVVYGVNQ DQLRAEHRIV SNASCTTNCI IPVIKLLDDA YGIESGTVTT IHSAMHDQQV IDAYHPDLRR TRAASQSIIP VDTKLAAGIT RFFPQFNDRF EAIAVRVPTI NVTAIDLSVT VKKPVKANEV NLLLQKAAQG AFHGIVDYTE LPLVSVDFNH DPHSAIVDGT QTRVSGAHLI KTLVWCDNEW GFANRMLDTT LAMATVAFR
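The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below shows one way to do that and to compute GC content; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three blocks of the gene sequence shown in this record.
prefix = clean_sequence("ATGACCGTAC GCGTAGCGAT AAATGGCTTC")
print(len(prefix))         # 30
print(gc_content(prefix))  # 0.5
```

Applying the same two functions to the complete gene sequence gives a value that can be compared with the "GC content" field of the record.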