Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1756 |
Symbol | gapC |
ID | 6143907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1760214 |
End bp | 1761215 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616632 |
Product | glyceraldehyde-3-phosphate dehydrogenase (phosphorylating) |
Protein accession | YP_001743810 |
Protein GI | 170680304 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase |
TIGRFAM ID | [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000000000557721 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAAAG TTGGTATTAA CGGTTTTGGT CGTATCGGTC GACTGGTGTT GCGTCGATTA CTTGAAGTCA AAAGCAACAT AGACATTGTT GCTATTAATG ATCTCACTTC CCCAAAAATT CTCGCCTACC TGTTGAAACA TGATTCAAAC TACGGACCGT TCCCCTGGAG CGTTGATTTT ACGGAAGATT CACTTATCGT TGATGGAAAA AGTATCGCGG TTTACGCCGA AAAAGAGGCT AAAAATATTC CATGGAAAGC GAAAGGCGCA GAAATCATTG TCGAATGTAC TGGCTTTTAT ACCTCCGCCG AGAAATCGCA GGCGCACCTT GATGCTGGTG CGAAGAAGGT GTTGATTTCC GCCCCTGCCG GTGAAATGAA AACCATCGTT TATAACGTCA ATGACGACAC ACTGGATGGC AACGACACCA TTGTTTCCGT GGCGTCATGC ACCACTAACT GTCTTGCACC GATGGCCAAA GCCTTGCACG ACAGTTTCGG AATAGAAGTC GGCACGATGA CGACCATTCA TGCCTATACC GGCACTCAGT CACTGGTGGA TGGTCCGCGA GGTAAAGATC TACGCGCTTC ACGTGCAGCG GCAGAAAATA TCATTCCCCA CACTACAGGT GCGGCAAAAG CCATTGGTCT GGTGATCCCG GAACTGAGCG GCAAACTGAA AGGTCATGCG CAACGCGTGC CGGTGAAAAC AGGTTCGGTC ACTGAGCTGG TGTCCATTCT CGGAAAAAAA GTGACTGCCG AAGAGGTGAA TAACGCACTT AAACAAGCAA CCACCAATAA CGAGTCATTT GGTTATACCG ATGAAGAAAT AGTCTCTTCC GATATCATTG GCAGCCATTT CGGTTCGGTG TTTGATGCCA CGCAAACGGA AATTACCGCC GTGGGCGATT TACAACTGGT GAAAACGGTC GCCTGGTACG ATAACGAATA TGGCTTCGTC ACACAGCTCA TTCGCACCCT CGAAAAATTC GCTAAACTCT GA
|
Protein sequence | MSKVGINGFG RIGRLVLRRL LEVKSNIDIV AINDLTSPKI LAYLLKHDSN YGPFPWSVDF TEDSLIVDGK SIAVYAEKEA KNIPWKAKGA EIIVECTGFY TSAEKSQAHL DAGAKKVLIS APAGEMKTIV YNVNDDTLDG NDTIVSVASC TTNCLAPMAK ALHDSFGIEV GTMTTIHAYT GTQSLVDGPR GKDLRASRAA AENIIPHTTG AAKAIGLVIP ELSGKLKGHA QRVPVKTGSV TELVSILGKK VTAEEVNNAL KQATTNNESF GYTDEEIVSS DIIGSHFGSV FDATQTEITA VGDLQLVKTV AWYDNEYGFV TQLIRTLEKF AKL
|
| |