Gene Information |
Locus tag | EcSMS35_1335 |
Symbol | zwf |
ID | 6143997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1323791 |
End bp | 1325266 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616213 |
Product | glucose-6-phosphate 1-dehydrogenase |
Protein accession | YP_001743393 |
Protein GI | 170683501 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0364] Glucose-6-phosphate 1-dehydrogenase |
TIGRFAM ID | [TIGR00871] glucose-6-phosphate 1-dehydrogenase |
|
|
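The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1323791
end_bp = 1325266

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1476 0 491
```

Under these assumptions the result agrees with the listed Gene Length (1476 bp) and Protein Length (491 aa).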
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.606999 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTAA CGCAAACAGC CCAGGCCTGT GACCTGGTCA TTTTCGGCGC GAAAGGCGAC CTTGCGCGTC GTAAATTGCT GCCTTCCCTG TATCAACTGG AAAAAGCCGG TCAGCTCAAC CCGGACACCC GGATTATCGG CGTAGGGCGT GCTGACTGGG ATAAAGCGGC TTATACCAAA GTTGTCCGCG AGGCGCTCGA AACTTTCATG AAAGAAACGA TTGATGAAGG TTTATGGGAC ACCCTGAGTG CACGTCTGGA TTTTTGTAAT CTTGATGTCA ATGACACTGC TGCATTCAGC CGTCTCGGCG CGATGCTGGA TCAAAAAAAT CGTATCACCA TTAACTACTT TGCCATGCCG CCCAGCACTT TTGGCGCAAT TTGCAAAGGG CTTGGTGAGG CAAAACTGAA TGCTAAACCG GCACGCGTAG TCATGGAGAA ACCGCTGGGG ACGTCGCTGG CGACCTCGCA GGAAATCAAC GATCAGGTTG GCGAATACTT CGAGGAGTGC CAGGTTTACC GTATCGACCA CTATCTTGGT AAAGAAACAG TGCTGAACCT GTTGGCGCTG CGTTTTGCTA ACTCCCTGTT TGTGAATAAC TGGGACAATC GCACCATTGA TCATGTTGAG ATTACCGTGG CAGAAGAAGT GGGGATCGAA GGGCGCTGGG GCTATTTTGA TAAAGCCGGT CAGATGCGCG ATATGATCCA AAACCACCTG CTGCAAATTC TTTGCATGAT TGCGATGTCT CCGCCGTCTG ACCTGAGTGC AGACAGCATC CGCGATGAAA AAGTCAAAGT ACTGAAGTCT CTGCGCCGCA TTGACCGCTC CAACGTGCGC GAAAAAACCG TACGCGGGCA ATATACTGCG GGCTTCGCCC AGGGCAAAAA AGTGCCGGGA TATCTGGAAG AAGAGGGCGC GAACAAGAGC AGCAATACAG AAACCTTCGT GGCGATCCGC GTCGACATTG ATAACTGGCG CTGGGCCGGT GTGCCATTCT ACCTGCGTAC TGGTAAACGT CTGCCGACCA AATGTTCTGA AGTCGTGGTC TATTTCAAAA CACCTGAACT GAATCTGTTT AAAGAGTCGT GGCAGGATCT GCCGCAGAAT AAACTGACTA TCCGTCTGCA ACCTGATGAA GGCGTGGATA TCCAGGTACT AAATAAAGTT CCTGGCCTTG ACCATAAACA TAACCTGCAA ATCACCAAGC TGGATCTGAG CTATTCAGAA ACCTTTAATC AAACGCATCT GGCAGATGCC TATGAACGTC TGCTGCTGGA AACCATGCGT GGTATTCAGG CACTGTTTGT ACGTCGTGAT GAAGTGGAAG AAGCCTGGAA ATGGGTAGAC TCCATTACTG AGGCGTGGGC GATGGACAAT GATGCGCCGA AACCGTATCA GGCCGGAACC TGGGGACCCG TTGCCTCAGT GGCGATGATT ACCCGGGATG GTCGTTCCTG GAATGAGTTT GAGTAA
|
Protein sequence | MAVTQTAQAC DLVIFGAKGD LARRKLLPSL YQLEKAGQLN PDTRIIGVGR ADWDKAAYTK VVREALETFM KETIDEGLWD TLSARLDFCN LDVNDTAAFS RLGAMLDQKN RITINYFAMP PSTFGAICKG LGEAKLNAKP ARVVMEKPLG TSLATSQEIN DQVGEYFEEC QVYRIDHYLG KETVLNLLAL RFANSLFVNN WDNRTIDHVE ITVAEEVGIE GRWGYFDKAG QMRDMIQNHL LQILCMIAMS PPSDLSADSI RDEKVKVLKS LRRIDRSNVR EKTVRGQYTA GFAQGKKVPG YLEEEGANKS SNTETFVAIR VDIDNWRWAG VPFYLRTGKR LPTKCSEVVV YFKTPELNLF KESWQDLPQN KLTIRLQPDE GVDIQVLNKV PGLDHKHNLQ ITKLDLSYSE TFNQTHLADA YERLLLETMR GIQALFVRRD EVEEAWKWVD SITEAWAMDN DAPKPYQAGT WGPVASVAMI TRDGRSWNEF E
|
| |
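The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the set of permitted start codons; this gene starts with ATG, so no special handling is attempted here). The function name `translate` and the table construction are illustrative and not part of the record.

```python
from itertools import product

# Standard codon table in NCBI ordering (bases T, C, A, G; third base
# varies fastest). Table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGGCGGTAA CGCAAACAGC CCAGGCCTGT"))  # MAVTQTAQAC
print(translate("GAGTTTGAGTAA"))                      # EFE
```

The two outputs match the first ten and last three residues of the listed protein sequence. For real work, an established library such as Biopython is a safer choice than a hand-built table.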