Gene EcSMS35_1335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1335 
Symbolzwf 
ID6143997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1323791 
End bp1325266 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content51% 
IMG OID641616213 
Productglucose-6-phosphate 1-dehydrogenase 
Protein accessionYP_001743393 
Protein GI170683501 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0364] Glucose-6-phosphate 1-dehydrogenase 
TIGRFAM ID[TIGR00871] glucose-6-phosphate 1-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.606999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTAA CGCAAACAGC CCAGGCCTGT GACCTGGTCA TTTTCGGCGC GAAAGGCGAC 
CTTGCGCGTC GTAAATTGCT GCCTTCCCTG TATCAACTGG AAAAAGCCGG TCAGCTCAAC
CCGGACACCC GGATTATCGG CGTAGGGCGT GCTGACTGGG ATAAAGCGGC TTATACCAAA
GTTGTCCGCG AGGCGCTCGA AACTTTCATG AAAGAAACGA TTGATGAAGG TTTATGGGAC
ACCCTGAGTG CACGTCTGGA TTTTTGTAAT CTTGATGTCA ATGACACTGC TGCATTCAGC
CGTCTCGGCG CGATGCTGGA TCAAAAAAAT CGTATCACCA TTAACTACTT TGCCATGCCG
CCCAGCACTT TTGGCGCAAT TTGCAAAGGG CTTGGTGAGG CAAAACTGAA TGCTAAACCG
GCACGCGTAG TCATGGAGAA ACCGCTGGGG ACGTCGCTGG CGACCTCGCA GGAAATCAAC
GATCAGGTTG GCGAATACTT CGAGGAGTGC CAGGTTTACC GTATCGACCA CTATCTTGGT
AAAGAAACAG TGCTGAACCT GTTGGCGCTG CGTTTTGCTA ACTCCCTGTT TGTGAATAAC
TGGGACAATC GCACCATTGA TCATGTTGAG ATTACCGTGG CAGAAGAAGT GGGGATCGAA
GGGCGCTGGG GCTATTTTGA TAAAGCCGGT CAGATGCGCG ATATGATCCA AAACCACCTG
CTGCAAATTC TTTGCATGAT TGCGATGTCT CCGCCGTCTG ACCTGAGTGC AGACAGCATC
CGCGATGAAA AAGTCAAAGT ACTGAAGTCT CTGCGCCGCA TTGACCGCTC CAACGTGCGC
GAAAAAACCG TACGCGGGCA ATATACTGCG GGCTTCGCCC AGGGCAAAAA AGTGCCGGGA
TATCTGGAAG AAGAGGGCGC GAACAAGAGC AGCAATACAG AAACCTTCGT GGCGATCCGC
GTCGACATTG ATAACTGGCG CTGGGCCGGT GTGCCATTCT ACCTGCGTAC TGGTAAACGT
CTGCCGACCA AATGTTCTGA AGTCGTGGTC TATTTCAAAA CACCTGAACT GAATCTGTTT
AAAGAGTCGT GGCAGGATCT GCCGCAGAAT AAACTGACTA TCCGTCTGCA ACCTGATGAA
GGCGTGGATA TCCAGGTACT AAATAAAGTT CCTGGCCTTG ACCATAAACA TAACCTGCAA
ATCACCAAGC TGGATCTGAG CTATTCAGAA ACCTTTAATC AAACGCATCT GGCAGATGCC
TATGAACGTC TGCTGCTGGA AACCATGCGT GGTATTCAGG CACTGTTTGT ACGTCGTGAT
GAAGTGGAAG AAGCCTGGAA ATGGGTAGAC TCCATTACTG AGGCGTGGGC GATGGACAAT
GATGCGCCGA AACCGTATCA GGCCGGAACC TGGGGACCCG TTGCCTCAGT GGCGATGATT
ACCCGGGATG GTCGTTCCTG GAATGAGTTT GAGTAA
 
Protein sequence
MAVTQTAQAC DLVIFGAKGD LARRKLLPSL YQLEKAGQLN PDTRIIGVGR ADWDKAAYTK 
VVREALETFM KETIDEGLWD TLSARLDFCN LDVNDTAAFS RLGAMLDQKN RITINYFAMP
PSTFGAICKG LGEAKLNAKP ARVVMEKPLG TSLATSQEIN DQVGEYFEEC QVYRIDHYLG
KETVLNLLAL RFANSLFVNN WDNRTIDHVE ITVAEEVGIE GRWGYFDKAG QMRDMIQNHL
LQILCMIAMS PPSDLSADSI RDEKVKVLKS LRRIDRSNVR EKTVRGQYTA GFAQGKKVPG
YLEEEGANKS SNTETFVAIR VDIDNWRWAG VPFYLRTGKR LPTKCSEVVV YFKTPELNLF
KESWQDLPQN KLTIRLQPDE GVDIQVLNKV PGLDHKHNLQ ITKLDLSYSE TFNQTHLADA
YERLLLETMR GIQALFVRRD EVEEAWKWVD SITEAWAMDN DAPKPYQAGT WGPVASVAMI
TRDGRSWNEF E