Gene EcHS_A1944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1944 
Symbolzwf 
ID5594034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1953816 
End bp1955291 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content52% 
IMG OID640921089 
Productglucose-6-phosphate 1-dehydrogenase 
Protein accessionYP_001458638 
Protein GI157161320 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0364] Glucose-6-phosphate 1-dehydrogenase 
TIGRFAM ID[TIGR00871] glucose-6-phosphate 1-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTAA CGCAAACAGC CCAGGCCTGT GACCTGGTCA TTTTCGGCGC GAAAGGCGAC 
CTTGCGCGTC GTAAATTGCT GCCTTCCCTG TATCAACTGG AAAAAGCCGG TCAGCTCAAC
CCGGACACCC GGATTATCGG CGTAGGGCGT GCTGACTGGG ATAAAGCGGC ATATACCAAA
GTTGTCCGCG AGGCGCTCGA AACTTTCATG AAAGAAACCA TTGATGAAGG TTTATGGGAC
ACCCTGAGTG CACGTCTGGA TTTTTGTAAT CTCGATGTCA ATGACACTGC TGCATTCAGC
CGTCTCGGCG CGATGCTGGA TCAAAAAAAT CGTATCACCA TTAACTACTT TGCCATGCCG
CCCAGCACTT TTGGCGCAAT TTGCAAAGGG CTTGGCGAGG CAAAACTGAA TGCTAAACCG
GCACGCGTAG TCATGGAGAA ACCGCTGGGG ACGTCGCTGG CGACCTCGCA GGAAATCAAT
GATCAGGTTG GCGAATACTT CGAGGAGTGC CAGGTTTACC GTATCGACCA CTATCTTGGT
AAAGAAACGG TGCTGAACCT GTTGGCGCTG CGTTTTGCTA ACTCCCTGTT TGTGAATAAC
TGGGACAATC GCACCATTGA TCATGTTGAG ATTACCGTGG CAGAAGAAGT GGGGATCGAA
GGGCGCTGGG GCTATTTTGA TAAAGCCGGT CAGATGCGCG ACATGATCCA GAACCACCTG
CTGCAAATTC TTTGCATGAT TGCGATGTCT CCGCCGTCTG ACCTGAGCGC AGACAGCATC
CGCGATGAAA AAGTGAAAGT ACTGAAGTCT CTGCGCCGCA TCGACCGCTC CAACGTACGC
GAAAAAACCG TACGCGGGCA ATATACTGCG GGCTTCGCCC AGGGCAAAAA AGTGCCGGGA
TATCTGGAAG AAGAGGGCGC GAACAAGAGC AGCAATACAG AAACTTTCGT GGCGATCCGC
GTCGACATTG ATAACTGGCG CTGGGCCGGT GTGCCATTCT ACCTGCGTAC TGGTAAACGT
CTGCCGACCA AATGTTCTGA AGTCGTGGTC TATTTCAAAA CACCTGAACT GAATCTGTTT
AAAGAATCGT GGCAGGATCT GCCGCAGAAT AAACTGACTA TCCGTCTGCA ACCTGATGAA
GGCGTGGATA TCCAGGTACT GAATAAAGTT CCTGGCCTTG ACCACAAACA TAACCTGCAA
ATCACCAAGC TAGATCTGAG CTATTCAGAA ACCTTTAATC AGACGCATCT GGCGGATGCC
TATGAACGTT TGCTGCTGGA AACCATGCGT GGTATTCAGG CACTGTTTGT ACGTCGCGAC
GAAGTGGAAG AAGCCTGGAA ATGGGTAGAC TCCATTACTG AGGCGTGGGC GATGGACAAT
GATGCGCCGA AACCGTATCA GGCCGGAACC TGGGGACCCG TTGCCTCGGT GGCGATGATT
ACCCGTGATG GTCGTTCCTG GAATGAGTTT GAGTAA
 
Protein sequence
MAVTQTAQAC DLVIFGAKGD LARRKLLPSL YQLEKAGQLN PDTRIIGVGR ADWDKAAYTK 
VVREALETFM KETIDEGLWD TLSARLDFCN LDVNDTAAFS RLGAMLDQKN RITINYFAMP
PSTFGAICKG LGEAKLNAKP ARVVMEKPLG TSLATSQEIN DQVGEYFEEC QVYRIDHYLG
KETVLNLLAL RFANSLFVNN WDNRTIDHVE ITVAEEVGIE GRWGYFDKAG QMRDMIQNHL
LQILCMIAMS PPSDLSADSI RDEKVKVLKS LRRIDRSNVR EKTVRGQYTA GFAQGKKVPG
YLEEEGANKS SNTETFVAIR VDIDNWRWAG VPFYLRTGKR LPTKCSEVVV YFKTPELNLF
KESWQDLPQN KLTIRLQPDE GVDIQVLNKV PGLDHKHNLQ ITKLDLSYSE TFNQTHLADA
YERLLLETMR GIQALFVRRD EVEEAWKWVD SITEAWAMDN DAPKPYQAGT WGPVASVAMI
TRDGRSWNEF E