Gene EcolC_1780 details


Gene Information

Locus tag: EcolC_1780
Symbol:
ID: 6067322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1979600
End bp: 1981075
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 52%
IMG OID: 641601195
Product: glucose-6-phosphate 1-dehydrogenase
Protein accession: YP_001724757
Protein GI: 170019803
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0364] Glucose-6-phosphate 1-dehydrogenase
TIGRFAM ID: [TIGR00871] glucose-6-phosphate 1-dehydrogenase
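
The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1979600
END_BP = 1981075
GENE_LENGTH_BP = 1476
PROTEIN_LENGTH_AA = 491


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: the coordinate span gives 1476 bp, and 1476 / 3 = 492 codons, one of which is the stop codon, leaving 491 residues.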


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.093662
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTAA CGCAAACAGC CCAGGCCTGT GACCTGGTCA TTTTCGGCGC GAAAGGCGAC 
CTTGCGCGTC GTAAATTGCT GCCTTCCCTG TATCAACTGG AAAAAGCCGG TCAGCTCAAC
CCGGACACCC GGATTATCGG CGTAGGGCGT GCTGACTGGG ATAAAGCGGC ATATACCAAA
GTTGTCCGCG AGGCGCTCGA AACTTTCATG AAAGAAACCA TTGATGAAGG TTTATGGGAC
ACCCTGAGTG CACGTCTGGA TTTTTGTAAT CTCGATGTCA ATGACACTGC TGCATTCAGC
CGTCTCGGCG CGATGCTGGA TCAAAAAAAT CGTATCACCA TTAACTACTT TGCCATGCCG
CCCAGCACTT TTGGCGCAAT TTGCAAAGGG CTTGGCGAGG CAAAACTGAA TGCTAAACCG
GCACGCGTAG TCATGGAGAA ACCGCTGGGG ACGTCGCTGG CGACCTCGCA GGAAATCAAT
GATCAGGTTG GCGAATACTT CGAGGAGTGC CAGGTTTACC GTATCGACCA CTATCTTGGT
AAAGAAACGG TGCTGAACCT GTTGGCGCTG CGTTTTGCTA ACTCCCTGTT TGTGAATAAC
TGGGACAATC GCACCATTGA TCATGTTGAG ATTACCGTGG CAGAAGAAGT GGGGATCGAA
GGGCGCTGGG GCTATTTTGA TAAAGCCGGT CAGATGCGCG ACATGATCCA GAACCACCTG
CTGCAAATTC TTTGCATGAT TGCGATGTCT CCGCCGTCTG ACCTGAGCGC AGACAGCATC
CGCGATGAAA AAGTGAAAGT ACTGAAGTCT CTGCGCCGCA TCGACCGCTC CAACGTACGC
GAAAAAACCG TACGCGGGCA ATATACTGCG GGCTTCGCCC AGGGCAAAAA AGTGCCGGGA
TATCTGGAAG AAGAGGGCGC GAACAAGAGC AGCAATACAG AAACTTTCGT GGCGATCCGC
GTCGACATTG ATAACTGGCG CTGGGCCGGT GTGCCATTCT ACCTGCGTAC TGGTAAACGT
CTGCCGACCA AATGTTCTGA AGTCGTGGTC TATTTCAAAA CACCTGAACT GAATCTGTTT
AAAGAATCGT GGCAGGATCT GCCGCAGAAT AAACTGACTA TCCGTCTGCA ACCTGATGAA
GGCGTGGATA TCCAGGTACT GAATAAAGTT CCTGGCCTTG ACCACAAACA TAACCTGCAA
ATCACCAAGC TGGATCTGAG CTATTCAGAA ACCTTTAATC AGACGCATCT GGCGGATGCC
TATGAACGTT TGCTGCTGGA AACCATGCGT GGTATTCAGG CACTGTTTGT ACGTCGCGAC
GAAGTGGAAG AAGCCTGGAA ATGGGTAGAC TCCATTACTG AGGCGTGGGC GATGGACAAT
GATGCGCCGA AACCGTATCA GGCCGGAACC TGGGGACCCG TTGCCTCGGT GGCGATGATT
ACCCGTGATG GTCGTTCCTG GAATGAGTTT GAGTAA
 
Protein sequence
MAVTQTAQAC DLVIFGAKGD LARRKLLPSL YQLEKAGQLN PDTRIIGVGR ADWDKAAYTK 
VVREALETFM KETIDEGLWD TLSARLDFCN LDVNDTAAFS RLGAMLDQKN RITINYFAMP
PSTFGAICKG LGEAKLNAKP ARVVMEKPLG TSLATSQEIN DQVGEYFEEC QVYRIDHYLG
KETVLNLLAL RFANSLFVNN WDNRTIDHVE ITVAEEVGIE GRWGYFDKAG QMRDMIQNHL
LQILCMIAMS PPSDLSADSI RDEKVKVLKS LRRIDRSNVR EKTVRGQYTA GFAQGKKVPG
YLEEEGANKS SNTETFVAIR VDIDNWRWAG VPFYLRTGKR LPTKCSEVVV YFKTPELNLF
KESWQDLPQN KLTIRLQPDE GVDIQVLNKV PGLDHKHNLQ ITKLDLSYSE TFNQTHLADA
YERLLLETMR GIQALFVRRD EVEEAWKWVD SITEAWAMDN DAPKPYQAGT WGPVASVAMI
TRDGRSWNEF E
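
The relationship between the two sequences above can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the pipeline that produced this record: it builds the codon table by hand instead of using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Only the first 60 nucleotides are included to keep the example short; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence and first two blocks of the protein sequence.
GENE_PREFIX = "ATGGCGGTAA CGCAAACAGC CCAGGCCTGT GACCTGGTCA TTTTCGGCGC GAAAGGCGAC"
PROTEIN_PREFIX = "MAVTQTAQAC DLVIFGAKGD".replace(" ", "")

if __name__ == "__main__":
    print(translate(GENE_PREFIX))  # MAVTQTAQACDLVIFGAKGD
```

The same function could be applied to the full 1476 bp sequence after joining the rows; the spaces that separate the 10-character blocks are stripped before translation.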