Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0991 |
Symbol | |
ID | 6969011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1007393 |
End bp | 1008508 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385006 |
Product | glucose / sorbosone dehydrogenase protein |
Protein accession | YP_002269506 |
Protein GI | 209399419 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.657976 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCGAC AATTCTTTTT CCTTGTGCCC CTTATTTGTC TTTCTTCCGC TCTCTGGGCG GCTCCTGCAA CGGTAAATGT CGAAGTACTG CAAGACAAAC TCGACCATCC CTGGGCACTG GCCTTTTTAC CCGATAACCA CGGTATGTTA ATCACTCTGC GCGGCGGCGA GTTGCGTCAC TGGCAAGCAG GAAAAGGATT ATCTGCGCCG CTTTCCGGAG TTCCGGACGT TTGGGCGCAC GGGCAGGGCG GCCTGCTGGA CGTGGTTTTA GCGCCTGATT TTGCTCAGTC TCGCCGCATC TGGTTAAGTT ATTCCGAAGT TGGCGATGAT GGCAAAGCCG GAACTGCTGT GGGGTATGGC CGCTTAAGTG ATGATCTCTC AAAAGTGACC GACTTTCGCA CCGTCTTTCG CCAGATGCCA AAACTGTCTA CCGGCAACCA TTTTGGCGGG CGGCTGGTAT TCGACGGTAA AGGTTATCTT TTTATTGCTC TGGGCGAAAA CAATCAGCGC CCGACGGCGC AGGATCTGGA TAAATTACAG GGCAAACTGG TGCGTCTGAC CGACCAGGGC GAAATCCCGG ATGATAATCC TTTTATAAAG GAATCCGGTG CGCGCGCCGA GATCTGGTCT TATGGCATTC GTAATCCGCA AGGAATGGCG ATGAATCCGT GGAGTAATGC ACTGTGGCTG AATGAACATG GCCCGCGCGG TGGTGATGAA ATTAATATCC CGCAAAAAGG CAAAAACTAC GGCTGGCCGC TGGCAACCTG GGGAATCAAC TATTCAGGCT TTAAGATACC GGAAGCGAAA GGGGAGATCG TCGCCGGGAC CGAGCAACCT GTTTTTTACT GGAAAGATTC GCCCGCCGTG AGCGGCATGG CCTTCTATAA CAGCGATAAA TTCCCCCAGT GGCAGCAAAA ATTATTTATT GGCGCGCTGA AAGATAAAGA TGTCATTGTG ATGAGCGTCA ACGGCGACAA AGTAACAGAA GATGGCCGTA TTTTAACGGA CAGAGGGCAG CGAATTCGTG ATGTTCGCAC TGGACCCGAC GGTTATTTAT ACGTTCTCAC CGACGAGTCC AGTGGGGAAT TACTTAAAGT TAGCCCACGC AATTAG
|
Protein sequence | MHRQFFFLVP LICLSSALWA APATVNVEVL QDKLDHPWAL AFLPDNHGML ITLRGGELRH WQAGKGLSAP LSGVPDVWAH GQGGLLDVVL APDFAQSRRI WLSYSEVGDD GKAGTAVGYG RLSDDLSKVT DFRTVFRQMP KLSTGNHFGG RLVFDGKGYL FIALGENNQR PTAQDLDKLQ GKLVRLTDQG EIPDDNPFIK ESGARAEIWS YGIRNPQGMA MNPWSNALWL NEHGPRGGDE INIPQKGKNY GWPLATWGIN YSGFKIPEAK GEIVAGTEQP VFYWKDSPAV SGMAFYNSDK FPQWQQKLFI GALKDKDVIV MSVNGDKVTE DGRILTDRGQ RIRDVRTGPD GYLYVLTDES SGELLKVSPR N
|
| |