Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4293 |
Symbol | rhaD |
ID | 6145695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4396244 |
End bp | 4397068 |
Gene Length | 825 bp |
Protein Length | 274 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619114 |
Product | rhamnulose-1-phosphate aldolase |
Protein accession | YP_001746238 |
Protein GI | 170679635 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0235] Ribulose-5-phosphate 4-epimerase and related epimerases and aldolases |
TIGRFAM ID | [TIGR02624] rhamnulose-1-phosphate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.200304 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAACA TTACCCAGTC CTGGTTTGTC CAGGGAATGA TCAAAGCCAC CACCGACGCC TGGCTGAAAG GCTGGGACGA GCGCAATGGT GGTAACCTGA CGCTGCGCCT GGATGACGCC GATATCGCAC CGTATCACGA CAACTTTCAT GCACAACCGC GCTATATCCC GCTCAGCCAG CCCATGCCTT TACTGGCAAA TACACCGTTT ATTGTCACCG GCTCCGGCAA ATTCTTCCGT AACGTCCAGC TTGATCCTGC GGCTAACTTA GGCGTCGTAA AAGTCGACAG CGACGGCGCG GGCTACCATA TTCTCTGGGG GTTAACCAAC GAAGCCGTCC CGACATCCGA ACTTCCGGCG CACTTCCTCT CTCATTGCGA ACGCATTAAA GCCACCAACG GTAAAGATCG GGTGATCATG CACTGCCATG CCACCAACCT GATCGCCCTT ACCTACGTAT TGGAAAACGA CACCGCGGTC TTCACTCGCC AGCTTTGGGA AGGCAGCACC GAGTGTCTGG TAGTGTTCCC GGATGGCGTC GGCATTTTGC CGTGGATGGT GCCCGGCACC GACGAAATCG GCCAGGCGAC CGCACAGGAA ATGCAAAAAC ATTCGCTGGT GTTGTGGCCC TTCCACGGCG TCTTCGGCAG CGGTCCGACA CTGGATGAAA CATTCGGTTT AATCGACACC GCAGAAAAAT CAGCACAAGT ATTAGTGAAG GTTTATTCGA TGGGCGGCAT GAAACAGACC ATCAGTCGTG AAGAGCTGAT AGCACTCGGC CAGCGTTTCG GCGTTACACC ACTTGCCAGT GCGCTGGCGC TGTAA
|
Protein sequence | MQNITQSWFV QGMIKATTDA WLKGWDERNG GNLTLRLDDA DIAPYHDNFH AQPRYIPLSQ PMPLLANTPF IVTGSGKFFR NVQLDPAANL GVVKVDSDGA GYHILWGLTN EAVPTSELPA HFLSHCERIK ATNGKDRVIM HCHATNLIAL TYVLENDTAV FTRQLWEGST ECLVVFPDGV GILPWMVPGT DEIGQATAQE MQKHSLVLWP FHGVFGSGPT LDETFGLIDT AEKSAQVLVK VYSMGGMKQT ISREELIALG QRFGVTPLAS ALAL
|
| |