Gene Information |
Locus tag | EcE24377A_3382 |
Symbol | rafD |
ID | 5588444 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 3403525 |
End bp | 3404955 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640927011 |
Product | raffinose invertase |
Protein accession | YP_001464381 |
Protein GI | 157155023 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | [TIGR01322] sucrose-6-phosphate hydrolase |
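The length fields in this table are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions (an assumption, since the record does not state the convention): the gene length is then End - Start + 1, and the protein length is the number of codons minus the terminal stop codon. A minimal sketch of that check, using the values from the table; the function names are illustrative and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp rows above.
length = gene_length_bp(3403525, 3404955)
print(length, protein_length_aa(length))  # 1431 476
```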
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGC GTCTTGCTTT GGCACAGTCT GCCCTTGAAA AACTTTGCGC ACGTCGTGGT AATGCCTGGT ACCCGATTTT TCATCTGGCT CCACCTGCCG GCTGGATGAA TGATCCAAAT GGCCTTATTT ACTTCAATGG GCGTTACCAT GCGTTCTTCC AGCATCATCC TGCAAGCGCA TATCAGGGGC CAATGCACTG GGGGCATGCC ACCAGTACCG ACATGTTGCA CTGGCAACAC GAACCTGTCG CGCTGGCACC CGGAGATAAA TATGATCGTG ATGGCTGTTT TTCAGGGAGT GCCGTGGATG ATGATGGCGT GCTATCACTT ATTTATACTG GTCATATTTG TCTCGATGAT CGTGGTAATG ACAGCATTAT CCGTGAAGTA CAGTGTCTGG CTACCAGTCA TGATGGTATT CACTTTGAGA AGCAGGGCTG TGTGCTGACA CCTCCGGAAG GTATAATGCA TTTCCGTGAT CCCAAAGTCT GGCACGAAGA CGGCTCCTGG TGGATGGTCA TTGGTGCCCG GGACGCTTCT GACAATGGGC AAGTTCTGTT GTATCGCGGG ACATCTTTGC GGGACTGGCA TCTGGAGCAT GTTCTTGCTC ATTCCGCAGC CGGAAAAAGT TATATGTGGG AATGCCCCGA TTTCTTCAGG TGTGGTAATT TTCACTGGCT GATGTTCTCA CCACAGGGGA TGCCCCCTTC CGGTTATCGG TTCCGTAACC TTTTTCAGAG CGGTGTGTTG GCAGGGAGCT GGAAGCCTGG TTCTGTCTTT GCGCTGAAAG GGAGATTTGA AGAGCTGGAT TATGGTCATG ACTTTTATGC TCCACAGTCC ATGCTGGCTG AGGACGGCAG GCGTATCATT ATGGCATGGA TGAATATGTG GGATTCACCC GTGCCCACCC GCAGTGAAGC CTGGGCAGGA TGTCTGACGC TGCCCAGAGA GGTTTTTGAG CGCGATGGCC GGCTGTGCCA GCGACCTGTG CGTGAAGTCG AATCTCTGCG CAGAAAATGC CAGCCATTAT CCCCTGTAAG GTTACAGGGT TTGCAATTAC TGACCGAAAA TGTACAGGCC GCAGAATTAT TGGTGACGTG GCATACGGTT GACAGTCATG CGGAGCACTA TGGCGTCCGC CTTGGAGACG GTCTGCGACT TTATGTGGAT AATCAGGCCG GGCGACTGGT ACTGTGGCGC TATTACCCTG AGGAAGGGCT GGATGGTTAC CGCAGTGTTG AACTTCCTGA TACAGAATAT CTGACTCTTC GTATTTTCTT GGATCGTTCA TCTGTTGAAG TGTTTGTTAA CGATGGTGAG GCAACCTTAT CAAGTCGTAT TTATCCGCAA GCGGACTCGA GACAATTATC GTTATATGCC GCTCATGGCG ATGCGATATT AACTGATGGC ACTTTATGGA TGCTGACCTG A
|
Protein sequence | MKQRLALAQS ALEKLCARRG NAWYPIFHLA PPAGWMNDPN GLIYFNGRYH AFFQHHPASA YQGPMHWGHA TSTDMLHWQH EPVALAPGDK YDRDGCFSGS AVDDDGVLSL IYTGHICLDD RGNDSIIREV QCLATSHDGI HFEKQGCVLT PPEGIMHFRD PKVWHEDGSW WMVIGARDAS DNGQVLLYRG TSLRDWHLEH VLAHSAAGKS YMWECPDFFR CGNFHWLMFS PQGMPPSGYR FRNLFQSGVL AGSWKPGSVF ALKGRFEELD YGHDFYAPQS MLAEDGRRII MAWMNMWDSP VPTRSEAWAG CLTLPREVFE RDGRLCQRPV REVESLRRKC QPLSPVRLQG LQLLTENVQA AELLVTWHTV DSHAEHYGVR LGDGLRLYVD NQAGRLVLWR YYPEEGLDGY RSVELPDTEY LTLRIFLDRS SVEVFVNDGE ATLSSRIYPQ ADSRQLSLYA AHGDAILTDG TLWMLT
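Both sequences are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The following is a rough sketch, not the database's own method, of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes unambiguous bases (A, C, G, T only) and uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAAACAGC GTCTTGCTTT GGCACAGTCT"
print(translate(first_blocks))  # MKQRLALAQS
```

The printed residues match the first block of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein Length and GC content fields.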
|
| |