Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5222 |
Symbol | rfbA1 |
ID | 6970774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4870110 |
End bp | 4870991 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643388887 |
Product | glucose-1-phosphate thymidylyltransferase |
Protein accession | YP_002273307 |
Protein GI | 209400406 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1209] dTDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01207] glucose-1-phosphate thymidylyltransferase, short form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.350144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGTA TTATCCTGGC GGGCGGTTCC GGTACCCGAT TGCATCCGAT TACGCGCGGC GTATCGAAGC AACTGTTGCC GATTTACGAT AAGCCAATGA TTTACTATCC GCTGTCGGTG CTGATGCTGG CCGGTATCCG CGAAATTCTC ATCATCACTA CGCCGGAAGA TAAAGGTTAT TTCCAGCGCC TGCTGGGCGA TGGTAGTGAG TTCGGTATCC AGCTGGAATA TGCCGAACAG CCCAGCCCGG ACGGTCTGGC GCAGGCCTTT ATCATCGGTG AAACCTTCCT TAATGGTGAA CCTTCTTGTC TGGTGCTGGG CGATAACATC TTCTTCGGTC AGGGCTTCAG TCCGAAGCTG CGTCATGTTG CGGCGCGCAC GGAAGTGGCG ACGGTTTTTG GCTATCAGGT GATGGACCCG GAACGCTTTG GCGTGGTGGA GTTTGACGAT AATTTCCGCG CTATCTCGCT GGAAGAAAAG CCAAAACAGC CGAAATCAAA CTGGGCGGTG ACCGGGCTTT ATTTCTACGA CAGTAAAGTC GTGGAGTACG CAAAGCAGGT GAAGCCGTCG GAGCGTGGTG AACTGGAGAT TACCTCCATC AACCAGATGT ACCTCGAGGC GGGCAACCTG ACCGTTGAAC TGCTCGGGCG CGGATTTGCC TGGCTGGACA CTGGCACTCA CGACAGCCTG ATTGAAGCCA GCACCTTTGT ACAGACGGTG GAAAAACGCC AGGGCTTTAA GATTGCCTGC CTGGAAGAGA TCGCCTGGCG TAACGGCTGG CTCGATGACG AGGGTGTGAA GCGTGCTGCC AGTTCATTAG CGAAAACTGG CTACGGCCAA TATCTGCTGG AGTTACTTCG TGCCCGTCCG CGCCAGTATT GA
|
Protein sequence | MKGIILAGGS GTRLHPITRG VSKQLLPIYD KPMIYYPLSV LMLAGIREIL IITTPEDKGY FQRLLGDGSE FGIQLEYAEQ PSPDGLAQAF IIGETFLNGE PSCLVLGDNI FFGQGFSPKL RHVAARTEVA TVFGYQVMDP ERFGVVEFDD NFRAISLEEK PKQPKSNWAV TGLYFYDSKV VEYAKQVKPS ERGELEITSI NQMYLEAGNL TVELLGRGFA WLDTGTHDSL IEASTFVQTV EKRQGFKIAC LEEIAWRNGW LDDEGVKRAA SSLAKTGYGQ YLLELLRARP RQY
|
| |