Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1905 |
Symbol | galU |
ID | 6145308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1925916 |
End bp | 1926824 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641616781 |
Product | UTP--glucose-1-phosphate uridylyltransferase subunit GalU |
Protein accession | YP_001743959 |
Protein GI | 170681483 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1210] UDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01099] UTP-glucose-1-phosphate uridylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0432202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.235357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCA TTAATACGAA AGTCAAAAAA GCCGTTATCC CCGTTGCGGG ATTAGGAACC AGGATGTTGC CGGCGACGAA AGCCATCCCG AAAGAGATGC TGCCACTTGT CGATAAGCCA TTAATTCAAT ACGTCGTGAA TGAATGTATT GCGGCTGGCA TTACTGAAAT TGTGCTGGTT ACTCACTCAT CTAAAAACTC TATTGAAAAC CACTTTGATA CCAGTTTTGA ACTGGAAGCA ATGCTGGAAA AACGTGTAAA ACGTCAACTG CTTGATGAAG TGCAGTCTAT TTGTCCGCCG CACGTGACTA TTATGCAAGT TCGTCAGGGG CTGGCGAAAG GCCTGGGACA CGCGGTATTG TGTGCTCACC CGGTAGTGGG TGATGAACCT GTGGCTGTTA TTTTGCCTGA CGTTATTCTG GATGAATATG AATCCGATTT GTCACAGGAT AACCTGGCAG AGATGATCCG CCGCTTTGAT GAAACGGGTC ATAGCCAGAT CATGGTTGAA CCGGTTGCTG ATGTGACCGC ATATGGCGTT GTGGATTGCA AAGGCGTTGA ATTAGCGCCG GGTGAAAGCG TACCGATGGT TGGCGTGGTT GAAAAACCAA AAGCGGATGT TGCGCCGTCT AATCTCGCTA TTGTGGGTCG TTACGTTCTT AGCGCGGATA TTTGGCCGTT GCTGGCAAAA ACCCCTCCGG GTGCTGGTGA TGAAATTCAG CTCACCGACG CAATTGATAT GCTGATCGAA AAAGAAACGG TGGAAGCCTA TCATATGAAA GGGAAGAGCC ATGACTGCGG TAATAAATTA GGTTACATGC AGGCCTTCGT TGAATACGGT ATTCGTCATA ACACCCTTGG CTCGGAATTT AAAGCCTGGC TTGAAGAAGA GATGGGCATT AAGAAGTAA
|
Protein sequence | MAAINTKVKK AVIPVAGLGT RMLPATKAIP KEMLPLVDKP LIQYVVNECI AAGITEIVLV THSSKNSIEN HFDTSFELEA MLEKRVKRQL LDEVQSICPP HVTIMQVRQG LAKGLGHAVL CAHPVVGDEP VAVILPDVIL DEYESDLSQD NLAEMIRRFD ETGHSQIMVE PVADVTAYGV VDCKGVELAP GESVPMVGVV EKPKADVAPS NLAIVGRYVL SADIWPLLAK TPPGAGDEIQ LTDAIDMLIE KETVEAYHMK GKSHDCGNKL GYMQAFVEYG IRHNTLGSEF KAWLEEEMGI KK
|
| |