Gene Information |
Locus tag | EcSMS35_0781 |
Symbol | galT |
ID | 6143889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 781993 |
End bp | 783039 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615669 |
Product | galactose-1-phosphate uridylyltransferase |
Protein accession | YP_001742861 |
Protein GI | 170680663 |
COG category | [C] Energy production and conversion |
COG ID | [COG1085] Galactose-1-phosphate uridylyltransferase |
TIGRFAM ID | [TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGCAAT TTAATCCCGT TGATCATCCA CATCGCCGCT ACAACCCGCT CACCGGGCAA TGGATTCTGG TTTCACCGCA CCGCGCTAAG CGCCCCTGGC AGGGGGCGCA GGAAACGCCA GCCAAACAGG TGTTACCTGC GCACGATCCA GATTGCTTCC TCTGCGCAGG TAATGTGCGG GTGACAGGCG ATAAAAACCC CGATTACACC GGGACTTACG TTTTCACTAA TGACTTTGCG GCCTTGATGT CTGACACGCC AGATGCGCCA GAAAGCAACG ATCCGCTAAT GCGTTGCCAG AGCGCGCGCG GTACCAGCCG GGTGATCTGC TTTTCACCGG ATCACAGTAA AACGTTGCCA GAACTGAGCG TTGCGGCATT GACGGAAATC GTCAAAACCT GGCAGGAGCA AACCGCAGAG CTGGGAAAAA CGTACCCGTG GGTACAGGTC TTTGAAAACA AAGGCGCGGC GATGGGCTGC TCTAACCCGC ATCCGCACGG ACAGATTTGG GCAAATAGCT TCCTGCCTAA CGAAGCTGAG CGCGAAGACC GCCTGCAAAA AGAATATTTC GCCGGGCAGA AATCACCAAT GCTGGTGGAT TATGTTCAGC GCGAGCTGGC AGACGGTAGC CGTACCGTTG TCGAAACCGA ACACTGGTTA GCCGTTGTAC CTTACTGGGC TGCCTGGCCG TTCGAAACGC TACTGCTGCC CAAAGCCCAC GTTTTGTGGA TCACCGATTT GACCGACGCC CAGCGCAGCG ATTTGGCACT GGCGTTGAAA AAGCTGACCA GTCGTTATGA CAACCTCTTC CAGTGCTCCT TCCCCTACTC TATGGGCTGG CACGGCGCGC CATTTAATGG CGAAGAGAAT CAACACTGGC AGCTGCACGC GCACTTTTAT CCGCCTCTGT TGCGCTCCGC CACCGTACGT AAATTTATGG TTGGTTATGA AATGCTGGCA GAAACCCAGC GAGACCTGAC CGCAGAACAG GCAGCAGAGC GTTTGCGCGC AGTCAGCGAT ATCCATTTTC GCGAATCCGG AGTGTAA
|
Protein sequence | MTQFNPVDHP HRRYNPLTGQ WILVSPHRAK RPWQGAQETP AKQVLPAHDP DCFLCAGNVR VTGDKNPDYT GTYVFTNDFA ALMSDTPDAP ESNDPLMRCQ SARGTSRVIC FSPDHSKTLP ELSVAALTEI VKTWQEQTAE LGKTYPWVQV FENKGAAMGC SNPHPHGQIW ANSFLPNEAE REDRLQKEYF AGQKSPMLVD YVQRELADGS RTVVETEHWL AVVPYWAAWP FETLLLPKAH VLWITDLTDA QRSDLALALK KLTSRYDNLF QCSFPYSMGW HGAPFNGEEN QHWQLHAHFY PPLLRSATVR KFMVGYEMLA ETQRDLTAEQ AAERLRAVSD IHFRESGV
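
The coordinate, length, and GC fields in this record can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the database itself: the function names are hypothetical, and it assumes that Start bp and End bp are inclusive 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Values taken from the Gene Information table above.
    length = gene_length(781993, 783039)
    print(length)                  # compare with "Gene Length"
    print(protein_length(length))  # compare with "Protein Length"
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content, which the record rounds to a whole percent.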