Gene ECH74115_0861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0861 
SymbolgalT 
ID6970055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp877216 
End bp878262 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content55% 
IMG OID643384886 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_002269386 
Protein GI209397834 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.14597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.922241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAT TTAATCCCGT TGATCATCCA CATCGCCGCT ACAACCAGCT CACCGGGCAA 
TGGATTCTGG TTTCACCGCA CCGCGCTAAG CGCCCCTGGC AGGGGGCGCA GGAAACGCCA
GCCAAACAGG TGTTACCTGC GCACGATCCA GATTGCTTCC TCTGCGCAGG TAATGTGCGG
GTGACAGGCG ATAAAAACCC CGATTACACC GGGACTTACG TTTTCACTAA TGACTTTGCG
GCCTTGATGT CTGACACGCC AGATGCGCCA GAAAGCAACG ATCCTTTGAT GCGTTGCCAG
AGCGCGCGCG GTACCAGCCG GGTGATCTGC TTTTCACCGG ATCACAGTAA AACGCTGCCA
GAACTGAGCG TTGCGGCATT GACGGAAATC GTCAAAACCT GGCAGGAGCA AACCGCAGAA
CTGGGGAAAA CATACCCGTG GGTGCAGGTC TTTGAAAACA AAGGTGCGGC GATGGGCTGC
TCTAACCCGC ATCCGCACGG ACAGATTTGG GCAAATAGCT TCCTGCCTAA CGAAGCTGAG
CGCGAAGACC GCCTGCAAAA AGAATATTTT GCCGAGCAGA AATCGCCAAT GCTGGTGGAT
TATGTTCAGC GCGAGCTGGC AGACGGTAGC CGTACCGTTG TCGAAACCGA ACACTGGTTA
GCAGTTGTGC CTTACTGGGC TGCCTGGCCG TTCGAAACGC TACTTCTGCC CAAAGCCCAC
GTTTTGCGGA TCACCGATTT GACCGACGCC CAGCGCAGCG ATTTGGCACT GGCGTTGAAA
AAGCTGACCA GTCGTTATGA CAACCTCTTC CAGTGCTCCT TCCCCTACTC TATGGGCTGG
CACGGCGCAC CGTTTAATGG CGAAGAGAAT CAACACTGGC AGCTGCACGC GCACTTTTAT
CCGCCTCTGT TGCGCTCCGC CACCGTACGT AAATTTATGG TTGGTTATGA AATGCTGGCA
GAAACCCAGC GAGACCTGAC CGCAGAACAG GCAGCAGAGC GTTTGCGCGC AGTCAGCGAT
ATCCATTTTC GCGAATCCGG AGTGTAA
 
Protein sequence
MTQFNPVDHP HRRYNQLTGQ WILVSPHRAK RPWQGAQETP AKQVLPAHDP DCFLCAGNVR 
VTGDKNPDYT GTYVFTNDFA ALMSDTPDAP ESNDPLMRCQ SARGTSRVIC FSPDHSKTLP
ELSVAALTEI VKTWQEQTAE LGKTYPWVQV FENKGAAMGC SNPHPHGQIW ANSFLPNEAE
REDRLQKEYF AEQKSPMLVD YVQRELADGS RTVVETEHWL AVVPYWAAWP FETLLLPKAH
VLRITDLTDA QRSDLALALK KLTSRYDNLF QCSFPYSMGW HGAPFNGEEN QHWQLHAHFY
PPLLRSATVR KFMVGYEMLA ETQRDLTAEQ AAERLRAVSD IHFRESGV