Gene EcSMS35_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0781 
SymbolgalT 
ID6143889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp781993 
End bp783039 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content55% 
IMG OID641615669 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_001742861 
Protein GI170680663 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAT TTAATCCCGT TGATCATCCA CATCGCCGCT ACAACCCGCT CACCGGGCAA 
TGGATTCTGG TTTCACCGCA CCGCGCTAAG CGCCCCTGGC AGGGGGCGCA GGAAACGCCA
GCCAAACAGG TGTTACCTGC GCACGATCCA GATTGCTTCC TCTGCGCAGG TAATGTGCGG
GTGACAGGCG ATAAAAACCC CGATTACACC GGGACTTACG TTTTCACTAA TGACTTTGCG
GCCTTGATGT CTGACACGCC AGATGCGCCA GAAAGCAACG ATCCGCTAAT GCGTTGCCAG
AGCGCGCGCG GTACCAGCCG GGTGATCTGC TTTTCACCGG ATCACAGTAA AACGTTGCCA
GAACTGAGCG TTGCGGCATT GACGGAAATC GTCAAAACCT GGCAGGAGCA AACCGCAGAG
CTGGGAAAAA CGTACCCGTG GGTACAGGTC TTTGAAAACA AAGGCGCGGC GATGGGCTGC
TCTAACCCGC ATCCGCACGG ACAGATTTGG GCAAATAGCT TCCTGCCTAA CGAAGCTGAG
CGCGAAGACC GCCTGCAAAA AGAATATTTC GCCGGGCAGA AATCACCAAT GCTGGTGGAT
TATGTTCAGC GCGAGCTGGC AGACGGTAGC CGTACCGTTG TCGAAACCGA ACACTGGTTA
GCCGTTGTAC CTTACTGGGC TGCCTGGCCG TTCGAAACGC TACTGCTGCC CAAAGCCCAC
GTTTTGTGGA TCACCGATTT GACCGACGCC CAGCGCAGCG ATTTGGCACT GGCGTTGAAA
AAGCTGACCA GTCGTTATGA CAACCTCTTC CAGTGCTCCT TCCCCTACTC TATGGGCTGG
CACGGCGCGC CATTTAATGG CGAAGAGAAT CAACACTGGC AGCTGCACGC GCACTTTTAT
CCGCCTCTGT TGCGCTCCGC CACCGTACGT AAATTTATGG TTGGTTATGA AATGCTGGCA
GAAACCCAGC GAGACCTGAC CGCAGAACAG GCAGCAGAGC GTTTGCGCGC AGTCAGCGAT
ATCCATTTTC GCGAATCCGG AGTGTAA
 
Protein sequence
MTQFNPVDHP HRRYNPLTGQ WILVSPHRAK RPWQGAQETP AKQVLPAHDP DCFLCAGNVR 
VTGDKNPDYT GTYVFTNDFA ALMSDTPDAP ESNDPLMRCQ SARGTSRVIC FSPDHSKTLP
ELSVAALTEI VKTWQEQTAE LGKTYPWVQV FENKGAAMGC SNPHPHGQIW ANSFLPNEAE
REDRLQKEYF AGQKSPMLVD YVQRELADGS RTVVETEHWL AVVPYWAAWP FETLLLPKAH
VLWITDLTDA QRSDLALALK KLTSRYDNLF QCSFPYSMGW HGAPFNGEEN QHWQLHAHFY
PPLLRSATVR KFMVGYEMLA ETQRDLTAEQ AAERLRAVSD IHFRESGV