Gene EcSMS35_1021 details


Gene Information

Locus tag: EcSMS35_1021
Symbol: galF
ID: 6143064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1042254
End bp: 1043147
Gene Length: 894 bp
Protein Length: 297 aa
Translation table: 11
GC content: 51%
IMG OID: 641615908
Product: UTP--glucose-1-phosphate uridylyltransferase subunit GalF
Protein accession: YP_001743100
Protein GI: 170680712
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1210] UDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01099] UTP-glucose-1-phosphate uridylyltransferase
            [TIGR01105] UTP-glucose-1-phosphate uridylyltransferase, non-catalytic GalF subunit
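
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it matches the values in this record) and a CDS that ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1042254, 1043147)
print(length)                     # 894
print(protein_length_aa(length))  # 297
```

Both results agree with the Gene Length and Protein Length fields of the record.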


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.108584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAATT TAAAAGCAGT TATACCGGTA GCAGGTCTTG GGATGCATAT GTTGCCTGCC 
ACTAAGGCGA TTCCCAAAGA GATGCTACCG ATCGTCGACA AGCCAATGAT TCAGTACATT
GTTGACGAGA TTGTGGCTGC AGGGATCAAA GAAATCCTCC TGGTAACTCA CGCGTCCAAG
AACGCAGTCG AAAACCACTT CGACACCTCT TATGAATTAG AATCTCTCCT TGAACAGCGC
GTGAAGCGTC AACTGCTGGC GGAAGTACAG TCTATCTGTC CGCCGGGCGT GACCATTATG
AACGTGCGTC AGGGCGAACC TTTAGGTTTA GGCCACTCCA TTTTGTGTGC ACGACCCGCC
ATTGGTGACA ACCCATTTGT CGTGGTATTG CCGGACGTTG TGATCGACGA CGCCAGTGCC
GACCCGCTGC GCTACAACCT TGCTGCCATG ATTGCACGTT TCAACGAAAC GGGCCGCAGC
CAGGTGCTGG CAAAACGTAT GCCGGGTGAC CTCTCTGAAT ACTCCGTCAT TCAGACTAAA
GAGCCGCTGG ATCGTGAAGG TAAAGTCAGC CGTATTGTTG AATTCATAGA AAAGCCGGAT
CAGCCGCAGA CGCTGGACTC AGACATCATG GCCGTTGGTC GATATGTGCT TTCTGCCGAT
ATTTGGCCGG AACTTGAACG CACTCAGCCT GGTGCATGGG GGCGTATTCA GCTGACTGAT
GCCATTGCCG AACTGGCGAA AAAACAGTCC GTTGATGCCA TGCTGATGAC TGGTGACAGC
TATGACTGCG GTAAAAAAAT GGGTTATATG CAGGCGTTTG TGAAGTATGG ACTGCGCAAC
CTCAAAGAAG GGGCGAAGTT CCGCAAAGGC ATTGAGAAAC TGTTAAGCGA ATAA
 
Protein sequence
MTNLKAVIPV AGLGMHMLPA TKAIPKEMLP IVDKPMIQYI VDEIVAAGIK EILLVTHASK 
NAVENHFDTS YELESLLEQR VKRQLLAEVQ SICPPGVTIM NVRQGEPLGL GHSILCARPA
IGDNPFVVVL PDVVIDDASA DPLRYNLAAM IARFNETGRS QVLAKRMPGD LSEYSVIQTK
EPLDREGKVS RIVEFIEKPD QPQTLDSDIM AVGRYVLSAD IWPELERTQP GAWGRIQLTD
AIAELAKKQS VDAMLMTGDS YDCGKKMGYM QAFVKYGLRN LKEGAKFRKG IEKLLSE
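
The sequence blocks above are printed in groups of 10 characters, so they need to be cleaned before use. As a minimal sketch (not the database's own tooling), the Python below strips the whitespace, computes GC content, and translates a nucleotide sequence with the codon assignments of translation table 11. The amino-acid string follows the NCBI layout for that table with bases ordered T, C, A, G. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq_block.split()).upper()


def gc_content(seq_block: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_block)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq_block: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq_block)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACGAATT TAAAAGCAGT TATACCGGTA GCAGGTCTTG GGATGCATAT GTTGCCTGCC"
print(translate(first_line))  # MTNLKAVIPVAGLGMHMLPA
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` allows the GC content and Protein Length fields to be checked the same way.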