Gene Information |
Locus tag | EcSMS35_1021 |
Symbol | galF |
ID | 6143064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1042254 |
End bp | 1043147 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615908 |
Product | UTP--glucose-1-phosphate uridylyltransferase subunit GalF |
Protein accession | YP_001743100 |
Protein GI | 170680712 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1210] UDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01099] UTP-glucose-1-phosphate uridylyltransferase; [TIGR01105] UTP-glucose-1-phosphate uridylyltransferase, non-catalytic GalF subunit |
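The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the terminal stop codon is counted in the gene length but not in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1042254
end_bp = 1043147
reported_gene_length = 894      # bp
reported_protein_length = 297   # aa

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported 894 bp and 297 aa.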
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.108584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGAATT TAAAAGCAGT TATACCGGTA GCAGGTCTTG GGATGCATAT GTTGCCTGCC ACTAAGGCGA TTCCCAAAGA GATGCTACCG ATCGTCGACA AGCCAATGAT TCAGTACATT GTTGACGAGA TTGTGGCTGC AGGGATCAAA GAAATCCTCC TGGTAACTCA CGCGTCCAAG AACGCAGTCG AAAACCACTT CGACACCTCT TATGAATTAG AATCTCTCCT TGAACAGCGC GTGAAGCGTC AACTGCTGGC GGAAGTACAG TCTATCTGTC CGCCGGGCGT GACCATTATG AACGTGCGTC AGGGCGAACC TTTAGGTTTA GGCCACTCCA TTTTGTGTGC ACGACCCGCC ATTGGTGACA ACCCATTTGT CGTGGTATTG CCGGACGTTG TGATCGACGA CGCCAGTGCC GACCCGCTGC GCTACAACCT TGCTGCCATG ATTGCACGTT TCAACGAAAC GGGCCGCAGC CAGGTGCTGG CAAAACGTAT GCCGGGTGAC CTCTCTGAAT ACTCCGTCAT TCAGACTAAA GAGCCGCTGG ATCGTGAAGG TAAAGTCAGC CGTATTGTTG AATTCATAGA AAAGCCGGAT CAGCCGCAGA CGCTGGACTC AGACATCATG GCCGTTGGTC GATATGTGCT TTCTGCCGAT ATTTGGCCGG AACTTGAACG CACTCAGCCT GGTGCATGGG GGCGTATTCA GCTGACTGAT GCCATTGCCG AACTGGCGAA AAAACAGTCC GTTGATGCCA TGCTGATGAC TGGTGACAGC TATGACTGCG GTAAAAAAAT GGGTTATATG CAGGCGTTTG TGAAGTATGG ACTGCGCAAC CTCAAAGAAG GGGCGAAGTT CCGCAAAGGC ATTGAGAAAC TGTTAAGCGA ATAA
Protein sequence | MTNLKAVIPV AGLGMHMLPA TKAIPKEMLP IVDKPMIQYI VDEIVAAGIK EILLVTHASK NAVENHFDTS YELESLLEQR VKRQLLAEVQ SICPPGVTIM NVRQGEPLGL GHSILCARPA IGDNPFVVVL PDVVIDDASA DPLRYNLAAM IARFNETGRS QVLAKRMPGD LSEYSVIQTK EPLDREGKVS RIVEFIEKPD QPQTLDSDIM AVGRYVLSAD IWPELERTQP GAWGRIQLTD AIAELAKKQS VDAMLMTGDS YDCGKKMGYM QAFVKYGLRN LKEGAKFRKG IEKLLSE
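The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The names `CODON_TABLE`, `translate`, and `gene_prefix` are hypothetical helpers, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard code,
# which translation table 11 also uses ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces that split the sequence into 10-base blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
gene_prefix = "ATGACGAATT TAAAAGCAGT TATACCGGTA"
print(translate(gene_prefix))  # MTNLKAVIPV
```

The output matches the first ten residues of the protein sequence listed in the record. Passing the full gene sequence to `translate` would be the natural next step for checking the whole protein.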