Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_01222 |
Symbol | galU |
ID | 8115281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 1278655 |
End bp | 1279563 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644847473 |
Product | hypothetical protein |
Protein accession | YP_002999046 |
Protein GI | 251784742 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1210] UDP-glucose pyrophosphorylase |
TIGRFAM ID | [TIGR01099] UTP-glucose-1-phosphate uridylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.774265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCCA TTAATACGAA AGTCAAAAAA GCCGTTATCC CCGTTGCGGG ATTAGGAACC AGGATGTTGC CGGCGACGAA AGCCATCCCG AAAGAGATGC TGCCACTTGT CGATAAGCCA TTAATTCAAT ACGTCGTGAA TGAATGTATT GCGGCTGGCA TTACTGAAAT TGTGCTGGTT ACACACTCAT CTAAAAACTC TATTGAAAAC CACTTTGATA CCAGTTTTGA ACTGGAAGCA ATGCTGGAAA AACGTGTAAA ACGTCAACTG CTTGATGAAG TGCAGTCTAT TTGTCCGCCG CACGTGACTA TTATGCAAGT TCGTCAGGGG CTGGCGAAAG GTCTGGGACA CGCGGTATTG TGTGCTCACC CGGTAGTGGG TGATGAACCT GTGGCTGTTA TTTTGCCTGA CGTTATTCTG GATGAATATG AATCCGATTT GTCACAGGAT AACCTGGCAG AGATGATCCG CCGCTTTGAT GAAACGGGTC ATAGCCAGAT CATGGTTGAA CCGGTTGCTG ATGTGACCGC ATATGGCGTT GTGGATTGCA AAGGCGTTGA ATTAGCGCCG GGTGAAAGCG TACCGATGGT TGGCGTGGTT GAAAAACCAA AAGCGGATGT TGCGCCGTCT AATCTCGCTA TTGTGGGTCG TTACGTACTT AGTGCGGATA TTTGGCCGTT GCTGGCAAAA ACCCCTCCGG GAGCTGGTGA TGAAATTCAG CTCACCGACG CAATTGATAT GCTAATCGAA AAAGAAACGG TTGAAGCCTA TCATATGAAA GGGAAGAGCC ATGACTGCGG TAATAAATTA GGTTACATGC AGGCCTTCGT TGAATACGGT ATTCGTCATA ACACCCTTGG CACGGAATTT AAAGCCTGGC TTGAAGAAGA GATGGGCATT AAGAAGTAA
|
Protein sequence | MAAINTKVKK AVIPVAGLGT RMLPATKAIP KEMLPLVDKP LIQYVVNECI AAGITEIVLV THSSKNSIEN HFDTSFELEA MLEKRVKRQL LDEVQSICPP HVTIMQVRQG LAKGLGHAVL CAHPVVGDEP VAVILPDVIL DEYESDLSQD NLAEMIRRFD ETGHSQIMVE PVADVTAYGV VDCKGVELAP GESVPMVGVV EKPKADVAPS NLAIVGRYVL SADIWPLLAK TPPGAGDEIQ LTDAIDMLIE KETVEAYHMK GKSHDCGNKL GYMQAFVEYG IRHNTLGTEF KAWLEEEMGI KK
|
| |