Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1582 |
Symbol | uidA |
ID | 6144065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1566599 |
End bp | 1568410 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616459 |
Product | beta-D-glucuronidase |
Protein accession | YP_001743637 |
Protein GI | 170683754 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACGTC CTGTAGAAAC CCCAACCCGT GAAATCAAAA AACTCGACGG CCTGTGGGCA TTCAGTCTGG ATCGCGAAAA CTGTGGAATT GATCAGCGTT GGTGGGAAAG CGCGTTACAA GAAAGCCGGG CAATTGCTGT GCCGGGCAGT TTTAACGATC AGTTCGCCGA TGCAGATATT CGTAATTATG TGGGCAACGT CTGGTATCAG CGCGAAGTCT TTATACCGAA AGGTTGGGCA GGCCAGCGTA TCGTGCTGCG TTTCGATGCG GTCACTCATT ACGGCAAAGT GTGGGTAAAT AATCAGGAAG TGATGGAGCA TCAGGGCGGC TATACGCCAT TTGAAGCCGA TGTCACGCCG TATGTTATTG CCGGGAAAAG TGTTCGTATC ACCGTTTGTG TGAACAATGA ACTGAACTGG CAGACTATCC CGCCGGGAAT GGTGATTACC GATGAAAACG GCAAAAAAAA GCAGTCTTAC TTCCATGATT TCTTTAACTA TGCCGGGATC CATCGCAGCG TAATGCTCTA CATCACGCCG AACACCTGGG TGGACGATAT CACCGTGGTG ACGCATGTCG CGCAAGACTG TAACCACGCG TCTGTTGACT GGCAGGTGGT AGCAAATGGT GATGTCAGCG TTGAACTGCG TGATGCGGAT CAACAGGTGG TTGCAACTGG ACAAGGCACC AGCGGGACTT TGCAAGTGGT GAATCCGCAC CTCTGGCAAC CAGGTGAAGG TTATCTCTAT GAACTGTGCG TCACAGCTAA AAGCCAGACA GAGTGTGATA TCTACCCGCT GCGCGTCGGT ATCCGGTCAG TGGCAGTGAA GGGCGAACAG TTCCTGATCA ACCACAAACC GTTCTACTTT ACTGGCTTTG GTCGTCATGA AGATGCGGAT TTGCGCGGCA AAGGATTCGA TAACGTGCTG ATGGTGCACG ATCACGCATT AATGGACTGG ATTGGGGCCA ACTCCTACCG TACCTCGCAT TACCCTTACG CTGAAGAGAT GCTCGACTGG GCAGATGAAC ATGGCATCGT GGTGATTGAT GAAACTGCAG CTGTCGGCTT TAACCTCTCT TTAGGCATTG GTTTCGAAGC GGGCAACAAG CCGAAAGAAC TGTACAGCGA AGACGCAGTC AACGGGGAAA CCCAGCAGGC GCACTTACAG GCGATTGAAG AGCTGATTGC GCGTGACAAA AACCACCCAA GCGTGGTGAT GTGGAGTATT GCCAACGAAC CGGATACCCG TCCGCAAGGT GCACGGGAAT ATTTCGCGCC ACTGGCGGAA GCAACGCGTA AACTCGACCC GACGCGTCCG ATCACCTGTG TCAATGTAAT GTTCTGCGAC GCTCACACCG ATACCATCAG CGATCTCTTT GATGTGCTGT GCCTGAACCG TTATTACGGT TGGTATGTCC AAAGCGGCGA TTTGGAAACG GCAGAGAAGG TTCTGGAAAA AGAACTGCTG GCCTGGCAGG AGAAACTGCA TCAGCCGATT ATCATCACCG AATACGGCGT GGATACGTTA GCCGGGCTGC ACTCAATGTA CACCGACATG TGGAGTGAAG AGTATCAGTG TGCATGGCTG GATATGTATC ACCGCGTCTT TGATCGCGTC AGCGCCGTCG TCGGTGAACA GGTATGGAAT TTTGCCGATT TTGCGACCTC GCAAGGCATA TTGCGCGTTG GCGGTAACAA GAAAGGGATC TTCACCCGCG ACCGCAAACC GAAGTCGGCG GCTTTTCTGC TGCAAAAACG CTGGACTGGC ATGAACTTCG GTGAAAAACC GCAGCAGGGA GGCAAACAAT GA
|
Protein sequence | MLRPVETPTR EIKKLDGLWA FSLDRENCGI DQRWWESALQ ESRAIAVPGS FNDQFADADI RNYVGNVWYQ REVFIPKGWA GQRIVLRFDA VTHYGKVWVN NQEVMEHQGG YTPFEADVTP YVIAGKSVRI TVCVNNELNW QTIPPGMVIT DENGKKKQSY FHDFFNYAGI HRSVMLYITP NTWVDDITVV THVAQDCNHA SVDWQVVANG DVSVELRDAD QQVVATGQGT SGTLQVVNPH LWQPGEGYLY ELCVTAKSQT ECDIYPLRVG IRSVAVKGEQ FLINHKPFYF TGFGRHEDAD LRGKGFDNVL MVHDHALMDW IGANSYRTSH YPYAEEMLDW ADEHGIVVID ETAAVGFNLS LGIGFEAGNK PKELYSEDAV NGETQQAHLQ AIEELIARDK NHPSVVMWSI ANEPDTRPQG AREYFAPLAE ATRKLDPTRP ITCVNVMFCD AHTDTISDLF DVLCLNRYYG WYVQSGDLET AEKVLEKELL AWQEKLHQPI IITEYGVDTL AGLHSMYTDM WSEEYQCAWL DMYHRVFDRV SAVVGEQVWN FADFATSQGI LRVGGNKKGI FTRDRKPKSA AFLLQKRWTG MNFGEKPQQG GKQ
|
| |