Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1323 |
Symbol | ureC |
ID | 6967007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1335487 |
End bp | 1337190 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385307 |
Product | urease subunit alpha |
Protein accession | YP_002269802 |
Protein GI | 209396808 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAATA TTTCACGCCA GGCCTATGCT GACATGTTCG GCCCTACCAC CGGTGATAAA ATTCGTCTGG CAGACACTGA GCTGTGGATC GAGGTCGAAG ATGATTTAAC TACCTACGGC GAAGAGGTCA AATTCGGCGG CGGTAAAGTA ATCCGCGACG GTATGGGACA GGGGCAAATG CTCTCCGCCG GCTGCGCTGA TCTGGTGCTG ACCAATGCCC TGATCATCGA TTACTGGGGG ATCGTTAAAG CCGATATCGG CGTCAAAGAT GGAAGGATAT TTGCTATCGG CAAAGCCGGT AATCCTGATA TACAACCCAA CGTCACTATC CCAATCGGCG TATCCACGGA AATTATTGCC GCAGAAGGCA GGATCGTTAC CGCAGGTGGC GTCGATACGC ATATTCACTG GATCTGCCCA CAGCAGGCTG AAGAAGCGCT GACATCCGGC ATTACCACCA TGATCGGTGG CGGTACTGGC CCGACAGCGG GTTCTAACGC CACAACCTGT ACCCCAGGAC CATGGTACAT TTATCAAATG CTGCAGGCTG CAGACAGCCT GCCGGTCAAT ATCGGGTTGC TGGGTAAAGG CAATTGCTCC AATCCGGATG CGCTTCGTGA GCAGGTCGCG GCCGGGGTTA TCGGCCTCAA AATTCACGAA GACTGGGGAG CTACACCTGC GGTAATCAAC TGCGCACTGA CTGTAGCCGA CGAAATGGAC GTTCAGGTTG CGCTACACAG TGATACGCTT AACGAATCAG GATTCGTTGA GGATACTCTG ACTGCCATCG GCGGGCGCAC TATCCATACC TTCCATACAG AAGGTGCAGG TGGTGGTCAT GCTCCGGATA TTATCACCGC CTGCGCGCAC CCCAATATTC TGCCTTCCTC AACCAATCCG ACGCTACCCT ATACCGTCAA CACTATTGAT GAGCATCTGG ACATGCTGAT GGTTTGCCAT CATCTTGACC CGGATATCGC CGAGGACGTA GCCTTTGCCG AATCGCGCAT TCGCCAGGAA ACCATTGCCG CGGAAGACGT CCTGCACGAC CTTGGCGCGT TCTCCCTCAC CTCGTCCGAT TCGCAGGCCA TGGGACGCGT CGGAGAAGTA GTGTTACGAA CCTGGCAGGT GGCACACCGG ATGAAAGTTC AGCGCGGCCC GTTACCGGAA GAAAGTGGTG ATAACGACAA CGTCCGCGTG AAGCGCTATA TCGCTAAATA CACCATTAAT CCGGCATTAA CCCACGGTAT TGCTCATGAA GTCGGCTCGA TTGAAGTGGG AAAACTGGCG GATCTGGTGC TCTGGTCCCC GGCGTTCTTT GGCGTAAAAC CGGCGACTAT CGTCAAAGGC GGAATGATAG CCATGGCGCC GATGGGTGAT ATCAACGGCT CTATCCCCAC ACCGCAGCCG GTGCACTATC GCCCAATGTT CGCTGCATTG GGCAGTGCCC GTCACCGCTG TCGTGTGACT TTCCTGTCGC AGGCAGCAGC AGCAAATGGC GTCGCTGAAC AGCTTAACCT GCACAGCACA ACTGCTGTGG TAAAAGGCTG CCGCACAGTA CAAAAAGCCG ATATGCGCCA CAACAGCCTG TTGCCTGATA TAACCGTGGA TTCACAAACC TACGAAGTGC GTATCAACGG CGAACTGATA ACCAGTGAAC CGGCGGACAT TCTGCCAATG GCGCAACGTT ATTTCCTGTT TTAA
|
Protein sequence | MSNISRQAYA DMFGPTTGDK IRLADTELWI EVEDDLTTYG EEVKFGGGKV IRDGMGQGQM LSAGCADLVL TNALIIDYWG IVKADIGVKD GRIFAIGKAG NPDIQPNVTI PIGVSTEIIA AEGRIVTAGG VDTHIHWICP QQAEEALTSG ITTMIGGGTG PTAGSNATTC TPGPWYIYQM LQAADSLPVN IGLLGKGNCS NPDALREQVA AGVIGLKIHE DWGATPAVIN CALTVADEMD VQVALHSDTL NESGFVEDTL TAIGGRTIHT FHTEGAGGGH APDIITACAH PNILPSSTNP TLPYTVNTID EHLDMLMVCH HLDPDIAEDV AFAESRIRQE TIAAEDVLHD LGAFSLTSSD SQAMGRVGEV VLRTWQVAHR MKVQRGPLPE ESGDNDNVRV KRYIAKYTIN PALTHGIAHE VGSIEVGKLA DLVLWSPAFF GVKPATIVKG GMIAMAPMGD INGSIPTPQP VHYRPMFAAL GSARHRCRVT FLSQAAAANG VAEQLNLHST TAVVKGCRTV QKADMRHNSL LPDITVDSQT YEVRINGELI TSEPADILPM AQRYFLF
|
| |