Gene ECH74115_1323 details


Gene Information

Locus tag: ECH74115_1323
Symbol: ureC
ID: 6967007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1335487
End bp: 1337190
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 54%
IMG OID: 643385307
Product: urease subunit alpha
Protein accession: YP_002269802
Protein GI: 209396808
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
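
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1335487
END_BP = 1337190
GENE_LENGTH_BP = 1704
PROTEIN_LENGTH_AA = 567


def span_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one for the stop codon (assumed included)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1704
print(expected_protein_length(GENE_LENGTH_BP))  # 567
```

Under those assumptions both values match the Gene Length and Protein Length fields.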


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAATA TTTCACGCCA GGCCTATGCT GACATGTTCG GCCCTACCAC CGGTGATAAA 
ATTCGTCTGG CAGACACTGA GCTGTGGATC GAGGTCGAAG ATGATTTAAC TACCTACGGC
GAAGAGGTCA AATTCGGCGG CGGTAAAGTA ATCCGCGACG GTATGGGACA GGGGCAAATG
CTCTCCGCCG GCTGCGCTGA TCTGGTGCTG ACCAATGCCC TGATCATCGA TTACTGGGGG
ATCGTTAAAG CCGATATCGG CGTCAAAGAT GGAAGGATAT TTGCTATCGG CAAAGCCGGT
AATCCTGATA TACAACCCAA CGTCACTATC CCAATCGGCG TATCCACGGA AATTATTGCC
GCAGAAGGCA GGATCGTTAC CGCAGGTGGC GTCGATACGC ATATTCACTG GATCTGCCCA
CAGCAGGCTG AAGAAGCGCT GACATCCGGC ATTACCACCA TGATCGGTGG CGGTACTGGC
CCGACAGCGG GTTCTAACGC CACAACCTGT ACCCCAGGAC CATGGTACAT TTATCAAATG
CTGCAGGCTG CAGACAGCCT GCCGGTCAAT ATCGGGTTGC TGGGTAAAGG CAATTGCTCC
AATCCGGATG CGCTTCGTGA GCAGGTCGCG GCCGGGGTTA TCGGCCTCAA AATTCACGAA
GACTGGGGAG CTACACCTGC GGTAATCAAC TGCGCACTGA CTGTAGCCGA CGAAATGGAC
GTTCAGGTTG CGCTACACAG TGATACGCTT AACGAATCAG GATTCGTTGA GGATACTCTG
ACTGCCATCG GCGGGCGCAC TATCCATACC TTCCATACAG AAGGTGCAGG TGGTGGTCAT
GCTCCGGATA TTATCACCGC CTGCGCGCAC CCCAATATTC TGCCTTCCTC AACCAATCCG
ACGCTACCCT ATACCGTCAA CACTATTGAT GAGCATCTGG ACATGCTGAT GGTTTGCCAT
CATCTTGACC CGGATATCGC CGAGGACGTA GCCTTTGCCG AATCGCGCAT TCGCCAGGAA
ACCATTGCCG CGGAAGACGT CCTGCACGAC CTTGGCGCGT TCTCCCTCAC CTCGTCCGAT
TCGCAGGCCA TGGGACGCGT CGGAGAAGTA GTGTTACGAA CCTGGCAGGT GGCACACCGG
ATGAAAGTTC AGCGCGGCCC GTTACCGGAA GAAAGTGGTG ATAACGACAA CGTCCGCGTG
AAGCGCTATA TCGCTAAATA CACCATTAAT CCGGCATTAA CCCACGGTAT TGCTCATGAA
GTCGGCTCGA TTGAAGTGGG AAAACTGGCG GATCTGGTGC TCTGGTCCCC GGCGTTCTTT
GGCGTAAAAC CGGCGACTAT CGTCAAAGGC GGAATGATAG CCATGGCGCC GATGGGTGAT
ATCAACGGCT CTATCCCCAC ACCGCAGCCG GTGCACTATC GCCCAATGTT CGCTGCATTG
GGCAGTGCCC GTCACCGCTG TCGTGTGACT TTCCTGTCGC AGGCAGCAGC AGCAAATGGC
GTCGCTGAAC AGCTTAACCT GCACAGCACA ACTGCTGTGG TAAAAGGCTG CCGCACAGTA
CAAAAAGCCG ATATGCGCCA CAACAGCCTG TTGCCTGATA TAACCGTGGA TTCACAAACC
TACGAAGTGC GTATCAACGG CGAACTGATA ACCAGTGAAC CGGCGGACAT TCTGCCAATG
GCGCAACGTT ATTTCCTGTT TTAA
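
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how a GC fraction such as the GC content field could be computed from a pasted block; the helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(block):
    """Fraction of G and C bases in a (possibly formatted) sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAGTAATA TTTCACGCCA GGCCTATGCT GACATGTTCG GCCCTACCAC CGGTGATAAA"
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.1%}")
```

Passing the whole gene sequence block instead of a single row gives the value for the full gene.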
 
Protein sequence
MSNISRQAYA DMFGPTTGDK IRLADTELWI EVEDDLTTYG EEVKFGGGKV IRDGMGQGQM 
LSAGCADLVL TNALIIDYWG IVKADIGVKD GRIFAIGKAG NPDIQPNVTI PIGVSTEIIA
AEGRIVTAGG VDTHIHWICP QQAEEALTSG ITTMIGGGTG PTAGSNATTC TPGPWYIYQM
LQAADSLPVN IGLLGKGNCS NPDALREQVA AGVIGLKIHE DWGATPAVIN CALTVADEMD
VQVALHSDTL NESGFVEDTL TAIGGRTIHT FHTEGAGGGH APDIITACAH PNILPSSTNP
TLPYTVNTID EHLDMLMVCH HLDPDIAEDV AFAESRIRQE TIAAEDVLHD LGAFSLTSSD
SQAMGRVGEV VLRTWQVAHR MKVQRGPLPE ESGDNDNVRV KRYIAKYTIN PALTHGIAHE
VGSIEVGKLA DLVLWSPAFF GVKPATIVKG GMIAMAPMGD INGSIPTPQP VHYRPMFAAL
GSARHRCRVT FLSQAAAANG VAEQLNLHST TAVVKGCRTV QKADMRHNSL LPDITVDSQT
YEVRINGELI TSEPADILPM AQRYFLF
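
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a hedged illustration, not the database's own procedure: it builds the codon table from the NCBI amino acid string, which translation table 11 shares with the standard code, and it ignores the alternative start codons that table 11 allows (this gene begins with ATG, so that does not matter here). The function and variable names are hypothetical.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G at each codon position).
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code; only the permitted start codons differ.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(block):
    """Translate a formatted DNA block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above.
print(translate("ATGAGTAATA TTTCACGCCA GGCCTATGCT GACATGTTCG GCCCTACCAC CGGTGATAAA"))
# MSNISRQAYADMFGPTTGDK
print(translate("GCGCAACGTT ATTTCCTGTT TTAA"))
# AQRYFLF
```

The two outputs correspond to the first 20 and last 7 residues of the protein sequence listed above.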