Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0802 |
Symbol | |
ID | 6971462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 824161 |
End bp | 825093 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384829 |
Product | allophanate hydrolase, subunit 2 |
Protein accession | YP_002269335 |
Protein GI | 209396411 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.316983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGA TTATTCGTGC GGGCATGTAT ACCACTGTGC AAGATGGCGG TCGTCACGGT TTTCGCCAGT CGGGTATCAG CCACTGCGGC GCACTGGATA TGCCTGCGTT ACGCATTGCT AACCTGCTGG TGGGTAATGA TGCCAATGCC CCCGCGCTGG AGATCACGCT CGGTCAGTTA ACGGTTGAGT TCGAAACTGA TGGGTGGTTT GCTCTGACGG GTGCCGGTTG CGAAGCGCGG CTGGATGATA ATGCCGTCTG GACCGGCTGG CGATTGCCGA TGAAAGCAGG CCAGCGTTTA ACGCTTAAAC GCCCGCAGCA CGGGATGCGC AGTTATCTGG CGGTCGCGGG TGGTATTGAT GTTCCGCCGG TAATGGGGTC ATGCAGCACC GATCTCAAAG TGGGGATTGG CGGGCTGGAA GGCCGTTTAC TGAAGGATGG TGACCGACTC CCGATTGGCA AAGCGAAGCG TGATTTTATG GAAGCGCAGG GCGTTAAACA GCTGCTGTGG GGCAACCGCA TTCGCGCCTT GCCGGGGCCG GAATATCATG AGTTCGATCG CGCCTCGCAG GATGCATTCT GGCGTTCGCC CTGGCAGCTT AGCTCGCAAA GTAACCGCAT GGGCTATCGC TTACAGGGGC AAATTTTAAA ACGCACCACC GATCGCGAAC TGTTATCTCA CGGTTTGTTA CCGGGCGTGG TGCAGGTGCC GCATAACGGG CAGCCCATTG TGTTGATGAA CGACGCACAG ACCACCGGTG GTTACCCGCG TATTGCCTGT ATCATTGAGG CTGATATGTA CCATCTGGCG CAAATTCCGC TCGGTCAGCC GATTCATTTT GTCCAGTGTT CACTGGAAGA GGCACTGAAA TCGCGGCAAG ATCAGCAACG TTATTTCGAA CAATTAGCGT GGCGGCTGCA CAATGAAAAT TGA
|
Protein sequence | MLKIIRAGMY TTVQDGGRHG FRQSGISHCG ALDMPALRIA NLLVGNDANA PALEITLGQL TVEFETDGWF ALTGAGCEAR LDDNAVWTGW RLPMKAGQRL TLKRPQHGMR SYLAVAGGID VPPVMGSCST DLKVGIGGLE GRLLKDGDRL PIGKAKRDFM EAQGVKQLLW GNRIRALPGP EYHEFDRASQ DAFWRSPWQL SSQSNRMGYR LQGQILKRTT DRELLSHGLL PGVVQVPHNG QPIVLMNDAQ TTGGYPRIAC IIEADMYHLA QIPLGQPIHF VQCSLEEALK SRQDQQRYFE QLAWRLHNEN
|
| |