Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0617 |
Symbol | allC |
ID | 6966679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 639832 |
End bp | 641067 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384655 |
Product | allantoate amidohydrolase |
Protein accession | YP_002269169 |
Protein GI | 209397962 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family [TIGR03176] allantoate amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACAC ATTTTCGTCA AACCATAGAA GAAGCGCTGC CCTGGCTTTC CTCTTTTGGC GCTGACCCAG CGGGTGGGAT GACCCGTTTA CTTTATTCGC CGGAATGGCT GGAAACCCAG CAGCAATTAA AAAAAAGAAT GGCCGCAAGT GGGCTGGAAA CACGTTTCGA TGAAGTGGGG AATTTATACG GTCGCCTGAG TGGCACCGAA TATCCACAGG AAGTGATTCT GAGCGGTTCG CATATCGATA CCGTGGTTAA CGGCGGTAAC CTCGACGGGC AATTCGGCGC GCTGGCGGCG CGGCTGGCAA TTGACTGGCT GAAAACTCAA TACGGCGCAC CGCTGCGTAC GGTTGAAGTG GTGGCGATGG CGGAAGAAGA AGGTAGCCGC TTCCCGTATG TTTTCTGGGG CAGTAAAAAT ATCTTTGGGC TGGCGAATCC TGACGACGTG CGGAATATCT GTGATGCCAA AGGAAATAGT TTTGTCGATG CGATGAAGGC TTGCGGATTT ACTCTGCCGG ACGCCCCGCT AACTCCGCGT CAGGATATTA AAGCCTTTGT CGAACTGCAT ATCGAACAGG GCTGTGTGCT GGAAAGTAAT GGGCAATCAA TTGGCGTGGT GAATGCAATT GTCGGGCAAC GTCGCTATAC GGTGACGCTG AACGGCGAAT CAAACCATGC AGGCACCACG CCGATGGGTT ATCGTCGTGA TACGGTTTAC GCTTTCAGTC GCATTTGCCA TCAGTCGATC GAAAAAGCGA AAAAGATGGG CGATCCGCTG GTTCTGACCT TTGGGAAAGT AGAGCCGCGC CCGAATACGG TGAATGTGGT GCCGGGTAAA ACCACGTTCA CCATTGATTG TCGTCATACC GACGCCGCCG TGCTGCGTGA TTTCACCCAA CAGTTAGAAA ACGACATGCG GGCGATTTGC GATGAAATGG ACATTCGTAT TGATATCGAT TTATGGATGG ACGAAGAACC CGTGCCGATG AATAAGGACC TGGTCGCCAC CCTGACAGAA TTGTGTGAAA GTGAAAAACT GAATTACCGG GTGATGCACA GTGGTGCCGG GCACGACGCG CAAATTTTCG CGCCTCGCGT ACCAACCTGC ATGATTTTTA TCCCCAGCAT CAACGGGATC AGCCATAACC CGGCGGAACG CACCAATATT ACCGACCTTG CCGAAGGGGT CAAAACGTTG GCACTCATGC TTTATCAACT TGCCTGGCAG AAATAA
|
Protein sequence | MITHFRQTIE EALPWLSSFG ADPAGGMTRL LYSPEWLETQ QQLKKRMAAS GLETRFDEVG NLYGRLSGTE YPQEVILSGS HIDTVVNGGN LDGQFGALAA RLAIDWLKTQ YGAPLRTVEV VAMAEEEGSR FPYVFWGSKN IFGLANPDDV RNICDAKGNS FVDAMKACGF TLPDAPLTPR QDIKAFVELH IEQGCVLESN GQSIGVVNAI VGQRRYTVTL NGESNHAGTT PMGYRRDTVY AFSRICHQSI EKAKKMGDPL VLTFGKVEPR PNTVNVVPGK TTFTIDCRHT DAAVLRDFTQ QLENDMRAIC DEMDIRIDID LWMDEEPVPM NKDLVATLTE LCESEKLNYR VMHSGAGHDA QIFAPRVPTC MIFIPSINGI SHNPAERTNI TDLAEGVKTL ALMLYQLAWQ K
|
| |