Gene Information |
Locus tag | EcSMS35_0561 |
Symbol | allC |
ID | 6146302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 567925 |
End bp | 569160 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615453 |
Product | allantoate amidohydrolase |
Protein accession | YP_001742660 |
Protein GI | 170681093 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family [TIGR03176] allantoate amidohydrolase |
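
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal sketch, assuming that Start bp and End bp are inclusive 1-based coordinates (the record does not state the convention) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(567925, 569160)
print(length_bp)                  # 1236
print(protein_length(length_bp))  # 411
```

Both values match the Gene Length and Protein Length fields reported in the record.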
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTACAC ATTTCCGTCA AGCTATAGAA GAAACGCTGC CCTGGCTTTC CTCTTTTGGC TCTGACCCAA CGGGTGGAAT GACCCGTTTA CTTTATTCGC CGGAATGGTT GGAAACCCAG CAGCAATTTA AAAAAAGAAT GGCAGCAAGC GGGCTGGAAA CACGTTTCGA TGAAGTGGGG AATTTATACG GTCGCCTGAG TGGCACCGAA TATCCACAGG AAGTGGTTCT GAGCGGTTCG CATATCGATA CCGTGGTTAA CGGCGGTAAC CTCGACGGAC AATTCGGCGC ACTGGCGGCG TGGCTGGCAA TTGACTGGTT GAAAACGCAA TACGGCGCGC CGTTACGTAC GGTCGAAGTG GTGGCGATGG CAGAAGAAGA AGGCAGCCGC TTCCCGTATG TCTTCTGGGG CAGTAAAAAT ATCTTTGGGC TGGCGAATCC TGACGACGTG CGGAATATCT GTGATGCCAA AGGAAATAGT TTTGTCGATG CGATGAAGGC TTGCGGATTT ACTCTGCCGA ACGCCCCACT AACTCCGCGT CAGGATATTA AAGCCTTTGT CGAACTGCAT ATCGAACAGG GCTGTGTGCT GGAAAGTAAT GGGCAATCAA TTGGCGTGGT GAATGCAATT GTCGGGCAAC GTCGCTATAC GGTGACGCTG AATGGCGAAT CAAACCATGC AGGCACCACG CCGATGGGTT ATCGTCGTGA TACGGTTTAC GCTTTCAGTC GCATTTGCCA TCAGTCGATC GAAAAAGCGA AAAAGATGGG CGACCCGCTG GTGCTGACCT TTGGCAAGGT GGAACCGCGC CCGAATACGG TAAATGTGGT GCCGGGGAAA ACCACGTTCA CCATTGATTG TCGTCATACC GACGCCGCCG TGCTGCGCAA TTTCACCCAA CAGTTAGAAA ACGACATGCG GGCGATTTGC GATGAAATGG ACATTGGTAT TGATATCGAT TTATGGATGG ACGAAGAACC GGTGCCGATG AATAAGGAGC TGGTCGCCAC CCTGACAGAA TTGTGTGAAA GCGAAAAACT GAATTACCGG GTAATGCACA GTGGTGCCGG GCACGACGCA CAAATTTTCG CGCCTCGCGT GCCGACCTGC ATGATTTTCA TTCCCAGCAT CAACGGGATC AGCCATAACC CGGCGGAACG CACCAATATT ACCGACCTTG CCGAAGGGGT CAAAACGTTG GCACTCATGC TTTATCAACT TGCCTGGCAG AAATAA
|
Protein sequence | MITHFRQAIE ETLPWLSSFG SDPTGGMTRL LYSPEWLETQ QQFKKRMAAS GLETRFDEVG NLYGRLSGTE YPQEVVLSGS HIDTVVNGGN LDGQFGALAA WLAIDWLKTQ YGAPLRTVEV VAMAEEEGSR FPYVFWGSKN IFGLANPDDV RNICDAKGNS FVDAMKACGF TLPNAPLTPR QDIKAFVELH IEQGCVLESN GQSIGVVNAI VGQRRYTVTL NGESNHAGTT PMGYRRDTVY AFSRICHQSI EKAKKMGDPL VLTFGKVEPR PNTVNVVPGK TTFTIDCRHT DAAVLRNFTQ QLENDMRAIC DEMDIGIDID LWMDEEPVPM NKELVATLTE LCESEKLNYR VMHSGAGHDA QIFAPRVPTC MIFIPSINGI SHNPAERTNI TDLAEGVKTL ALMLYQLAWQ K
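
Both sequences are printed in space-separated blocks of 10 characters, so the spaces need to be removed before any computation. The sketch below strips the whitespace and computes a GC fraction. It is shown only on the first two blocks of the gene sequence to keep the example short, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGATTACAC ATTTCCGTCA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.35
```

Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field.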