Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0727 |
Symbol | |
ID | 6142735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 733811 |
End bp | 734743 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615617 |
Product | allophanate hydrolase, subunit 2 |
Protein accession | YP_001742816 |
Protein GI | 170682154 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.25799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.390135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAGA TTATTCGTGC GGGCATGTAT ACCACTGTGC AGGATGGCGG TCGTCACGGT TTTCGCCAGT CGGGTATCAG CCACTGCGGC GCACTGGATA TGCCTGCGCT GCGCATTGCT AACCTGCTGG TGGGTAATGA CGCCAATGCC CCCGCGCTGG AGATCACGCT CGGTCAGTTA ACGGTTGAGT TCGAAACTGA TGGGTGGTTT GCTCTGACGG GGGCCGGTTG CGAAGCGCGG CTGGATGATA ACGCCGTCTG GACCGGCTGG CGATTGCCGA TGAAAGCAGG CCAGCGTTTA ACGCTTAAAC GCCCGCAGCA CGGGATGCGC AGTTATCTGG CGGTCGCGGG TGGTATTGAT GTTCCGGCGG TAATGGGGTC ATGCAGTACC GATCTCAAAG TGGGGATTGG TGGGCTGGAA GGCCGTTTAC TGAAGGATGG TGACCGTCTC CCGATTGGCA AAGCGAAGCG TGATTTTATG GAAGCGCAGG GCGTTAAACA GCTGCTGTGG GGCAACCGCA TTCGCGCCTT GCCGGGGCCG GAATATCATG AGTTCGATCG CACCTCGCAG GATGCATTCT GGCGTTCGCC CTGGCAGCTT AGCTCGCAAA GTAACCGCAT GGGCTATCGC TTACAGGGGC AAATTTTAAA ACGCACCACC GATCGCGAAC TGTTATCTCA CGGTTTGTTA CCGGGCGTGG TGCAGGTGCC GCATAACGGG CAGCCCATTG TGTTGATGAA CGACGCACAG ACCACCGGTG GTTATCCGCG TATTGCCTGT ATCATCGAGG CTGATATGTA CCATCTGGCG CAAATTCCGC TCGGTCAGCC GATTCATTTT GTCCAGTGTT CACTGGAAGA AGCACTGAAA GCACGGCAAG ATCAGCAACG TTATTTCGAA CAATTAGCGT GGCGGCTGCA CAATGAAAAT TGA
|
Protein sequence | MLKIIRAGMY TTVQDGGRHG FRQSGISHCG ALDMPALRIA NLLVGNDANA PALEITLGQL TVEFETDGWF ALTGAGCEAR LDDNAVWTGW RLPMKAGQRL TLKRPQHGMR SYLAVAGGID VPAVMGSCST DLKVGIGGLE GRLLKDGDRL PIGKAKRDFM EAQGVKQLLW GNRIRALPGP EYHEFDRTSQ DAFWRSPWQL SSQSNRMGYR LQGQILKRTT DRELLSHGLL PGVVQVPHNG QPIVLMNDAQ TTGGYPRIAC IIEADMYHLA QIPLGQPIHF VQCSLEEALK ARQDQQRYFE QLAWRLHNEN
|
| |