Gene Information |
Locus tag | SbBS512_E0629 |
Symbol | |
ID | 6268654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 599552 |
End bp | 600484 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641724827 |
Product | allophanate hydrolase, subunit 2 |
Protein accession | YP_001879366 |
Protein GI | 187732878 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
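
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 599552
end_bp = 600484

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 933
print(protein_length)   # 310
```

Both results match the Gene Length (933 bp) and Protein Length (310 aa) rows.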
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAGA TTATTCGTGC GGGCATGTAT ACCACTGTGC AGGATGGCGG TCGTCACGGT TTTCGCCAGT CGGGTATCAG CCACTGCGGC GCACTGGATA TGCCTGCGTT ACGCATTGCT AACCTGCTGG TGGGTAATGA CGCCAATGCC CCCGCGCTGG AGATCACGCT CGGTCAGTTA ACGGTTGAGT TCGAAACTGA TGGGTGGTTT GCTCTGACGG GTGCCGGTTG CGAAGCGCGG CTGGATGATA ACGCCGTCTG GACCGGCTGG CGCTTGCCGA TGAAAGCAGG CCAGCGTTTA ACGCTTAAAC GCCCGCAGCA CGGCATGCGC AGTTATCTGG CGGTCGCGGG TGGTATTGAT GTTCCGCCGG TAATGGGGTC ATGCAGCACC GATCTCAACG TGGGGATTGG CGGGCTGGAA GGCCGTTTAC TGAAGGATGG TGACCGACTC CCGATTGGCA AATCGAAGCG TGATTTTATG GAAGCGCAGG GCGTTAAACA GCTGCTGTGG GGCAACCGCA TTCGCGCCTT GCCGGGGCCG GAATATCATG AGTTCGATCG CGCCTCGCAG GATGCATTCT GGCGTTCGCC CTGGCAGCTT AGCCCGCAAA GTAACCGCAT GGGCTATCGC TTACAGGGGC AAATTTTAAA ACGCACCACC GATCGCGAAC TGTTATCTCA CGGTTTGTTA CCGGGTGTGG TGCAGGTGCC GCATAACGGG CAGCCGATTG TGTTGATGAA CGACGCACAG ACCACCGGTG GTTACCCGCG TATTGCCTGT ATCATTGAGG CTGATATGTA CCATCTGGCG CAAATTCCGC TCGGTCAGCC GATTCATTTT GTCCAGTGTT CACTGGAAGA GGCACTGAAA GCGCGGCAAG ATCAACAACG TTATTTCGAA CAATTAGCGT GGCGGCTACA CAATGAAAAT TGA
|
Protein sequence | MLKIIRAGMY TTVQDGGRHG FRQSGISHCG ALDMPALRIA NLLVGNDANA PALEITLGQL TVEFETDGWF ALTGAGCEAR LDDNAVWTGW RLPMKAGQRL TLKRPQHGMR SYLAVAGGID VPPVMGSCST DLNVGIGGLE GRLLKDGDRL PIGKSKRDFM EAQGVKQLLW GNRIRALPGP EYHEFDRASQ DAFWRSPWQL SPQSNRMGYR LQGQILKRTT DRELLSHGLL PGVVQVPHNG QPIVLMNDAQ TTGGYPRIAC IIEADMYHLA QIPLGQPIHF VQCSLEEALK ARQDQQRYFE QLAWRLHNEN
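
The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code (the tables differ in permitted start codons). The following is a rough sketch of how the GC content and the translation could be recomputed from the sequence text; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as sample input.

```python
# Build the standard codon table; table 11 uses the same codon assignments
# for elongation (assumption stated in the lead-in: only start codons differ).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean(seq: str) -> str:
    """Remove the block-of-ten spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
sample = "ATGCTGAAGA TTATTCGTGC GGGCATGTAT"
print(translate(sample))  # MLKIIRAGMY
```

Pasting the full Gene sequence value into `translate` in place of `sample` should allow comparison against the Protein sequence row, and `gc_fraction` against the GC content row.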
|
| |