Gene Information |
Locus tag | SeAg_B0574 |
Symbol | allD |
ID | 6796254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 572843 |
End bp | 573892 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642774855 |
Product | ureidoglycolate dehydrogenase |
Protein accession | YP_002145511 |
Protein GI | 197249061 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | [TIGR03175] ureidoglycolate dehydrogenase |
|
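The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes (this is not stated in the record) that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 572843
END_BP = 573892
GENE_LENGTH_BP = 1050
PROTEIN_LENGTH_AA = 349


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 573892 - 572843 + 1 gives 1050 bp, and 1050 / 3 - 1 gives 349 codons, which agrees with the listed protein length.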
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA GTCGGGAAAC ACTCCATCAG CTTATCGAAA ATAAGCTTTA TAAAGCCGGA CTAAAACGTG AGCACGCCGC CATCGTCGCC GACGTACTGG TTTATGCAGA TGCCAGAGGT ATTCACTCAC ATGGCGCCGT ACGCGTTGAA TATTATGCCG AACGAATTTC AAAAGGCGGC ACCAACCGGG AGCCGACGTT CCGCATTGAG AATACCGGTC CCTGTACGGC GATACTGCAT GCCGATAATG CTGCCGGACA GGTCGCAGCC AAAATGGGAA TGGAGCATGC TATTGAAATA GCCAAAAAAA ATGGCATCGC GGTTGTCGGC ATTAGCAGAA TGGGCCATAG CGGCGCCATC TCTTATTTCG TCCGCCAGGC TGCTCGCGAA GGCCTGATCG GTCTGTCTAT CTGTCAGTCC GATCCTATGG TCGTGCCGTT TGGCGGGGCG GATATTTACT ATGGCACTAA TCCGCTGGCC TTTGCCGCGC CGGGCGAAGG CGATGACATC ATTACCTTCG ATATGGCCAC CACCGTGCAG GCCTGGGGAA AAGTCCTCGA TGCACGGTCC CGCAATGAGT CCATTCCGGA GAGTTGGGCC GTTGATAAAA ACGGCGCGCC GACACATGAT CCTTTTGCCG TCAATGCGTT ATTACCCGCC GCAGGCCCGA AAGGCTACGG CCTGATGATG ATGATCGATA TTCTGTCCGG TATTCTGCTG GGGCTGCCGT TTGGCCGCCA GGTCAGTTCG ATGTATGAAG ATTTACACGC CGGACGCAAT TTAGGACAAC TTCATCTGGT CATTAATCCG GCGTTCTTTT CTTCCTGTGA ATTATTCCGC AAACATATTA GTCAGACCAT GCAGGAACTC AATTCCGTGA AGCCCGCCCC CGGTTTTAAA CAGGTTTATT ATCCTGGACA GGATCAGGAT ATTAAACAGA AAAATGCCGA TATGAATGGT ATCGATATTG TTGATGATAT TTATCAATAT CTGATTTCCG ATGCCCTCTA TCTCAAGTCA TACGAAACAA AAAATCCCTT TGCCCAATAA
|
Protein sequence | MKISRETLHQ LIENKLYKAG LKREHAAIVA DVLVYADARG IHSHGAVRVE YYAERISKGG TNREPTFRIE NTGPCTAILH ADNAAGQVAA KMGMEHAIEI AKKNGIAVVG ISRMGHSGAI SYFVRQAARE GLIGLSICQS DPMVVPFGGA DIYYGTNPLA FAAPGEGDDI ITFDMATTVQ AWGKVLDARS RNESIPESWA VDKNGAPTHD PFAVNALLPA AGPKGYGLMM MIDILSGILL GLPFGRQVSS MYEDLHAGRN LGQLHLVINP AFFSSCELFR KHISQTMQEL NSVKPAPGFK QVYYPGQDQD IKQKNADMNG IDIVDDIYQY LISDALYLKS YETKNPFAQ
|
| |
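The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline the database itself uses. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the strand is "-"), and it relies on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in which codons may initiate translation. Because this gene starts with ATG, no special start-codon handling is included. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard amino-acid assignments (shared by translation table 11),
# ordered by first, second, and third base over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace that separates the 10-letter blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGAAAATCA GTCGGGAAAC ACTCCATCAG"))
```

Applied to the first three blocks of the gene sequence, `translate` returns `MKISRETLHQ`, which matches the first block of the protein sequence. The last four codons, `TTTGCCCAATAA`, give `FAQ` followed by the stop codon, which matches the end of the protein. Passing the full gene sequence to `gc_fraction` would allow a comparison with the listed GC content.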