Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3327 |
Symbol | |
ID | 6486060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3231684 |
End bp | 3232691 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642738619 |
Product | ureidoglycolate dehydrogenase |
Protein accession | YP_002042340 |
Protein GI | 194445675 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.388816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACAA TATTAGTCAA AGAAAATGAA TTGAAAGCGC TGGCGTTTAA TAAACTCACC CAGGCGGGTC TTGACGCGCA AACGGCGCAG CAGGTAGCCG ACGTTTTGGT TCATGCCGAT ATCACCGGCG TTCACTCCCA CGGCGTTATT CGCGTAGAGC ACTATTGTAC CCGCCTGAAC GCAGGCGGGC TCAATCCAAA AGCAACATTC AGCATCGAAC AGATTTCGCC TTCTGTCGCT ATTCTGGACT CTGATGATGG AATGGGGCAC TGCGCGCTGA TAAAGGCGAC AGACCACGCT ATTAGTCTGG CAAGAGAGAC GGGGCTGGGC TTCGTCAGCG TCAAAAATAC GTCCCACTGC GGCGCGCTCT CGTGGTTTAT TGAGCAGGCG ACAAGCCAGG GAATGGTGGC TATCGCCATG ACGCAAACGG ATACCTGCGT TGCGCCGTAT GGCGGCGCGG AGCGCTTTCT GGGGACTAAC CCAATCGCCT TCGGCTTCCC CGTTAAAGAC AGCCATCCGA TGATCGTCGA TATGGCGACC AGCGCCATTG CGTTTGGTAA AATTCTGCAC GCCAAAGAAA CCGGGAAACC AATAGGTCAT GGCCTGGCGC TGGATAAAGA GGGACATATC ACTACCGATC CGCATAAAAT TGAAAATCTG CTGCCCTTTG GCGGACACAA AGGTTCTGGT ATTGCCCTGG CGATTGATGC GCTCACGGGC GTCCTGATGG GGGCAAACTT TAGCAACCAT ATTGTTCGGA TGTATGGCGA CTATGACAAG ATGCGCAAGT TAGCGAGTCT GGTTATCGTT ATTGATCCGC AGATGTTGGG CAATCCGCTG TTTTCCTCAA TAATGAGCAC AATGGTCAAT GAGCTAAGGG CTGTAAAACC GATGCCCGGT GTTGATAAAG TGCTGGCGCC AAACGATCCG CAAATCGCCT ATAAAGAAAA ATGTTTAAAA GAAGGTATTC CTGTGGCTGA GGGTATTTAC CAGTACCTGA TTGGCTGA
|
Protein sequence | MSTILVKENE LKALAFNKLT QAGLDAQTAQ QVADVLVHAD ITGVHSHGVI RVEHYCTRLN AGGLNPKATF SIEQISPSVA ILDSDDGMGH CALIKATDHA ISLARETGLG FVSVKNTSHC GALSWFIEQA TSQGMVAIAM TQTDTCVAPY GGAERFLGTN PIAFGFPVKD SHPMIVDMAT SAIAFGKILH AKETGKPIGH GLALDKEGHI TTDPHKIENL LPFGGHKGSG IALAIDALTG VLMGANFSNH IVRMYGDYDK MRKLASLVIV IDPQMLGNPL FSSIMSTMVN ELRAVKPMPG VDKVLAPNDP QIAYKEKCLK EGIPVAEGIY QYLIG
|
| |