Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2212 |
Symbol | solA |
ID | 6873718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 2119460 |
End bp | 2120578 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642785314 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_002215977 |
Protein GI | 198243040 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.551548 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 0.375717 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATACG ACCTTATTAT TATCGGCAGC GGTTCGGTTG GCGCCGCCGC TGGTTATTAC GCCACCCGCG CCGGGCTAAA GGTCCTGATG ACCGATGCGC ATATGCCGCC TCATCAACAG GGCAGCCACC ACGGCGATAC CCGTCTTATC CGCCATGCTT ATGGTGAAGG CGAAAAATAT GTCCCGCTGA TGCTTCGCGC CCAGACGCTT TGGGATGAAC TCTCCACACA CAATGAAGAG CCTATTTTTG TCCGCTCCGG CGTCGTCAAC CTCGGCCCGG CCGATTCCGC TTTCTTAGCC AACGTCGCAC GAAGCGCGCA ACAGTGGCAA TTGAACGTCG AGCGCCTGGA CGCGACGGCC CTCATGACGC GCTGGCCGGA AATTCGCGTG CCCGATAATT ATATCGGGCT GTTTGAAGCT GACTCCGGTT TTCTGCGCAG CGAATTAGCC ATTACCACAT GGCTTCGTCT GGCCCGAGAG GCTGGCTGCG CACAGCTATT CAACAGCCAG GTAAGCCATA TTCATCATGA TGATAACGGT GTGACGATAG AGACGAGTGA AGGCAGCTAC CACGCCAGCA AGGCGCTAAT TAGCGCGGGC ACCTGGGTCA AAGCGCTGGT ACCGGAGCTG CCCGTTCAGC CCGTACGTAA AGTTTTTGCC TGGTTTAAGG CGGATGGACG TTACAGCACT AAAAACCGCT TTCCGGCCTT TACCGGCGAA ATGCCCAACG GCGATCAATA TTACGGCTTC CCGGCGGAGA ACGACGAGTT AAAAATCGGC AAACACAATG GCGGACAGCT AATACAGGCT CCGGAAGAGC GCAAGCCCTT TGCCGCCGTT GCCAGCGATG GCGCGGAAGC ATTTCCTTTC CTGCGTAACG TACTGCCGGG TATCGGCGGT TGTTTACATG GGGCGGCATG TACCTATGAT AATTCGCCGG ACGAGAATTT TATTATCGAT ACGCTGCCTG GCCATGAGAA CACGCTTGTC ATCACTGGAC TCAGCGGACA TGGTTTTAAA TTTGCCCCGG TGTTAGGAGA AATCGCTGCG GATTTTGCGT TGGGAAAAAC ATCCTCCTTT GATCTGACGC CGTTCCGGCT TTCCCGTTTT AGCCAATAA
|
Protein sequence | MKYDLIIIGS GSVGAAAGYY ATRAGLKVLM TDAHMPPHQQ GSHHGDTRLI RHAYGEGEKY VPLMLRAQTL WDELSTHNEE PIFVRSGVVN LGPADSAFLA NVARSAQQWQ LNVERLDATA LMTRWPEIRV PDNYIGLFEA DSGFLRSELA ITTWLRLARE AGCAQLFNSQ VSHIHHDDNG VTIETSEGSY HASKALISAG TWVKALVPEL PVQPVRKVFA WFKADGRYST KNRFPAFTGE MPNGDQYYGF PAENDELKIG KHNGGQLIQA PEERKPFAAV ASDGAEAFPF LRNVLPGIGG CLHGAACTYD NSPDENFIID TLPGHENTLV ITGLSGHGFK FAPVLGEIAA DFALGKTSSF DLTPFRLSRF SQ
|
| |