Gene Information |
Locus tag | SeSA_A1228 |
Symbol | solA |
ID | 6517258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 1204727 |
End bp | 1205845 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642746351 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_002114159 |
Protein GI | 194733961 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
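The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1204727
END_BP = 1205845


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1119 372
```

Both results agree with the Gene Length (1119 bp) and Protein Length (372 aa) rows.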
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.431985 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATACG ACCTTATTAT TATCGGCAGC GGTTCGGTTG GCGCCGCCGC TGGTTATTAC GCCACCCGCG CCGGGCTAAA GGTCCTGATG ACCGATGCGC ATATGCCGCC TCATCAACAG GGCAGCCACC ACGGCGATAC CCGTCTTATC CGCCACGCTT ATGGTGAAGG CGAAAAATAT GTCCCGCTGG TGCTTCGCGC CCAGGCGCTT TGGGATGAAC TCTCCACACA CAATGAAGAG CCTATTTTTG TCCGCTCCGG CGTCGTCAAC CTCGGACCGG CCGATTCCGC TTTCTTAGCC AACGTCGCAC GAAGCGCGCA ACAGTGGCAA TTGAACGTCG AGCGCCTGGA CGCGACGGCC CTCATGACGC GCTGGCCGGA AATTCGCGTG CCCGATAATT ATATCGGGCT GTTTGAAGCT GACTCCGGTT TCCTGCGCAG CGAATTAGCC ATTACCACAT GGCTTCGTCT GGCCCGAGAA GCAGGCTGCG CACAGCTATT CAACAGCCCG GTAAGCCATA TTCACCATGA TGATAACGGT GTGACGATAG AGACGAGTGA AGGCTGCTAC CACGCCAGCA AAGCGCTGAT TAGCGCGGGC ACCTGGGTCA AAGCGCTGGT ACCGGAGCTG CCCGTTCAGC CCGTACGTAA AGTTTTTGCC TGGTTTAAGG CGGATGGACG TTACAGTACT AAAAACCGCT TTCCGGCCTT TACCGGCGAA ATGCCCAACG GCGATCAATA TTACGGCTTC CCGGCGGAGA ACGACGAGTT AAAAATCGGC AAACACAATG GCGGACAGCT AATACAGGCT CCGGAAGAGC GCAAGCCCTT TGCCGCCGTT GCCAGCGATG GCGCGGAAGC ATTTCCTTTC TTGCGTAATG TACTGCCGGG TATCGGCGGT TGTTTACATG GGGCGGCATG TACCTATGAT AATTCGCCGG ACGAGGATTT TATTATCGAT ACGCTGCCTG GCCATGAGAA TACGCTTGTC ATCACTGGAC TCAGCGGACA TGGTTTTAAA TTCGCCCCGG TGTTAGGAGA AATCGCTGCG GATTTTGCGT TGGGAAAAAC GCCCTCCTTT GATCTGACGC CGTTCCGGCT TTCCCGTTTT AGCCAATAA
|
Protein sequence | MKYDLIIIGS GSVGAAAGYY ATRAGLKVLM TDAHMPPHQQ GSHHGDTRLI RHAYGEGEKY VPLVLRAQAL WDELSTHNEE PIFVRSGVVN LGPADSAFLA NVARSAQQWQ LNVERLDATA LMTRWPEIRV PDNYIGLFEA DSGFLRSELA ITTWLRLARE AGCAQLFNSP VSHIHHDDNG VTIETSEGCY HASKALISAG TWVKALVPEL PVQPVRKVFA WFKADGRYST KNRFPAFTGE MPNGDQYYGF PAENDELKIG KHNGGQLIQA PEERKPFAAV ASDGAEAFPF LRNVLPGIGG CLHGAACTYD NSPDEDFIID TLPGHENTLV ITGLSGHGFK FAPVLGEIAA DFALGKTPSF DLTPFRLSRF SQ
|
| |
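The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: it assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene is on the minus strand), and it uses the standard codon assignments, which, to my understanding, translation table 11 shares for internal codons (the tables differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAAATACG ACCTTATTAT TATCGGCAGC"))  # prints: MKYDLIIIGS
print(translate("AGCCAATAA"))                         # prints: SQ
```

The two fragments reproduce the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content row.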