Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A1257 |
Symbol | solA |
ID | 6483413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 1248207 |
End bp | 1249325 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642736657 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_002040414 |
Protein GI | 194443753 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.381174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.53223 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATACG ACCTTATTAT TATCGGCAGC GGTTCGGTTG GCGCCGCCGC TGGTTATTAC GCCACCCGCG CCGGGCTAAA GGTCCTGATG ACCGATGCGC ATATGCCGCC TCATCAACAG GGCAGCCATC ACGGCGATAC CCGACTTATC CGCCACGCTT ATGGTGAAGG CGAAAAATAT GTCCCGCTGG TGCTTCGCGC CCAGGCGCTT TGGGATGAAC TCTCCACACA CAATGAAGAG CCTATTTTTG TCCGCTCCGG CGTCGTCAAC CTCGGCCCGG CCGATTCCGC TTTCTTAGCC AACGTCGCAC GAAGCGCGCA ACAGTGGCAA TTGAACGTCG AGCGCCTGGA CGCGACGGCC CTCATGACGC GCTGGCCGGA AATTCGCGTG CCCGATAATT ATATCGGGCT GTTTGAAGCT GACTCCGGTT TCCTGCGCAG CGAATTAGCC ATTACCACAT GGCTTCGTCT GGCCCGAGAG GCAGGCTGCG CACAGCTATT TAACAGCCCG GTAAGCCATA TTCACCATGA TGATAACGGT GTGACGATAG AGACGAGTGA AGGCTGCTAC CACGCCAGCA AGGCGCTGAT TAGCGCGGGC ACCTGGGTCA AAGCGCTGGT ACCGGAGCTG CCCGTTCAGC CCGTACGTAA AGTTTTTGCC TGGTTTAAGG CGGATGGACG TTACAGCACT AAAAACCGCT TTCCGGCCTT TACCGGCGAA ATGCCCAACG GCGATCAATA TTACGGCTTC CCGGCGGAGA ACGACGAGTT AAAAATCGGC AAACACAATG GCGGACAGCT AATACAGGCT CAGGAAGAGC GCAAGCCCTT TGCCGCCGTT GCCAGCGATG GCGCGGAAGC ATTTCCTTTC CTGCGTAACG TACTGCCGGG TATCGGCGGT TGTTTACATG GGGCAGCATG TACCTATGAT AATTCGCCGG ACGAGGATTT TATTATCGAT ACGCTGCCTG GCCATGAGAA TACGCTTGTC ATCACTGGAC TCAGCGGACA TGGTTTTAAA TTCGCCCCGG TGTTAGGAGA AATCGCTGCA GATTTTGCGT TGGGAAAAAC GCCCTCCTTT GATCTGACGC CGTTCCGGCT GTCCCGTTTT AGCCAATAA
|
Protein sequence | MKYDLIIIGS GSVGAAAGYY ATRAGLKVLM TDAHMPPHQQ GSHHGDTRLI RHAYGEGEKY VPLVLRAQAL WDELSTHNEE PIFVRSGVVN LGPADSAFLA NVARSAQQWQ LNVERLDATA LMTRWPEIRV PDNYIGLFEA DSGFLRSELA ITTWLRLARE AGCAQLFNSP VSHIHHDDNG VTIETSEGCY HASKALISAG TWVKALVPEL PVQPVRKVFA WFKADGRYST KNRFPAFTGE MPNGDQYYGF PAENDELKIG KHNGGQLIQA QEERKPFAAV ASDGAEAFPF LRNVLPGIGG CLHGAACTYD NSPDEDFIID TLPGHENTLV ITGLSGHGFK FAPVLGEIAA DFALGKTPSF DLTPFRLSRF SQ
|
| |