Gene SeD_A2212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2212 
SymbolsolA 
ID6873718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2119460 
End bp2120578 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content55% 
IMG OID642785314 
ProductN-methyltryptophan oxidase 
Protein accessionYP_002215977 
Protein GI198243040 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.551548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value0.375717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACG ACCTTATTAT TATCGGCAGC GGTTCGGTTG GCGCCGCCGC TGGTTATTAC 
GCCACCCGCG CCGGGCTAAA GGTCCTGATG ACCGATGCGC ATATGCCGCC TCATCAACAG
GGCAGCCACC ACGGCGATAC CCGTCTTATC CGCCATGCTT ATGGTGAAGG CGAAAAATAT
GTCCCGCTGA TGCTTCGCGC CCAGACGCTT TGGGATGAAC TCTCCACACA CAATGAAGAG
CCTATTTTTG TCCGCTCCGG CGTCGTCAAC CTCGGCCCGG CCGATTCCGC TTTCTTAGCC
AACGTCGCAC GAAGCGCGCA ACAGTGGCAA TTGAACGTCG AGCGCCTGGA CGCGACGGCC
CTCATGACGC GCTGGCCGGA AATTCGCGTG CCCGATAATT ATATCGGGCT GTTTGAAGCT
GACTCCGGTT TTCTGCGCAG CGAATTAGCC ATTACCACAT GGCTTCGTCT GGCCCGAGAG
GCTGGCTGCG CACAGCTATT CAACAGCCAG GTAAGCCATA TTCATCATGA TGATAACGGT
GTGACGATAG AGACGAGTGA AGGCAGCTAC CACGCCAGCA AGGCGCTAAT TAGCGCGGGC
ACCTGGGTCA AAGCGCTGGT ACCGGAGCTG CCCGTTCAGC CCGTACGTAA AGTTTTTGCC
TGGTTTAAGG CGGATGGACG TTACAGCACT AAAAACCGCT TTCCGGCCTT TACCGGCGAA
ATGCCCAACG GCGATCAATA TTACGGCTTC CCGGCGGAGA ACGACGAGTT AAAAATCGGC
AAACACAATG GCGGACAGCT AATACAGGCT CCGGAAGAGC GCAAGCCCTT TGCCGCCGTT
GCCAGCGATG GCGCGGAAGC ATTTCCTTTC CTGCGTAACG TACTGCCGGG TATCGGCGGT
TGTTTACATG GGGCGGCATG TACCTATGAT AATTCGCCGG ACGAGAATTT TATTATCGAT
ACGCTGCCTG GCCATGAGAA CACGCTTGTC ATCACTGGAC TCAGCGGACA TGGTTTTAAA
TTTGCCCCGG TGTTAGGAGA AATCGCTGCG GATTTTGCGT TGGGAAAAAC ATCCTCCTTT
GATCTGACGC CGTTCCGGCT TTCCCGTTTT AGCCAATAA
 
Protein sequence
MKYDLIIIGS GSVGAAAGYY ATRAGLKVLM TDAHMPPHQQ GSHHGDTRLI RHAYGEGEKY 
VPLMLRAQTL WDELSTHNEE PIFVRSGVVN LGPADSAFLA NVARSAQQWQ LNVERLDATA
LMTRWPEIRV PDNYIGLFEA DSGFLRSELA ITTWLRLARE AGCAQLFNSQ VSHIHHDDNG
VTIETSEGSY HASKALISAG TWVKALVPEL PVQPVRKVFA WFKADGRYST KNRFPAFTGE
MPNGDQYYGF PAENDELKIG KHNGGQLIQA PEERKPFAAV ASDGAEAFPF LRNVLPGIGG
CLHGAACTYD NSPDENFIID TLPGHENTLV ITGLSGHGFK FAPVLGEIAA DFALGKTSSF
DLTPFRLSRF SQ