Gene SbBS512_E2268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2268 
SymbolsolA 
ID6269619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2062601 
End bp2063719 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content52% 
IMG OID641726285 
ProductN-methyltryptophan oxidase 
Protein accessionYP_001880769 
Protein GI187730375 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01377] sarcosine oxidase, monomeric form 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT 
GCAACCCGCG CCGGTTTAAA CGTGCTAATG ACCGACGCCC ATATGCCACC GCATCAACAC
GGCAGCCACC ACGGCGATAC GCGATTAATT CGCCATGCTT ATGGTGAAGG CGAAAAGTAT
GTCCCGTTGG TCCTCCGCGC GCAAACGCTG TGGGATGAAC TTTCCCGCCA CAACGAAGAT
GATCCCATTT TTGTACGCTC TGGTGTCATT AACCTCGGCC CGGCTGACTC CGCATTTCTC
GCCAACGTCG CCCACAGCGC CGAACAGTGG CAACTCAACG TTGAAAAGCT CGATGCGCAA
GGGATTATGG CCCGCTGGCC AGAAATACGC GTCCCGGACA ACTACATCGG CTTATTTGAG
ACTGATTCCG GTTTTTTGCG CAGCGAACTG GCGATTAAAA CCTGGATCCA ACTGGCGAAG
GAAGCGGGCT GTGCGCAACT GTTCAACTGC CCGGTCACCG CAATTCGTCA TGACGATGAT
GGCGTAACTA TTGAAACCGT TGACGGTGAG TATCAGGCGA AAAAAGCGAT TGTCTGCGCG
GGAACATGGG TAAAAGACCT GCTCCCGGAG CTGCCTGTCC AGCCTGTACG TAAAGTATTT
GCCTGGTATC AGGCCGATGG CCGCTATAGC GTGAAGAATA AATTCCCGGC GTTTACCGGT
GAACTGCCCA ATGGCGATCA ATATTATGGT TTTCCGGCAG AAAACGACGC GTTGAAGATT
GGCAAACATA ACGGAGGCCA GGTTATCCAT TCAGCGGATG AACGTGTTCC GTTTGCGGAA
GTGGTCAGCG ATGGTTCGGA AGCCTTCCCG TTCTTGCGCA ATGTATTGCC GGGTATCGGT
TGCTGCCTGT ACGGCGCTGC CTGCACCTAT GATAATTCGC CTGACGAAGA TTTTATTATC
GATACCCTAC CCGGCCACGA TAATACACTG CTCATTACCG GCCTGAGTGG GCACGGTTTT
AAATTTGCGT CAGTTTTAGG GGAAATAGCT GCCGATTTTG CGCAAGACAA AAAAAGCGAT
TTTGATTTGA CGCCATTCAG GCTTTCCCGC TTCCAATAA
 
Protein sequence
MKYDLIIIGS GSVGAAAGYY ATRAGLNVLM TDAHMPPHQH GSHHGDTRLI RHAYGEGEKY 
VPLVLRAQTL WDELSRHNED DPIFVRSGVI NLGPADSAFL ANVAHSAEQW QLNVEKLDAQ
GIMARWPEIR VPDNYIGLFE TDSGFLRSEL AIKTWIQLAK EAGCAQLFNC PVTAIRHDDD
GVTIETVDGE YQAKKAIVCA GTWVKDLLPE LPVQPVRKVF AWYQADGRYS VKNKFPAFTG
ELPNGDQYYG FPAENDALKI GKHNGGQVIH SADERVPFAE VVSDGSEAFP FLRNVLPGIG
CCLYGAACTY DNSPDEDFII DTLPGHDNTL LITGLSGHGF KFASVLGEIA ADFAQDKKSD
FDLTPFRLSR FQ