Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2268 |
Symbol | solA |
ID | 6269619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 2062601 |
End bp | 2063719 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641726285 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_001880769 |
Protein GI | 187730375 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT GCAACCCGCG CCGGTTTAAA CGTGCTAATG ACCGACGCCC ATATGCCACC GCATCAACAC GGCAGCCACC ACGGCGATAC GCGATTAATT CGCCATGCTT ATGGTGAAGG CGAAAAGTAT GTCCCGTTGG TCCTCCGCGC GCAAACGCTG TGGGATGAAC TTTCCCGCCA CAACGAAGAT GATCCCATTT TTGTACGCTC TGGTGTCATT AACCTCGGCC CGGCTGACTC CGCATTTCTC GCCAACGTCG CCCACAGCGC CGAACAGTGG CAACTCAACG TTGAAAAGCT CGATGCGCAA GGGATTATGG CCCGCTGGCC AGAAATACGC GTCCCGGACA ACTACATCGG CTTATTTGAG ACTGATTCCG GTTTTTTGCG CAGCGAACTG GCGATTAAAA CCTGGATCCA ACTGGCGAAG GAAGCGGGCT GTGCGCAACT GTTCAACTGC CCGGTCACCG CAATTCGTCA TGACGATGAT GGCGTAACTA TTGAAACCGT TGACGGTGAG TATCAGGCGA AAAAAGCGAT TGTCTGCGCG GGAACATGGG TAAAAGACCT GCTCCCGGAG CTGCCTGTCC AGCCTGTACG TAAAGTATTT GCCTGGTATC AGGCCGATGG CCGCTATAGC GTGAAGAATA AATTCCCGGC GTTTACCGGT GAACTGCCCA ATGGCGATCA ATATTATGGT TTTCCGGCAG AAAACGACGC GTTGAAGATT GGCAAACATA ACGGAGGCCA GGTTATCCAT TCAGCGGATG AACGTGTTCC GTTTGCGGAA GTGGTCAGCG ATGGTTCGGA AGCCTTCCCG TTCTTGCGCA ATGTATTGCC GGGTATCGGT TGCTGCCTGT ACGGCGCTGC CTGCACCTAT GATAATTCGC CTGACGAAGA TTTTATTATC GATACCCTAC CCGGCCACGA TAATACACTG CTCATTACCG GCCTGAGTGG GCACGGTTTT AAATTTGCGT CAGTTTTAGG GGAAATAGCT GCCGATTTTG CGCAAGACAA AAAAAGCGAT TTTGATTTGA CGCCATTCAG GCTTTCCCGC TTCCAATAA
|
Protein sequence | MKYDLIIIGS GSVGAAAGYY ATRAGLNVLM TDAHMPPHQH GSHHGDTRLI RHAYGEGEKY VPLVLRAQTL WDELSRHNED DPIFVRSGVI NLGPADSAFL ANVAHSAEQW QLNVEKLDAQ GIMARWPEIR VPDNYIGLFE TDSGFLRSEL AIKTWIQLAK EAGCAQLFNC PVTAIRHDDD GVTIETVDGE YQAKKAIVCA GTWVKDLLPE LPVQPVRKVF AWYQADGRYS VKNKFPAFTG ELPNGDQYYG FPAENDALKI GKHNGGQVIH SADERVPFAE VVSDGSEAFP FLRNVLPGIG CCLYGAACTY DNSPDEDFII DTLPGHDNTL LITGLSGHGF KFASVLGEIA ADFAQDKKSD FDLTPFRLSR FQ
|
| |