Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2070 |
Symbol | solA |
ID | 6143871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2086711 |
End bp | 2087829 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616946 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_001744122 |
Protein GI | 170680185 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.384764 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.194177 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT GCAACCCGCG CCGGTTTAAA CGTGCTAATG ACCGACGCCC ATATGCCACC GCATCAACAC GGCAGCCACC ACGGCGATAC GCGATTAATT CGCCATGCTT ATGGTGAAGG CGAAAAGTAT GTCCCGCTGG TCCTCCGCGC GCAAATGCTG TGGGATGAAC TTTCCCGCCA CAACGAAGAT GATCCCATTT TTGTACGCTC TGGTGTCATT AACCTCGGTC CGGCTGACTC AGCATTTCTC GCCAACGTCG CCCACAGCGC CGAACAGTGG CAACTCAACG TTGAACAGCT CGACGCGCAA GGGATTATGG CCCGCTGGCC AGAAATACGC GTCCCGGACA ACTACATCGG TTTATTTGAA ACTGATTCCG GTTTTTTGCG CAGCGAACTG GCGATTAAAA CCTGGATCCA ACTGGCGAAG GAAGCGGGCT GTGCGCAGCT GTTCAACTGC CCGGTCACCG AAATTCGTCA TGACGATGAT GGCGTAACTA TTGAAACGGC AGACGGTGAG TATCAGGCGA AAAAAGCGAT TGTCTGCGCG GGAACATGGG TAAAAGACCT ACTCCCGGAG CTGCCTGTCC AGCCCGTACG CAAAGTATTT GCCTGGTATC AGGCCGATGG TCGCTATAGC GTGAAGAATA AATTCCCGGC GTTTACCGGT GAACTGCCCA ATGGCGATCA ATATTATGGT TTTCCGGCAG AAAACGACGC GTTGAAGATT GGCAAACATA ACGGAGGCCA GGTTATCCAT TCAGCGGATG AACGTGTTCC GTTTGCGGAA GTGGTCAGCG ATGGTTCGGA AGCCTTCCCG TTCTTGCGCA ATGTGTTGCC GGGTATCGGT TGCTGCCTGT ACGGCGCTGC CTGCACCTAT GATAATTCTC CTGACGAGGA TTTTATTATC GATACTCTAC CCGCCCACGA TAATACACTG CTCATTACCG GCCTGAGTGG GCACGGTTTT AAATTTGCGT CAGTTTTAGG GGAAATAGCT GCCGATTTTG CGCAAGACAA AAAAAGCGAT TTTGATTTGA CGCCATTCAG GCTTTCCCGC TTCCAATAA
|
Protein sequence | MKYDLIIIGS GSVGAAAGYY ATRAGLNVLM TDAHMPPHQH GSHHGDTRLI RHAYGEGEKY VPLVLRAQML WDELSRHNED DPIFVRSGVI NLGPADSAFL ANVAHSAEQW QLNVEQLDAQ GIMARWPEIR VPDNYIGLFE TDSGFLRSEL AIKTWIQLAK EAGCAQLFNC PVTEIRHDDD GVTIETADGE YQAKKAIVCA GTWVKDLLPE LPVQPVRKVF AWYQADGRYS VKNKFPAFTG ELPNGDQYYG FPAENDALKI GKHNGGQVIH SADERVPFAE VVSDGSEAFP FLRNVLPGIG CCLYGAACTY DNSPDEDFII DTLPAHDNTL LITGLSGHGF KFASVLGEIA ADFAQDKKSD FDLTPFRLSR FQ
|
| |