Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1182 |
Symbol | solA |
ID | 5588901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1196932 |
End bp | 1198050 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640924881 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_001462293 |
Protein GI | 157155462 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT GCAACCCGCG CCGGTTTAAA CGTGCTAATG ACCGACGCCC ATATGCCACC GCATCAACAC GGCAGCCACC ACGGCGATAC GCGATTAATT CGCCATGCTT ATGGTGAAGG CGAAAAGTAT GTCCCGCTGG TCCTCCGCGC GCAAACGCTG TGGGATGAAC TTTCCCGCCA CAACGAAGAT GATCCCATTT TTGTACGCTC TGGTGTCATT AACCTCGGCC CGGCTGACTC CGCATTTCTC GCCAACGTCG CCCACAGCGC CGAACAGTGG CAACTCAACG TCGAAAAGCT CGACGCGCAA GGGATTATGG CCCGCTGGCC AGAAATACGC GTCCCGGACA ACTACATCGG CTTATTTGAG ACTGATTCCG GTTTTTTGCG CAGCGAACTG GCGATTAAAA CCTGGATCCA ACTGGCGAAG GAAGCGGGCT GTGCGCAGCT GTTCAACTGC CCGGTCACCG CAATTCGTCA TGACGATGAT GGCGTAACTA TTGAAACGGC TGACGGAGAG TATCAGGCGA AAAAAGCGAT TGTCTGCGCG GGAACATGGG TAAAAGACCT GCTCCCGGAG CTGCCTGTCC AGCCCGTACG CAAAGTATTT GCCTGGTATC AGGCCGATGG TCGCTATAGC GTGAAGAATA AATTCCCGGC GTTTACCGGT GAACTGCCCA ATGGCGATCA ATATTATGGT TTTCCGGCAG AAAACGACGC GTTGAAGATT GGCAAACATA ACGGAGGCCA GGTTATCCAT TCAGCGGATG AACGTGTTCC GTTTGCGGAA GTGGTCAGCG ATGGTTCGGA AGCCTTCCCG TTCTTGCGCA ATGTATTGCC GGGTATCGGT TGCTGCCTGT ACGGCGCTGC CTGCACCTAT GATAATTCGC CTGACGAAGA TTTTATTATC GATACCCTAC CCGACCACGA TAATACACTG TTCATTACCG GCCTGAGTGG GCACGGTTTT AAATTTGCGT CAGTTTTAGG GGAAATAGCT GCCGATTTTG CGCAAGACAA AAAAAGCGAT TTTGATTTAA CGCCATTCAG GCTTTCCCGC TTCCAATAA
|
Protein sequence | MKYDLIIIGS GSVGAAAGYY ATRAGLNVLM TDAHMPPHQH GSHHGDTRLI RHAYGEGEKY VPLVLRAQTL WDELSRHNED DPIFVRSGVI NLGPADSAFL ANVAHSAEQW QLNVEKLDAQ GIMARWPEIR VPDNYIGLFE TDSGFLRSEL AIKTWIQLAK EAGCAQLFNC PVTAIRHDDD GVTIETADGE YQAKKAIVCA GTWVKDLLPE LPVQPVRKVF AWYQADGRYS VKNKFPAFTG ELPNGDQYYG FPAENDALKI GKHNGGQVIH SADERVPFAE VVSDGSEAFP FLRNVLPGIG CCLYGAACTY DNSPDEDFII DTLPDHDNTL FITGLSGHGF KFASVLGEIA ADFAQDKKSD FDLTPFRLSR FQ
|
| |