Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1214 |
Symbol | |
ID | 6145897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1218917 |
End bp | 1219921 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616092 |
Product | putative sulfite oxidase subunit YedY |
Protein accession | YP_001743275 |
Protein GI | 170681163 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000894438 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.649922 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA ATCAATTTTT AAAAGAATCA GATGTTACGG CCGAGTCGGT ATTCTTTATG AAGCGTCGAC AGGTGTTAAA AGCACTGGGC ATCAGCGCAG CTGCACTTTC TTTGCCTCAC GCTGCGCATG CCGATCTGCT TAGCTGGTTT AAAGGGAACG ATCGCCCGCC CGCCCCCGCC GGAAAACCGC TGGAGTTCAG CAAGCCTGCC GCCTGGCAAA ATAACCTGCC ACTGACGCCA GTAGATAAAG TCTCCGGTTA TAACAACTTC TATGAATTCG GGCTGGATAA AGCCGATCCC GCCGCTAATG CTGGTAGCCT GAAAACCGAT CCATGGACAC TGAAAATCAG CGGCGAAGTG GCAAAACCAT TGACCCTCGA TCACGATGAT TTAACCCGTC GCTTCCCGCT GGAAGAGCGT ATTTATCGTA TGCGCTGCGT GGAAGCGTGG TCGATGGTGG TGCCGTGGAT TGGTTTTCCG CTGCACAAAT TGCTGGCGCT TGCCGAACCC ACCAGCAATG CGAAGTATGT CGCTTTCGAA ACAATTTATG CACCGGAACA GATGCCTGGC CAGCAGGACC GCTTTATCGG CGGCGGGCTG AAATATCCTT ATGTCGAAGG ATTGCGTCTC GACGAAGCAA TGCATCCGCT CACACTGATG ACCGTGGGTG TTTATGGCAA GGCGTTACCG CCACAAAATG GCGCGCCGGT ACGACTGATT GTGCCGTGGA AATATGGCTT TAAAGGGATT AAATCGATAG TCAGTATTAA GCTGACCCGC GAGCGTCCGC CAACCACCTG GAATCTGGCA GCGCCTGACG AATACGGTTT TTACGCCAAC GTTAATCCGC ATGTTGATCA CCCGCGCTGG TCACAGGCTA CCGAACGATT TATTGGTTCA GGCGGCATCC TCGATGTTCA GCGCCAGCCA ACGCTACTGT TTAATGGTTA CGCCGACCAG GTGGCATCGC TGTATCGTGG CCTGGATTTG CGGGAGAATT TCTGA
|
Protein sequence | MKKNQFLKES DVTAESVFFM KRRQVLKALG ISAAALSLPH AAHADLLSWF KGNDRPPAPA GKPLEFSKPA AWQNNLPLTP VDKVSGYNNF YEFGLDKADP AANAGSLKTD PWTLKISGEV AKPLTLDHDD LTRRFPLEER IYRMRCVEAW SMVVPWIGFP LHKLLALAEP TSNAKYVAFE TIYAPEQMPG QQDRFIGGGL KYPYVEGLRL DEAMHPLTLM TVGVYGKALP PQNGAPVRLI VPWKYGFKGI KSIVSIKLTR ERPPTTWNLA APDEYGFYAN VNPHVDHPRW SQATERFIGS GGILDVQRQP TLLFNGYADQ VASLYRGLDL RENF
|
| |