Gene Information |
Locus tag | EcSMS35_2184 |
Symbol | ssuD |
ID | 6142745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2191457 |
End bp | 2192602 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641617060 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_001744234 |
Protein GI | 170680263 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
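The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the stop codon while the Protein Length does not; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of three")
    return codons - 1 if has_stop_codon else codons


# Values copied from the record: Start bp 2191457, End bp 2192602.
length = gene_length_bp(2191457, 2192602)
print(length)                     # 1146, matching "Gene Length | 1146 bp"
print(protein_length_aa(length))  # 381, matching "Protein Length | 381 aa"
```

Because the gene is listed as not spliced and not a pseudogene, the simple division by three is sufficient here; a spliced gene would need its exon lengths summed first.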
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.505795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.982689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGA ATATGTTCTG GTTTTTACCG ACCCACGGTG ACGGGCATTA TCTGGGAACG GAAGAAGGTT CACGTCCGGT TGATCACGGT TATCTGCAAC AAATTGCGCA AGCGGCGGAT CGTCTGGGCT ATACCGGTGT GCTGATCCCG ACGGGGCGAT CGTGTGAAGA TGCGTGGCTG GTGGCGGCGT CGATGATCCC GGTGACACAA CGGCTGAAGT TTCTTGTCGC CCTGCGCCCC AGTGTGACAT CGCCTACCGT TGCCGCCCGC CAGGCCGCCA CGCTTGACCG TCTCTCTAAC GGACGTGCGT TGTTTAACCT GGTCACAGGC AGCGATCCCC AAGAACTGGC AGGCGACGGC GTGTTCCTTG ATCATAGCGA GCGTTACGAA GCCTCGGCAG AATTTACCCA AGTCTGGCGG CGGTTGTTGC TTGGCGAGAC AGTGGATTTC AACGGCAAAC ATATTCATGT ACGCGGAGCC AAACTGCTCT TCCCGCCGAT TCAACAGCCG TATCCTCCGC TTTACTTTGG CGGATCGTCA GATGTCGCCC AGGAGCTGGC GGCAGAACAA GTTGATCTCT ACCTCACCTG GGGCGAACCA CCGGAACTGG TAAAAGAGAA AATCGAACAA GTGCGGGCGA AAGCTGCCGC GCATGGACGC AAAATTCGTT TCGGTATTCG TCTGCATGTG ATTGTTCGTG AAACCAACGG CGAAGCGTGG CAGGCCGCCG AGCGGTTAAT CTCGCATCTT GATGATGAAA CTATCGCCAA AGCACAGGCC GCATTCGCCC GAACGGATTC CGTAGGACAA CAGCGAATGG CGGCGCTACA TAACGGCAAG CGCGACAATC TGGAGATTAG CCCCAATTTA TGGGCGGGTG TTGGCTTAGT GCGCGGCGGT GCCGGGACGG CGCTGGTGGG CGATGGTCCT ACGGTCGCCG CGCGAATCAA CGAATACGCT GCACTTGGCA TCGACAGTTT TGTGCTTTCG GGCTACCCGC ATCTGGAAGA AGCGTATCGG GTCGGCGAGT TACTGTTCCC ACATCTGGAT GTCGCCATCC CGGAAATTCC CCAGCCACAA CCGCTGAATC CGCAAGGCGA AGCGGTGGCG AATGATTTTA TCCCCCGTAA CGTCGCGCAA AGCTAA
|
Protein sequence | MSLNMFWFLP THGDGHYLGT EEGSRPVDHG YLQQIAQAAD RLGYTGVLIP TGRSCEDAWL VAASMIPVTQ RLKFLVALRP SVTSPTVAAR QAATLDRLSN GRALFNLVTG SDPQELAGDG VFLDHSERYE ASAEFTQVWR RLLLGETVDF NGKHIHVRGA KLLFPPIQQP YPPLYFGGSS DVAQELAAEQ VDLYLTWGEP PELVKEKIEQ VRAKAAAHGR KIRFGIRLHV IVRETNGEAW QAAERLISHL DDETIAKAQA AFARTDSVGQ QRMAALHNGK RDNLEISPNL WAGVGLVRGG AGTALVGDGP TVAARINEYA ALGIDSFVLS GYPHLEEAYR VGELLFPHLD VAIPEIPQPQ PLNPQGEAVA NDFIPRNVAQ S
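The relationship between the two sequence rows above can be sketched as follows. This is a minimal illustration, not the database's own pipeline: it strips the spaces that separate the 10-character blocks, translates codon by codon, and stops at the first stop codon. The codon assignments are those of the standard genetic code, which translation table 11 shares for amino acids (the two tables differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the Gene sequence row, copied from the record.
prefix = "ATGAGTCTGA ATATGTTCTG GTTTTTACCG"
print(translate(prefix))  # MSLNMFWFLP, the first block of the Protein sequence row
```

Passing the full Gene sequence row to `translate` and `gc_fraction` should allow the Protein sequence and GC content fields to be compared against the record in the same way.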