Gene Information |
Locus tag | EcSMS35_1516 |
Symbol | sufS |
ID | 6144297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1502341 |
End bp | 1503561 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616394 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_001743574 |
Protein GI | 170680646 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
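The coordinates and lengths in the table above agree with each other, and the check is simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which the reported gene length implies) and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1502341, 1503561)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both values match the Gene Length and Protein Length rows of the record.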
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000472284 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.000000602764 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTTTTT CCGTCGACAA AGTGCGGGCC GACTTTCCGG TGCTTTCTCG TGAGGTGAAC GGTTTGCCGC TGGCTTATCT CGACAGCGCC GCCAGTGCAC AGAAACCGAG CCAGGTGATT GACGCCGAGG CCGAGTTTTA TCGTCATGGC TACGCGGCGG TGCATCGCGG TATTCATACC TTAAGCGCCC AGGCGACCGA GAAAATGGAG AACGTGCGCA AGCGGGCATC GCTGTTTATT AATGCCCGTT CGGCGGAAGA GCTGGTGTTC GTCCGCGGCA CGACGGAAGG GATCAATCTG GTCGCCAATA GCTGGGGTAA CAGCAATGTG CGGGCGGGCG ATAACATCAT CATCAGTCAG ATGGAGCACC ACGCTAACAT TGTCCCCTGG CAGATGCTTT GCGCACGCGT TGGCGCAGAG CTGCGTGTGA TCCCGCTCAA CCCCGACGGT ACGCTGCAAC TGGAGATGCT ACCTAATCTG TTCGATGAGA AAACTCGCCT GCTGGCAATT ACTCATGTCT CCAACGTGCT GGGAACGGAA AATCCGCTGG CGGAAATGAT CACGCTTGCG CACCAGCATG GCGCAAAAGT GCTGGTGGAT GGCGCTCAGG CGGTGATGCA TCATCCGGTG GATGTTCAGG CGCTGGATTG CGATTTTTAC GTGTTTTCCG GGCATAAACT GTATGGCCCC ACCGGGATTG GCATTCTTTA TGTCAAAGAA GCCTTGTTGC AGGAGATGCC GCCGTGGGAA GGGGGCGGTT CTATGATCGC CACCGTCAGC CTGAGTGAAG GCACTACCTG GACCAAAGCA CCATGGCGGT TTGAAGCCGG TACACCCAAT ACCGGGGGCA TCATTGGTCT TGGCGCGGCG CTGGAGTATG TGTCGGCGCT GGGGCTTAAT AACATAGCCG AGTATGAACA GAATCTGATG CACTACGCGC TATCACAGCT GGAATCTGTA CCGGATCTCA CTCTCTATGG CCCGCAGAAC AGGCTTGGCG TTATTGCTTT TAATCTCGGT AAACACCACG CCTATGATGT TGGCAGTTTT CTCGATAACT ACGGCATTGC TGTGCGTACC GGACATCACT GCGCAATGCC ATTGATGGCC TATTACAACG TTCCTGCTAT GTGTCGGGCG TCGCTGGCCA TGTATAACAC CCATGAAGAA GTGGATCGTC TGGTGACCGG CCTGCAACGT ATTCACCGTT TGCTGGGATA A
|
Protein sequence | MTFSVDKVRA DFPVLSREVN GLPLAYLDSA ASAQKPSQVI DAEAEFYRHG YAAVHRGIHT LSAQATEKME NVRKRASLFI NARSAEELVF VRGTTEGINL VANSWGNSNV RAGDNIIISQ MEHHANIVPW QMLCARVGAE LRVIPLNPDG TLQLEMLPNL FDEKTRLLAI THVSNVLGTE NPLAEMITLA HQHGAKVLVD GAQAVMHHPV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMIATVS LSEGTTWTKA PWRFEAGTPN TGGIIGLGAA LEYVSALGLN NIAEYEQNLM HYALSQLESV PDLTLYGPQN RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA YYNVPAMCRA SLAMYNTHEE VDRLVTGLQR IHRLLG
|
| |
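The protein sequence follows from the gene sequence through translation table 11, listed in the record. The sketch below is a hedged illustration, not the database's own pipeline: it removes the spaces that group the sequence into blocks of ten, then translates codon by codon until the first stop. The codon table is built from the standard NCBI amino-acid string, which tables 1 and 11 share (table 11 differs only in the start codons it permits). The helper names `clean` and `translate` are hypothetical. Because this gene starts with ATG, alternative start codons need no special handling here.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino-acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGACTTTTT CCGTCGACAA AGTGCGGGCC"))  # MTFSVDKVRA
# Final 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("TTGCTGGGAT AA"))                     # LLG
```

The two outputs match the first block and the last three residues of the protein sequence above. Passing the full gene sequence to `translate` would allow the same comparison over all 406 residues.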