Gene Information |
Locus tag | ECH74115_2394 |
Symbol | sufS |
ID | 6971908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2265309 |
End bp | 2266529 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386267 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_002270749 |
Protein GI | 209398201 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
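
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS length counts one terminal stop codon; the record does not state either convention. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of the record.
length = gene_length(2265309, 2266529)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Under those assumptions the results agree with the Gene Length (1221 bp) and Protein Length (406 aa) rows.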
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0201879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000057861 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACTTTTT CCGTCGACAA AGTGCGGGCC GACTTTCCGG TGCTTTCGCG TGAGGTAAAC GGTTTGCCGC TGGCTTATCT CGACAGCGCC GCCAGTGCGC AGAAACCGAG CCAGGTGATT GACGCCGAGG CCGAGTTTTA TCGTCATGGC TACGCGGCGG TGCATCGCGG TATTCATACC TTAAGCGCCC AGGCGACCGA GAAAATGGAG AACGTGCGCA AGCGGGCATC GCTGTTTATT AATGCCCGTT CGGCGGAAGA GCTGGTGTTC GTCCGCGGCA CGACGGAAGG GATCAATCTG GTCGCCAATA GCTGGGGCAA CAGCAACGTG CGGGCGGGCG ATAACATCAT CATAAGTCAG ATGGAGCACC ACGCTAACAT TGTTCCCTGG CAGATGCTTT GCGCACGCGT TGGCGCAGAG CTGCGTGTGA TCCCGCTCAA TCCCGACGGT ACGTTGCAAC TGGAGACGCT GCCTACGCTG TTTGATGAGA AAACTCGCCT GCTGGCAATT ACTCATGTCT CCAACGTGCT TGGCACAGAA AATCCACTGG CGGAAATGAT CACGCTTGCG CACCAGCATG GCGCAAAAGT GCTGGTGGAT GGCGCTCAGG CGGTGATGCA TCATCCGGTG GATGTTCAGG CGCTGGATTG CGATTTTTAC GTGTTTTCCG GGCATAAACT GTATGGCCCC ACCGGGATTG GCATTCTTTA TGTCAAAGAA GCCTTGCTAC AGGAGATGCC GCCGTGGGAA GGGGGCGGTT CTATGATCGC CACCGTCAGC CTGAGTGAAG GCACTACCTG GACCAAAGCA CCGTGGCGGT TTGAAGCCGG TACACCCAAT ACCGGGGGCA TCATTGGTCT TGGCGCGGCG CTGGAGTATG TTTCGGCGCA GGGGCTTAAT AACATAGCCG AGTATGAACA GAATCTGATG CATTACGCGC TATCACAGCT GGAATCTGTA CCGGATCTCA CTCTCTATGG CCCACAAAAC AGGCTTGGCG TTATTGCTTT TAATCTCGGT AAACACCACG CCTATGATGT TGGCAGTTTT CTCGATAATT ACGGCATTGC TGTGCGTACC GGACATCACT GCGCTATGCC ATTAATGGCC TATTATAACG TCCCTGCTAT GTGTCGGGCG TCGCTGGCCA TGTATAACAC CCATGAAGAA GTGGATCGTC TGGTGACCGG ACTGCAACGT ATTCACCGTC TGCTGGGATA A
Protein sequence | MTFSVDKVRA DFPVLSREVN GLPLAYLDSA ASAQKPSQVI DAEAEFYRHG YAAVHRGIHT LSAQATEKME NVRKRASLFI NARSAEELVF VRGTTEGINL VANSWGNSNV RAGDNIIISQ MEHHANIVPW QMLCARVGAE LRVIPLNPDG TLQLETLPTL FDEKTRLLAI THVSNVLGTE NPLAEMITLA HQHGAKVLVD GAQAVMHHPV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMIATVS LSEGTTWTKA PWRFEAGTPN TGGIIGLGAA LEYVSAQGLN NIAEYEQNLM HYALSQLESV PDLTLYGPQN RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA YYNVPAMCRA SLAMYNTHEE VDRLVTGLQR IHRLLG
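
Both sequences are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any length or composition calculation. A minimal sketch follows, with hypothetical helper names and a three-block excerpt of the gene sequence as input. It is one possible way to recompute values such as GC content, not the method used to produce the figures in this record.

```python
def clean_sequence(raw: str) -> str:
    """Strip all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence shown above.
excerpt = clean_sequence("ATGACTTTTT CCGTCGACAA AGTGCGGGCC")
print(len(excerpt))                    # 30
print(round(gc_fraction(excerpt), 2))  # 0.53
```

Passing the full Gene sequence field through the same two helpers would give values to compare against the Gene Length and GC content rows.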