Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1896 |
Symbol | sufS |
ID | 5586353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 1885122 |
End bp | 1886342 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640925571 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_001462974 |
Protein GI | 157158206 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000000554851 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTTT CCGTCGACAA AGTGCGGGCC GACTTTCCGG TGCTTTCGCG TGAGGTAAAC GGTTTGCCGC TGGCTTATCT CGACAGCGCC GCCAGTGCGC AGAAACCAAG CCAGGTGATT GACTCCGAAG CCGAGTTTTA TCGTCACGGC TACGCGGCGG TGCATCGCGG TATTCATACC TTAAGCGCCC AGGCGACCGA GAAAATGGAG AACGTGCGCA AGCGGGCATC GCTGTTTATT AATGCCCGTT CGGCGGAAGA GCTGGTGTTC GTCCGCGGCA CGACGGAAGG GATCAATCTG GTCGCCAATA GTTGGGGCAA CAGCAACGTG CGGGCGGGCG ATAACATCAT CATCAGTCAG ATGGAGCACC ACGCTAACAT TGTTCCCTGG CAGATGCTTT GCGCACGCGT TGGCGCAGAG CTGCGTGTGA TCCCGCTCAA TCCCGACGGT ACGTTGCAAC TGGAGACGCT GCCTACGCTG TTTGATGAGA AAACTCGCCT GCTGGCAATT ACTCATGTCT CCAACGTGCT TGGCACAGAA AATCCACTGG CGGAAATGAT CACGCTTGCG CACCAGCATG GCGCAAAAGT GCTGGTGGAT GGCGCTCAGG CGGTGATGCA TCATCCGGTG CATGTTCAGG CGCTGGATTG CGACTTTTAC GTGTTCTCCG GGCATAAACT GTATGGCCCC ACCGGAATTG GCATTCTTTA TGTGAAAGAA GCCTTGTTGC AGGAGATGCC GCCGTGGGAA GGGGGCGGTT CTATGATCGC CACCGTCAGC CTGAGTGAAG GCACTACCTG GACCAAAGCA CCATGGCGGT TTGAAGCCGG TACACCCAAT ACCGGGGGCA TCATTGGTCT TGGCGCGGCG CTGGAGTATG TTTCGGCGCT GGGGCTTAAT AGCATAGCCG AGTATGAACA GAATCTGATG CATTATGCGC TATCACAGCT GGAATCTGTA CCGGATCTCA CTCTCTATGG CCCACAAAAC AGGCTTGGCG TTATTGCTTT TAATCTTGGT AAACACCACG CCTATGATGT TGGCAGTTTT CTCGATAACT ACGGCATTGC TGTGCGTACC GGACATCACT GCGCTATGCC ATTAATGGCC TATTACAACG TCCCTGCTAT GTGTCGGGCG TCGCTGGCCA TGTATAACAC CCATGAAGAA GTGGATCGTC TGGTGACCGG ACTGCAACGT ATTCACCGTC TGCTGGGATA A
|
Protein sequence | MTFSVDKVRA DFPVLSREVN GLPLAYLDSA ASAQKPSQVI DSEAEFYRHG YAAVHRGIHT LSAQATEKME NVRKRASLFI NARSAEELVF VRGTTEGINL VANSWGNSNV RAGDNIIISQ MEHHANIVPW QMLCARVGAE LRVIPLNPDG TLQLETLPTL FDEKTRLLAI THVSNVLGTE NPLAEMITLA HQHGAKVLVD GAQAVMHHPV HVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMIATVS LSEGTTWTKA PWRFEAGTPN TGGIIGLGAA LEYVSALGLN SIAEYEQNLM HYALSQLESV PDLTLYGPQN RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA YYNVPAMCRA SLAMYNTHEE VDRLVTGLQR IHRLLG
|
| |