Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1881 |
Symbol | sufS |
ID | 6270579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 1722276 |
End bp | 1723496 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641725944 |
Product | bifunctional cysteine desulfurase/selenocysteine lyase |
Protein accession | YP_001880440 |
Protein GI | 187730071 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000647321 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTTT CCGTCGACAA AGTGCGGGCC GACTTTCCGG TGCTTTCGCG TGAGGTAAAC GGTTTGCCGC TGGCTTATCT CGACAGCGCC GCCAGTGCAC AGAAACCGAG CCAGGTGATT GACGCCGAGG CCGAGTTTTA TCGTCATGGC TACGCGGCGG TGCATCGCGG TATTCATACC TTAAGCGCCC AGGCGACCGA GAAAATGGAG AACGTGCGCA AGCGGGCATC GCTGTTTATT AATGCCCGTT CGGCGGAAGA GCTGGTGTTC GTCCGCGGCA CGACGGAAGG GATCAATCTG GTCGCCAATA GCTGGGGCAA CAGCAATGTG CGGGCGGGCG ATAACATCAT CATCAGTCAG ATGGAGCACC ACGCTAACAT TGTCCCCTGG CAGATGCTTT GCGCACGCGT TGGCGCAGAG CTGCGTGTGA TCCCTCTCAA CCCCGACGGT ACGCTGCAAC TGGAGACGCT GCATACGCTG TTTGATGAGA AAACTCGCCT GCTGGCAATT ACTCATGTCT CCAACGTGCT TGGCACAGAA AATCCACTGG CGGGAATGAT CACGCTTGCG CACCAGCATG GCGCAAAAGT GCTGGTGGAT GGCGCTCAGG CGGTGATGCA TCATCCGGTG GATGTTCAGG CGCTGGATTG CGATTTTTAC GTGTTTTCCG GGCATAAACT GTATGGCCCC ACCGGGATTG GCATTCTTTA TGTCAAAGAA GCCTTGTTGC AGGAGATGCC GCCGTGGGAA GGGGGCGGTT CTATGATCGC CACCGTCAGC CTGAGTGAAG GCACTACCTG GACCAAAGCA CCATGGCGGT TTGAAGCCGG TACACCCAAT ACCGGGGGCA TCATTGGTCT TGGCGCGGCG CTGGAATATG TTTCGGTGCT GGGGCTTAAT AACATAGCCG AGTATGAACT GAATCTGATG CATTACGCGC TATCACAGCT GGAATCTGTA CCGAATCTCA CTCTCTATGG CCCACAAAAC AGGCTTGGCG TTATTGCTTT TAATCTCGGA AAACACCACG CCTATGATGT TGGCAGTTTT CTCGATAATT ACGGCATTGC TGTGCGTACC GGACATCACT GCGCTATGCC ATTAATGGCC TATTACAACG TCCCTGCGAT GTGTCGGGCG TCGCTGGCCA TGTATAACAC TCATGAAGAA GTGGATCGTC TGGTGACCGG CCTGCAACGT ATTCACCGTC TGCTGGGATA A
|
Protein sequence | MTFSVDKVRA DFPVLSREVN GLPLAYLDSA ASAQKPSQVI DAEAEFYRHG YAAVHRGIHT LSAQATEKME NVRKRASLFI NARSAEELVF VRGTTEGINL VANSWGNSNV RAGDNIIISQ MEHHANIVPW QMLCARVGAE LRVIPLNPDG TLQLETLHTL FDEKTRLLAI THVSNVLGTE NPLAGMITLA HQHGAKVLVD GAQAVMHHPV DVQALDCDFY VFSGHKLYGP TGIGILYVKE ALLQEMPPWE GGGSMIATVS LSEGTTWTKA PWRFEAGTPN TGGIIGLGAA LEYVSVLGLN NIAEYELNLM HYALSQLESV PNLTLYGPQN RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT GHHCAMPLMA YYNVPAMCRA SLAMYNTHEE VDRLVTGLQR IHRLLG
|
| |