Gene SbBS512_E1881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1881 
SymbolsufS 
ID6270579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1722276 
End bp1723496 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content55% 
IMG OID641725944 
Productbifunctional cysteine desulfurase/selenocysteine lyase 
Protein accessionYP_001880440 
Protein GI187730071 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000647321 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTTTT CCGTCGACAA AGTGCGGGCC GACTTTCCGG TGCTTTCGCG TGAGGTAAAC 
GGTTTGCCGC TGGCTTATCT CGACAGCGCC GCCAGTGCAC AGAAACCGAG CCAGGTGATT
GACGCCGAGG CCGAGTTTTA TCGTCATGGC TACGCGGCGG TGCATCGCGG TATTCATACC
TTAAGCGCCC AGGCGACCGA GAAAATGGAG AACGTGCGCA AGCGGGCATC GCTGTTTATT
AATGCCCGTT CGGCGGAAGA GCTGGTGTTC GTCCGCGGCA CGACGGAAGG GATCAATCTG
GTCGCCAATA GCTGGGGCAA CAGCAATGTG CGGGCGGGCG ATAACATCAT CATCAGTCAG
ATGGAGCACC ACGCTAACAT TGTCCCCTGG CAGATGCTTT GCGCACGCGT TGGCGCAGAG
CTGCGTGTGA TCCCTCTCAA CCCCGACGGT ACGCTGCAAC TGGAGACGCT GCATACGCTG
TTTGATGAGA AAACTCGCCT GCTGGCAATT ACTCATGTCT CCAACGTGCT TGGCACAGAA
AATCCACTGG CGGGAATGAT CACGCTTGCG CACCAGCATG GCGCAAAAGT GCTGGTGGAT
GGCGCTCAGG CGGTGATGCA TCATCCGGTG GATGTTCAGG CGCTGGATTG CGATTTTTAC
GTGTTTTCCG GGCATAAACT GTATGGCCCC ACCGGGATTG GCATTCTTTA TGTCAAAGAA
GCCTTGTTGC AGGAGATGCC GCCGTGGGAA GGGGGCGGTT CTATGATCGC CACCGTCAGC
CTGAGTGAAG GCACTACCTG GACCAAAGCA CCATGGCGGT TTGAAGCCGG TACACCCAAT
ACCGGGGGCA TCATTGGTCT TGGCGCGGCG CTGGAATATG TTTCGGTGCT GGGGCTTAAT
AACATAGCCG AGTATGAACT GAATCTGATG CATTACGCGC TATCACAGCT GGAATCTGTA
CCGAATCTCA CTCTCTATGG CCCACAAAAC AGGCTTGGCG TTATTGCTTT TAATCTCGGA
AAACACCACG CCTATGATGT TGGCAGTTTT CTCGATAATT ACGGCATTGC TGTGCGTACC
GGACATCACT GCGCTATGCC ATTAATGGCC TATTACAACG TCCCTGCGAT GTGTCGGGCG
TCGCTGGCCA TGTATAACAC TCATGAAGAA GTGGATCGTC TGGTGACCGG CCTGCAACGT
ATTCACCGTC TGCTGGGATA A
 
Protein sequence
MTFSVDKVRA DFPVLSREVN GLPLAYLDSA ASAQKPSQVI DAEAEFYRHG YAAVHRGIHT 
LSAQATEKME NVRKRASLFI NARSAEELVF VRGTTEGINL VANSWGNSNV RAGDNIIISQ
MEHHANIVPW QMLCARVGAE LRVIPLNPDG TLQLETLHTL FDEKTRLLAI THVSNVLGTE
NPLAGMITLA HQHGAKVLVD GAQAVMHHPV DVQALDCDFY VFSGHKLYGP TGIGILYVKE
ALLQEMPPWE GGGSMIATVS LSEGTTWTKA PWRFEAGTPN TGGIIGLGAA LEYVSVLGLN
NIAEYELNLM HYALSQLESV PNLTLYGPQN RLGVIAFNLG KHHAYDVGSF LDNYGIAVRT
GHHCAMPLMA YYNVPAMCRA SLAMYNTHEE VDRLVTGLQR IHRLLG