Gene Dshi_1623 details


Gene Information

Locus tag: Dshi_1623
Symbol: sufS
ID: 5712768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1688923
End bp: 1690143
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 58%
IMG OID: 641267539
Product: cysteine desulfurase
Protein accession: YP_001532966
Protein GI: 159044172
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
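
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return gene_length, codons - 1  # drop the stop codon


# Values copied from the record for Dshi_1623.
gene_bp, protein_aa = cds_lengths(1688923, 1690143)
print(gene_bp, protein_aa)  # 1221 406
```

Under these assumptions the result agrees with the Gene Length (1221 bp) and Protein Length (406 aa) fields.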


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.0344832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACAACG TTGATGAGAT CCGGTCTGAT TTTCCGATTC TGTCGCGCCA AGTGAACGGC 
AAGCCACTGG TCTATCTCGA CAATGGGGCG TCGGCGCAAA AGCCACAGGT CGTGATTGAT
GCAGTCACCC GCGCTTACGC TGAGGAATAC GCCAATGTAC ACCGCGGCCT GCACTATTTG
AGCAATCTCG CCACCGAGAA ATACGAAAGC GTGCGTGGTA CTATCGCGCG GTTCTTGGGC
GTGGCGGACG AAAACCAGAT TGTCCTGAAT TCCGGGACCA CCGAAGGGAT CAACCTCGTC
GCCTATGGCT GGGCCATGCC CCGGATGGTG GCTGGCGACG AGATCGTTCT GTCGGTGATG
GAGCACCACG CCAACATCGT GCCATGGCAT TTTTTGCGAG CGCGACAGGG CGTGGTCCTG
AAATGGGTCG ACGTTGATGC CACCGGTGCG CTTGATGCGC AGAAAGTCAT CGACGCGATC
GGTCCGAAAA CCAAGCTCGT AGCCATCACG CAACTGTCGA ATGTCCTAGG CTGCAAGGTC
GACGTAAAGG CGATCACCGA GGCGGCCCAC GCCAAGGGCG TGGCTGTTTT GGTGGATGGC
AGCCAGGCGG CCGTTCATAT GCCCGTGAGT GTCGACGATC TTGGCTGCGA TTTCTATGCC
ATCACAGGGC ACAAGCTCTA TGGGCCCTCG GGGTCCGGTG CGATCTTCAT CAAGTCCGAG
CGGATGGCTG AGATGCGGCC TTTCATCGGG GGTGGGGATA TGATCCGCGA TGTGACGCGG
GAGTTTGTCA CCTACAACGA CCCGCCAATG AAGTTCGAGG CCGGCACACC GGGTATTGTG
CAGACCATCG GACTTGGCGT GGCTCTCGAT TACATGATGG GTCTTGGGAT GGAGAATATC
GCTGCCCATG AAGACAAGCT GCGGGATTAT GCGCGCACCC GGCTCGATGG ATTGAACTGG
TTGAATGTGC AGGGTCAGAC ACCGGACAAG GCTGCTATTT TCTCGTTCAC GCTGGAGGGG
GCAGCACATG CCCATGATAT CTCCACCGTG CTAGACAAGA AGGGCGTTGC AGTACGCGCT
GGCCATCACT GCGCCCAGCC TTTGATGGAA CATATGGGCG TTCCAGCGAC CTGTCGCGCA
TCCTTCGGGC TCTACAATAC AGAGGCCGAG GTGGATGTGC TGGTGGATGC GCTGGAGCTT
TGTCACGAGC TGTTCGGGTA G
 
Protein sequence
MYNVDEIRSD FPILSRQVNG KPLVYLDNGA SAQKPQVVID AVTRAYAEEY ANVHRGLHYL 
SNLATEKYES VRGTIARFLG VADENQIVLN SGTTEGINLV AYGWAMPRMV AGDEIVLSVM
EHHANIVPWH FLRARQGVVL KWVDVDATGA LDAQKVIDAI GPKTKLVAIT QLSNVLGCKV
DVKAITEAAH AKGVAVLVDG SQAAVHMPVS VDDLGCDFYA ITGHKLYGPS GSGAIFIKSE
RMAEMRPFIG GGDMIRDVTR EFVTYNDPPM KFEAGTPGIV QTIGLGVALD YMMGLGMENI
AAHEDKLRDY ARTRLDGLNW LNVQGQTPDK AAIFSFTLEG AAHAHDISTV LDKKGVAVRA
GHHCAQPLME HMGVPATCRA SFGLYNTEAE VDVLVDALEL CHELFG
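
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is an illustrative example, not the method used to produce this record. It builds the codon table from the amino acid string of NCBI translation table 11 (whose amino acid assignments match the standard code), strips the spacing used in the sequence blocks, and stops at the first stop codon. It does not handle alternative start codons, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with table 11 amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence codon by codon until the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above.
first_line = "ATGTACAACG TTGATGAGAT CCGGTCTGAT TTTCCGATTC TGTCGCGCCA AGTGAACGGC"
print(translate(first_line))  # MYNVDEIRSDFPILSRQVNG
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.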