Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1623 |
Symbol | sufS |
ID | 5712768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1688923 |
End bp | 1690143 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641267539 |
Product | cysteine desulfurase |
Protein accession | YP_001532966 |
Protein GI | 159044172 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0344832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAACG TTGATGAGAT CCGGTCTGAT TTTCCGATTC TGTCGCGCCA AGTGAACGGC AAGCCACTGG TCTATCTCGA CAATGGGGCG TCGGCGCAAA AGCCACAGGT CGTGATTGAT GCAGTCACCC GCGCTTACGC TGAGGAATAC GCCAATGTAC ACCGCGGCCT GCACTATTTG AGCAATCTCG CCACCGAGAA ATACGAAAGC GTGCGTGGTA CTATCGCGCG GTTCTTGGGC GTGGCGGACG AAAACCAGAT TGTCCTGAAT TCCGGGACCA CCGAAGGGAT CAACCTCGTC GCCTATGGCT GGGCCATGCC CCGGATGGTG GCTGGCGACG AGATCGTTCT GTCGGTGATG GAGCACCACG CCAACATCGT GCCATGGCAT TTTTTGCGAG CGCGACAGGG CGTGGTCCTG AAATGGGTCG ACGTTGATGC CACCGGTGCG CTTGATGCGC AGAAAGTCAT CGACGCGATC GGTCCGAAAA CCAAGCTCGT AGCCATCACG CAACTGTCGA ATGTCCTAGG CTGCAAGGTC GACGTAAAGG CGATCACCGA GGCGGCCCAC GCCAAGGGCG TGGCTGTTTT GGTGGATGGC AGCCAGGCGG CCGTTCATAT GCCCGTGAGT GTCGACGATC TTGGCTGCGA TTTCTATGCC ATCACAGGGC ACAAGCTCTA TGGGCCCTCG GGGTCCGGTG CGATCTTCAT CAAGTCCGAG CGGATGGCTG AGATGCGGCC TTTCATCGGG GGTGGGGATA TGATCCGCGA TGTGACGCGG GAGTTTGTCA CCTACAACGA CCCGCCAATG AAGTTCGAGG CCGGCACACC GGGTATTGTG CAGACCATCG GACTTGGCGT GGCTCTCGAT TACATGATGG GTCTTGGGAT GGAGAATATC GCTGCCCATG AAGACAAGCT GCGGGATTAT GCGCGCACCC GGCTCGATGG ATTGAACTGG TTGAATGTGC AGGGTCAGAC ACCGGACAAG GCTGCTATTT TCTCGTTCAC GCTGGAGGGG GCAGCACATG CCCATGATAT CTCCACCGTG CTAGACAAGA AGGGCGTTGC AGTACGCGCT GGCCATCACT GCGCCCAGCC TTTGATGGAA CATATGGGCG TTCCAGCGAC CTGTCGCGCA TCCTTCGGGC TCTACAATAC AGAGGCCGAG GTGGATGTGC TGGTGGATGC GCTGGAGCTT TGTCACGAGC TGTTCGGGTA G
|
Protein sequence | MYNVDEIRSD FPILSRQVNG KPLVYLDNGA SAQKPQVVID AVTRAYAEEY ANVHRGLHYL SNLATEKYES VRGTIARFLG VADENQIVLN SGTTEGINLV AYGWAMPRMV AGDEIVLSVM EHHANIVPWH FLRARQGVVL KWVDVDATGA LDAQKVIDAI GPKTKLVAIT QLSNVLGCKV DVKAITEAAH AKGVAVLVDG SQAAVHMPVS VDDLGCDFYA ITGHKLYGPS GSGAIFIKSE RMAEMRPFIG GGDMIRDVTR EFVTYNDPPM KFEAGTPGIV QTIGLGVALD YMMGLGMENI AAHEDKLRDY ARTRLDGLNW LNVQGQTPDK AAIFSFTLEG AAHAHDISTV LDKKGVAVRA GHHCAQPLME HMGVPATCRA SFGLYNTEAE VDVLVDALEL CHELFG
|
| |