Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2088 |
Symbol | |
ID | 4896117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2212090 |
End bp | 2213310 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640112682 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001043963 |
Protein GI | 126462849 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.519115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGACG TCGCCTCCAT CCGCCGGGAC TTTCCGATCC TTTCGCGCGA AGTGAACGGC AAGCCGCTGG TCTATCTCGA CAATGGCGCC TCGGCGCAGA AGCCGCAGGT GGTGATCGAG GCGATGAACC TCGCCTACAG CCACGAATAT GCGAATGTCC ACCGCGGGCT GCACTATCTG TCGAACCTCG CGACCGACAA GTACGAGGCG GTGCGGGCGA CCATCGCGAC CTTCCTCAAC GCGCCCTCGC CCGAAGAGAT CGTCTTCACC ACCGGCACCA CCGAGGGGAT CAACCTCGTC TCCTACGGCT GGGCCGCGCC GCGGCTTCAG CCGGGCGACG AGATCGTGCT CTCGATCATG GAGCATCACG CCAACATCGT GCCGTGGCAC TTCCTGCGCG AGCGGCAGGG CGTGGTGCTG AAATGGGTCG ATGTGGATCA GAACGGCGAT CTCGACCCGC AGGCGGTGAT CGATGCGATC GGCCCGAAGA CGAAGCTCGT CGCGGTCACT CACATGTCGA ACGTGCTCGG CACCGTGGTG GATGTGGCCG CGATCTGCGC CGGGGCGCGC GACAGGGGCG TGCCGGTGCT GGTCGACGGG TCGCAGGCGG CCGTGCACAT GCCGGTCGAT GTGGCGGCCA TCGGCTGCGA CTTCTACGCC ATCACCGGCC ACAAGCTCTA CGGCCCCTCG GGCTCGGGCG CGATCTGGAT CCGGTCGGAG CGGATGGAGG AGATGCGTCC CTTCATCGGC GGCGGCGACA TGATCCACGA GGTGACGCGC GATACCGTCA CCTATGCCAG GCCCCCGATG CGGTTCGAGG CGGGCACGCC CGGCATCGTG CAGCAGATCG GCCTCGGCGT GGCGCTGCAC TACATGATGA ATGTGGGAAT GGCCGAGATC GCCGCGCATG AACGCACGCT GCGCGACTAT GCGCGCGATA GGCTCGCGGG CCTCAACTGG CTCGACGTGC AGGGCAATTC GGCGGGGAAG GGGGCGATCT TCTCCTTCAC GATCCGCGGC GGGGCCCATG CGCACGACAT CTCGACCGTG CTCGACCGCA AGGGCGTCGC GGTGCGCGCC GGCACCCACT GCGCCATGCC GCTCATGCAG CATATGGGGG TCGGCGCCAC CTGCCGCGCC TCCTTCGCCA TGTACAACAC CCCCGACGAG GTGGACCGTC TGGTCGAGGC GCTGGAGCTC TGCCACGAGC TCTTCGGCTG A
|
Protein sequence | MFDVASIRRD FPILSREVNG KPLVYLDNGA SAQKPQVVIE AMNLAYSHEY ANVHRGLHYL SNLATDKYEA VRATIATFLN APSPEEIVFT TGTTEGINLV SYGWAAPRLQ PGDEIVLSIM EHHANIVPWH FLRERQGVVL KWVDVDQNGD LDPQAVIDAI GPKTKLVAVT HMSNVLGTVV DVAAICAGAR DRGVPVLVDG SQAAVHMPVD VAAIGCDFYA ITGHKLYGPS GSGAIWIRSE RMEEMRPFIG GGDMIHEVTR DTVTYARPPM RFEAGTPGIV QQIGLGVALH YMMNVGMAEI AAHERTLRDY ARDRLAGLNW LDVQGNSAGK GAIFSFTIRG GAHAHDISTV LDRKGVAVRA GTHCAMPLMQ HMGVGATCRA SFAMYNTPDE VDRLVEALEL CHELFG
|
| |