Gene Rsph17029_2088 details


Gene Information

Locus tag: Rsph17029_2088
Symbol:
ID: 4896117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2212090
End bp: 2213310
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 67%
IMG OID: 640112682
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_001043963
Protein GI: 126462849
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
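
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with it) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def cds_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = cds_span(2212090, 2213310)
print(span)                           # 1221, matches "Gene Length"
print(expected_protein_length(span))  # 406, matches "Protein Length"
```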


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.519115
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGACG TCGCCTCCAT CCGCCGGGAC TTTCCGATCC TTTCGCGCGA AGTGAACGGC 
AAGCCGCTGG TCTATCTCGA CAATGGCGCC TCGGCGCAGA AGCCGCAGGT GGTGATCGAG
GCGATGAACC TCGCCTACAG CCACGAATAT GCGAATGTCC ACCGCGGGCT GCACTATCTG
TCGAACCTCG CGACCGACAA GTACGAGGCG GTGCGGGCGA CCATCGCGAC CTTCCTCAAC
GCGCCCTCGC CCGAAGAGAT CGTCTTCACC ACCGGCACCA CCGAGGGGAT CAACCTCGTC
TCCTACGGCT GGGCCGCGCC GCGGCTTCAG CCGGGCGACG AGATCGTGCT CTCGATCATG
GAGCATCACG CCAACATCGT GCCGTGGCAC TTCCTGCGCG AGCGGCAGGG CGTGGTGCTG
AAATGGGTCG ATGTGGATCA GAACGGCGAT CTCGACCCGC AGGCGGTGAT CGATGCGATC
GGCCCGAAGA CGAAGCTCGT CGCGGTCACT CACATGTCGA ACGTGCTCGG CACCGTGGTG
GATGTGGCCG CGATCTGCGC CGGGGCGCGC GACAGGGGCG TGCCGGTGCT GGTCGACGGG
TCGCAGGCGG CCGTGCACAT GCCGGTCGAT GTGGCGGCCA TCGGCTGCGA CTTCTACGCC
ATCACCGGCC ACAAGCTCTA CGGCCCCTCG GGCTCGGGCG CGATCTGGAT CCGGTCGGAG
CGGATGGAGG AGATGCGTCC CTTCATCGGC GGCGGCGACA TGATCCACGA GGTGACGCGC
GATACCGTCA CCTATGCCAG GCCCCCGATG CGGTTCGAGG CGGGCACGCC CGGCATCGTG
CAGCAGATCG GCCTCGGCGT GGCGCTGCAC TACATGATGA ATGTGGGAAT GGCCGAGATC
GCCGCGCATG AACGCACGCT GCGCGACTAT GCGCGCGATA GGCTCGCGGG CCTCAACTGG
CTCGACGTGC AGGGCAATTC GGCGGGGAAG GGGGCGATCT TCTCCTTCAC GATCCGCGGC
GGGGCCCATG CGCACGACAT CTCGACCGTG CTCGACCGCA AGGGCGTCGC GGTGCGCGCC
GGCACCCACT GCGCCATGCC GCTCATGCAG CATATGGGGG TCGGCGCCAC CTGCCGCGCC
TCCTTCGCCA TGTACAACAC CCCCGACGAG GTGGACCGTC TGGTCGAGGC GCTGGAGCTC
TGCCACGAGC TCTTCGGCTG A
 
Protein sequence
MFDVASIRRD FPILSREVNG KPLVYLDNGA SAQKPQVVIE AMNLAYSHEY ANVHRGLHYL 
SNLATDKYEA VRATIATFLN APSPEEIVFT TGTTEGINLV SYGWAAPRLQ PGDEIVLSIM
EHHANIVPWH FLRERQGVVL KWVDVDQNGD LDPQAVIDAI GPKTKLVAVT HMSNVLGTVV
DVAAICAGAR DRGVPVLVDG SQAAVHMPVD VAAIGCDFYA ITGHKLYGPS GSGAIWIRSE
RMEEMRPFIG GGDMIHEVTR DTVTYARPPM RFEAGTPGIV QQIGLGVALH YMMNVGMAEI
AAHERTLRDY ARDRLAGLNW LDVQGNSAGK GAIFSFTIRG GAHAHDISTV LDRKGVAVRA
GTHCAMPLMQ HMGVGATCRA SFAMYNTPDE VDRLVEALEL CHELFG
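
The gene and protein sequences can be checked against each other by translating codon by codon. The following is a minimal sketch using only the Python standard library. It builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which does not matter here because the gene starts with ATG. The `translate` helper is hypothetical, written for this illustration.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGTTCGACG TCGCCTCCAT CCGCCGGGAC"))  # MFDVASIRRD
print(translate("TGCCACGAGC TCTTCGGCTG A"))           # CHELFG
```

Passing the full gene sequence to `translate` in the same way gives a string that can be compared with the protein sequence once its spaces are removed.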