Gene Rsph17025_2356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2356 
Symbol 
ID5083849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2402181 
End bp2403401 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content67% 
IMG OID640483919 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001168550 
Protein GI146278391 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACG TCGATGCCAT CCGGCAGGAC TTTCCGATCC TCGCGCGCGA GGTGAACGGC 
AAGCCGCTCG TCTATCTCGA CAACGGCGCC TCGGCCCAGA AGCCGGCCTC GGTGATCGAG
GCGATGAACC TCGCCTACAG CCACGAATAT GCCAACGTCC ACCGCGGGCT GCACTACCTG
TCGAACCTCG CGACCGACAA GTACGAGGCC GTGCGCGCGA CCATCGCCAC CTTCCTCAAC
GCGCCTTCGC CCGAGGAGAT CGTCTTCACC ACCGGCACGA CCGAGGGGAT CAACCTTGTC
TCCTACGGCT GGGCGGCGCC CCGGCTTCAG CCGGGCGACG AGATCGTGCT CTCGATCATG
GAGCATCACG CCAACATCGT GCCCTGGCAC TTCCTGCGCG AGCGGCAGGG CGTGGTGCTG
AAATGGGTCG ATGTGGATCA GAACGGCGAC CTCGATCCGC AGGCGGTGCT GGATGCCATC
GGGCCCAGGA CGAAGCTCGT GGCGGTCACG CACATGTCGA ACGTGCTGGG CACGGTGGTC
GATGTGGCCG CGATCTGCGC GGGTGCGCGC GCCAGGGGCG TGCCGGTGCT TGTGGACGGC
TCGCAGGCGG CGGTGCACAT GCCGGTCGAT GTCAGCGCGA TCGGCTGCGA CTTCTACGCC
ATCACCGGCC ACAAGCTCTA CGGTCCCTCG GGTTCGGGCG CGATCTGGAT CCGCTCCGAG
CGGATGGAGG AGATGCGCCC CTTCCTCGGC GGCGGCGACA TGATCCATGA GGTGACGCGG
GATACCGTCA CCTACGCCAA GCCGCCCATG AGGTTCGAGG CGGGCACCCC CGGCATCGTC
CAGCAGATCG GCCTCGGCGT GGCGCTGCAT TACATGATGA ACGTGGGCAT GGCCGAGATT
GCCGCGCACG AGCGCATCCT GCGCGATCAC GCGCGGGACC GGCTGGCCGC CCTCAACTGG
CTGGATGTGC AGGGCAACTC GGCGGGGAAG GGAGCGATCT TCTCCTTCAC CATCCGGGGG
GGCGCGCATG CGCACGACAT CTCGACCGTT CTCGACCGCA AGGGCGTGGC GGTGCGGGCG
GGCACCCATT GCGCCATGCC GCTGATGCAG CATTACGGGC TCGGCGCCAC CTGCCGCGCC
TCCTTCGCCA TGTACAACAC CACTGACGAG GTGGACCGGC TGATCGAGGC GCTTGAACTC
TGCCACGACC TGTTCGGCTG A
 
Protein sequence
MFDVDAIRQD FPILAREVNG KPLVYLDNGA SAQKPASVIE AMNLAYSHEY ANVHRGLHYL 
SNLATDKYEA VRATIATFLN APSPEEIVFT TGTTEGINLV SYGWAAPRLQ PGDEIVLSIM
EHHANIVPWH FLRERQGVVL KWVDVDQNGD LDPQAVLDAI GPRTKLVAVT HMSNVLGTVV
DVAAICAGAR ARGVPVLVDG SQAAVHMPVD VSAIGCDFYA ITGHKLYGPS GSGAIWIRSE
RMEEMRPFLG GGDMIHEVTR DTVTYAKPPM RFEAGTPGIV QQIGLGVALH YMMNVGMAEI
AAHERILRDH ARDRLAALNW LDVQGNSAGK GAIFSFTIRG GAHAHDISTV LDRKGVAVRA
GTHCAMPLMQ HYGLGATCRA SFAMYNTTDE VDRLIEALEL CHDLFG