Gene Rsph17025_0754 details


Gene Information

Locus tag: Rsph17025_0754
Symbol:
ID: 5083518
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009428
Strand:
Start bp: 764290
End bp: 766116
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 68%
IMG OID: 640482312
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_001166965
Protein GI: 146276806
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
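
The start, end, gene length and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 764290
end_bp = 766116
gene_length_bp = 1827
protein_length_aa = 608


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1827
print(coding_codons(gene_length_bp))   # 608
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.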


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.633954
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATCC CGACACCTAC AAGTGACCTC GCGCCCGCAG CGGCGGGCTT GCCGGACGTG 
GGGCTGCTGA CGGCACTGGC CAACCAGTTC TTCGCCGCCC TGCCGGGGAC CACGGCACCG
CTTTCGGCCG GCGGACCCTT TGCCCATCCG ATGCCGAACG CCCCCCTGCC CCCAGGCCTT
GCGCCCGGGC CGCTGGCGAA CGGGCCCATG CTGCCGGGGA CCCTTGCCCC CGGCGCGAAT
ATCGTGCCGT CAACGCCCGA ACAGACGCTC GCCGGTATCG CCTCGGCACC CGCCCTTATC
CCCCATTCGG CGGCGGTGAA CGGGCTGCCG GATCATGCGG CCATCGTGCC GCCTGCCGTG
AATGGCCGGT TCGGGGGGCA TTCGTTGGCT GTGCCGCAGG CGGTGCGGCC GCCCGCCCCC
GCGCAACGCC CGCCGCGCGA GGATCGGGTT ACCGCACGCA GCCACGGTCT TCCTAGCGAG
GACGACCTGC GTGCGCTGCT GGCAGGCGGC CCTGTAGCCC CTGCCCTGCC GACGGCACAC
GCTCCGACCG AGCCCACGTT CTACTTCGTT CCAGCACGGC CTGCGGTGCC GCAAGTGCCC
GCCTCAGGTC ATCCGGCCTT CGACGTTCAT GCGGTGCGGC GCGACTTCCC GATCCTGCAA
GAGCGCGTGA ACGGCCGGCC GCTGGTCTGG TTCGACAACG CCGCGACCAC GCAAAAGCCG
CTGGAGGTAC TGGATCGGCT GGACCGTTTC TACCGGCGCG AGAACTCGAA CATTCACCGC
GCGGCGCATG AACTGGCCGG CCGCGCAACG GACGCCTACG AAAAGGCGCG CGAGACGGTG
CGTGCCTTCC TTGGCGCCTC GTCCGCAGAG GAGATCATCT TCGTCCGCGG CGCGACGGAA
GGGATCAACC TGATCGCCAG AACCTGGGGC GCGAAGAACA TCCATGACGG CGACGAGATC
CTCGTCTCAC AGCTCGAACA CCACGCCAAC ATAGTGCCCT GGCAGCAACT GGCGCAGGAC
AAGGGCGCGC GGCTGAAGGT GATCCCGGTT GATGACAGCG GTCAGGTGAT ACTCGAGGAA
GCCCGGCGGC TGATTTCGAC CCGGACGAAG ATCGTGGCCG TGACGCAGGT GTCGAATGCG
CTTGGCACCG TGGTGCCGGT GGTGGAGATC GTGGACCTCG CCCATCGCGC CGGTGCGGTG
GCACTGGTCG ACGGCGCCCA ATCGGTCAGC CACATGCGGG TGGACGTGCA GGCAATCGGG
GCGGATTTCT TCGTCTTCTC GGGCCACAAG GTGTTCGGCC CCACAGGCAT CGGCGCCGTC
TACGGCCGGA AGGCGCTGCT CGACGACATG CCGCCCTGGC AGGGCGGAGG CAACATGATC
GCCGACGTGA CCTTCGAGCG GACGGTGTTC CAGCCGCCAC CGCACCGGTT CGAGGCCGGC
ACCGGAAACA TCGCGGACGC GGTGGGCCTT GGTGCCGCGC TGGACTATGT CGGCCGGATC
GGGATCGAGA CGATCGCACG TTACGAACAC GACCTTCTGG CCTACGCCAC CCACCAGCTT
GCGCCGATCA AGGGCGTGCG GCTGATCGGA ACCGCGCGCG ACAAGGCGTC GGTGTTGTCC
TTTGTGCTGG AAGGGATGAA GCCAGAGGAG GTCGGCCGCG CGCTGAACGC GGAAGGCATC
GCCGTCAGGT CGGGGCATCA TTGCGCCCAG CCGATCCTGC GCCGCTTCGG GCTGGAGGCG
ACGGTGCGGC CGTCGCTGGC CTTCTACAAT ACCTGCGAAG AGGTGGATCG CTTCATCACG
GTCGTCAGGC GTCTGGCGCG GGCCTGA
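
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGACTATCC CGACACCTAC AAGTGACCTC GCGCCCGCAG CGGCGGGCTT GCCGGACGTG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base sample
```

Passing the full sequence text to the same two functions would give the value for the whole gene.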
 
Protein sequence
MTIPTPTSDL APAAAGLPDV GLLTALANQF FAALPGTTAP LSAGGPFAHP MPNAPLPPGL 
APGPLANGPM LPGTLAPGAN IVPSTPEQTL AGIASAPALI PHSAAVNGLP DHAAIVPPAV
NGRFGGHSLA VPQAVRPPAP AQRPPREDRV TARSHGLPSE DDLRALLAGG PVAPALPTAH
APTEPTFYFV PARPAVPQVP ASGHPAFDVH AVRRDFPILQ ERVNGRPLVW FDNAATTQKP
LEVLDRLDRF YRRENSNIHR AAHELAGRAT DAYEKARETV RAFLGASSAE EIIFVRGATE
GINLIARTWG AKNIHDGDEI LVSQLEHHAN IVPWQQLAQD KGARLKVIPV DDSGQVILEE
ARRLISTRTK IVAVTQVSNA LGTVVPVVEI VDLAHRAGAV ALVDGAQSVS HMRVDVQAIG
ADFFVFSGHK VFGPTGIGAV YGRKALLDDM PPWQGGGNMI ADVTFERTVF QPPPHRFEAG
TGNIADAVGL GAALDYVGRI GIETIARYEH DLLAYATHQL APIKGVRLIG TARDKASVLS
FVLEGMKPEE VGRALNAEGI AVRSGHHCAQ PILRRFGLEA TVRPSLAFYN TCEEVDRFIT
VVRRLARA
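
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in TCAG order. Assumption: table 11 shares the
# amino-acid assignments of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGACTATCC CGACACCTAC AAGTGACCTC"))  # MTIPTPTSDL
print(translate("GTCGTCAGGC GTCTGGCGCG GGCCTGA"))     # VVRRLARA
```

The two sample outputs match the first ten and last eight residues of the protein sequence listed above.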