Gene RPC_3637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3637 
Symbol 
ID3970652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4044565 
End bp4046601 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content71% 
IMG OID637926745 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_533491 
Protein GI90425121 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.51832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.140258 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGC CTCACGCGGC CAGCGGCGCC GCGTTGCCGC CAGACCCCGC CGGCACCGCC 
TGGCCCGATC CCACGGTGAT CGCGCGGCTC GCCAATGCGC TGTTCCAGGC GCCGCCGAAC
CAGGCGCCGC CGAGTTCGAC GGGCGCGCCG CTCGGCGCGC AGAGCATTCC GCTGGCGCCG
CAGACCCCGA TCACCACGCC GGCGCCAACG GTGACCACGA GCGCCGCGCC ATGGGTGCCG
GGCAATCCCT CGGCGGGTCC GCCCGACCTG CCGCCGACCA CGATCCCCTC GGTGGTGCCG
ACGCCGAGCG TCACCGCGCC GCAGCCGCCG TCGCAGTCCT CGCTGCCGCC GGGCCTTGAG
GTGCCGCAGC CCGGCGCGTC GCCGGGCGCG ATCGGGCCGC AGCCGATCGA TCTGCAAGCC
GTCGATCTGC AGGGCGCCGC GCCATTCAAC CTCGGCGATG CGGTGTACTT CGCCCCGCCC
GACCGTCGCG TCGCCGCCGA CGCGCCGAGC GGCGGCGCCG CGGCCGGTGC CGAGCCCGCC
GCGTATTTCC TGTCGGAAGC GCCGTTCGCC TCGCATCACG CGCCGCAGCC GTCTGCCTCG
CAATCGACCA CGCCGCCCGG CATCGCTCCG ACCGGCAGCG GCGATCCGAC TCATCTCGAC
GCGGTGCCGA CGCAGAGCTA CCTCAGCGCC GACGCGGCGT CGCAGCCGCC GCGCGCCGAT
CTCGGCAAGC CAAGTTCGGA CTTCGCGGTG ATGACGCCGA ATCTGCGCGC AGTGTTGACG
CCGGCATTCG GCGGCGGCGC GCATCCGTTC GATCCGCACG CCATCAAACG CGACTTCCCG
ATCCTGCAAA CGCGGGTGCA TGGCAAGCGG CTGGTCTGGC TCGACAACGC CGCCACCACG
CAGAAGCCGC AGGCGGTGAT CGACCGCCTG GCGCACTTCT ACAGCCACGA GAATTCCAAC
ATCCACCGCG CCGCGCACGA GCTCGCGGCG CGCTCCACCG ACGCCTATGA GGCGGCGCGC
GAAAAAGTCC GCCGCTTTCT CGGCGCGCCC TCGCCGCGCG ACATCATCTT CGTGCGCGGC
GCTACCGAGG GCATCAACCT GGTGGCGCAG GCCTGGGGCC GCCGCAACAT CGGCGAAGGC
GACGAGATCG TGGTGTCGTG GCTCGAGCAC CACGCCAACA TCGTGCCCTG GCAGCAGCTC
TGCGCCGAGA AAGGCGCGCG GCTGCGCGTC GCGCCGGTCG ACGACCACGG CCAGATCATC
CTTGAAGAGT ATGAGAAGCT GCTCGGGCCG AACACCAAGC TGGTGTCGAT CACCCAGGTC
TCCAACGCGC TCGGCACCGT CGTCCCGGTC ACCGAGATCA CCGCGATCGC GCATCGCCAC
GGCGCTTGCG TGCTGATCGA CGGCGCGCAA TCGGTGTCGC ACATGCCAGT CGACGTGCAG
GCGATCGGCT GCGACTTCTT CATATTCTCC GGCCACAAGG TGTTCGGGCC GACCGGGATC
GGCGCGGTCT ATGGCAAGGA TTCCGTGCTC GCCCACATGC CGCCGTGGCA GGGCGGCGGC
AACATGATCG CCGACGTCAG CTTCGAGAAG ACCATCTATC AGGGACCGCC CGACCGCTTC
GAGGCCGGCA CCGGCAACAT CGCCGACGCG GTCGGCCTCG GCGCCGCGAT CGACTACGTC
GAGGCGATCG GCATGGCGGC GATCGAACGC TACGAGCACG AGTTGCACGG CTACGCCACC
GAACGGATGC AGGGCGTCCC CGGGCTGAAG ATGATCGGCA CCGCCAAGGA CAAGGCCAGC
GTGCTGTCGT TCGTGCTCGA CGGCCACAAC CCGGTCGACG TCGGCAAGGC GCTCGACCAG
GACGGCATCG CGGTGCGCGC CGGTCATCAC TGCGCGCAGC CGATCCTGCG GCGGTTCGGG
CTGGAAGCCA CGGTGCGACC GTCGCTGGCG TTCTACAACA CCTGCGAGGA CGTCGACGCG
TTGGTGGCGG CGTTGCAGCG GCTGCAGAGC GGCGCCCCGC GCGGGCGGGT GGTGTAG
 
Protein sequence
MSEPHAASGA ALPPDPAGTA WPDPTVIARL ANALFQAPPN QAPPSSTGAP LGAQSIPLAP 
QTPITTPAPT VTTSAAPWVP GNPSAGPPDL PPTTIPSVVP TPSVTAPQPP SQSSLPPGLE
VPQPGASPGA IGPQPIDLQA VDLQGAAPFN LGDAVYFAPP DRRVAADAPS GGAAAGAEPA
AYFLSEAPFA SHHAPQPSAS QSTTPPGIAP TGSGDPTHLD AVPTQSYLSA DAASQPPRAD
LGKPSSDFAV MTPNLRAVLT PAFGGGAHPF DPHAIKRDFP ILQTRVHGKR LVWLDNAATT
QKPQAVIDRL AHFYSHENSN IHRAAHELAA RSTDAYEAAR EKVRRFLGAP SPRDIIFVRG
ATEGINLVAQ AWGRRNIGEG DEIVVSWLEH HANIVPWQQL CAEKGARLRV APVDDHGQII
LEEYEKLLGP NTKLVSITQV SNALGTVVPV TEITAIAHRH GACVLIDGAQ SVSHMPVDVQ
AIGCDFFIFS GHKVFGPTGI GAVYGKDSVL AHMPPWQGGG NMIADVSFEK TIYQGPPDRF
EAGTGNIADA VGLGAAIDYV EAIGMAAIER YEHELHGYAT ERMQGVPGLK MIGTAKDKAS
VLSFVLDGHN PVDVGKALDQ DGIAVRAGHH CAQPILRRFG LEATVRPSLA FYNTCEDVDA
LVAALQRLQS GAPRGRVV