Gene RPD_3227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3227 
Symbol 
ID4023734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3581450 
End bp3583438 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content70% 
IMG OID637963429 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_570353 
Protein GI91977694 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.545837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCGC CTGAGTCCCC GGCGCCGCAT CCCGGCTCGG CGGTGGCGGG CTCGCAGTTC 
GGCCTGCCGG ACGCCGGGAC GCTCTCGCCT CCCGCGACTG CCGCTCCGCT CGCGCCGGAT
GTCGCCGTGA TCGCGCAACT CGCCAATGCG TTCTTTGCGG CTTTGCCGAA CGGCCCGGCG
CCGGAGCCGG GCGCGGCGCT CGGCTCGGCG CCGCTGTTCG TGGCCGAACC GTTGCAAAAC
GCCATTCCGG GCACGCCGGC GCTGACGCCG TCCTACAAGC CGAACCACAA TCCCGGCGCG
GCCTCGCCGA TTCCCACGGT CGGCGCGGCG CGCCCCTCTG CGCCGATCTT CGCGCTCGAC
CCCGGGTTGA CGCCGGAGCC CGCTGCGGCC CGCGCGCCTG CTCAGCCGCA GGGTCTGACC
GTGCCGCTGG TGACGACTCC GGTCGCGCCG CCGGCGCAGC CATCGCCGCC GTCGGCGCCG
TCATTTACGC GTGACACCGA CCTTGCGGCC TTGCCCGGCC GGCTCGATGA CACGCGCAGC
CTGGCGCCGC GCTACGACGC GCCAAACGCC GGACCGTTCG GCGGCAGCTT CGCCGGCGCG
TCGGAGCCGC TGTATTTTCT TGCCGGCAAT CCGGCGCTGG CGCGCGCGAC GCCGGCCGTG
GCGCCGCCAT CGCCGCCGCG CGTGGAGACC CTCGATCTCG GCGCGTTGGC TGCGCAGCAT
CGCACCGACG TCGCCGTCGG CGATCTGCCC GGCCTGCGCG CGTTCGACGC TAATCTGTTT
CGTCGCGATT TCCCGATCCT GCGGGAAACC GTCAACGGCC GGCCGCTGAT CTGGCTCGAC
AACGGCGCGA CGACGCAGAA GCCGCAATGC GTGATCGATC GCCTTGCGTA TTTCTACGCC
CACGAAAATT CCAACATCCA TCGCGCTGCG CATACGCTCG CGGCGCGTTC CACCGACGCC
TATGAGGCGG CGCGCGACAA GGTCCGCGCC TTCATCAACG CGCCGCAGGT CGCCGACATC
GTGTTCGTGC GCGGCGCCAC CGAGGCGATC AATCTGGTTG CTCAGGCCTG GGGCCGGCGC
AATGTCAGTG AGGGCGACGA GATCGTCGTC AGCCATCTCG AGCACCACGC CAATATCGTG
CCGTGGCAGC AGCTCGCGGC CGAGAAGGGC GCGCGGCTGC GCGTCGCCCC GGTCGACGAT
CACGGCCAGA TCATCCTTGA AGAGTACGAG AAGCTTCTCA ATCCGCGCAC CCGCATCGTC
GCCTTCACGC AAGTATCGAA TGCGCTCGGC ACGGTGACGC CGGTCGCCGA GATGACGGCG
CTGGCGCATC GCCACGGCGC CAAGGTGCTG GTCGACGGCG CCCAGGGCGT CTGTCATATG
CCGGTCGACG TGCAGGCGCT GGACGTCGAT TTCTACGCCT TCTCCGGCCA CAAGATGTTC
GCGCCGACCG GCATCGGCGT GCTGTACGGC AAGGCCGATG TGCTGGAAGC GATGCCGCCG
TGGCAGGGCG GCGGCAATAT GATCGCCGAC GTCACCTTCG AGAAGACGGT GTTTCAAGGG
GCCCCGGACC GGTTCGAGGC CGGCACCGGC AACATCGCCG ACGCCGTCGG CCTCGGCGCC
GCGATCGACT ATCTCAGCCG CATCGGCATG GCGAACATCG CCGCGCATGA GCACGAGCTG
CTGGCCTACG GCACCCAGGC GCTGCTCGCC GTGCCGGGCC TGAAGCTGAT CGGCACCGCG
CGCGAGAAAG CCGGCATCCT GTCATTCGTG CTCGATGGCT GCCGCAGTGA AGATGTCGGC
CGTGCGCTCG ATCGCGAAGG CATCGCGGTG CGGGCCGGGC ATCATTGCGC CCAGCCGATC
CTGCGCCGGT TCGGCCTCGA GAGCACGGTG CGGCCTTCGC TCGCGCTCTA CAACACCACC
GCGGACATCG ATGCTCTGGT CGATGCGCTG AAGCGTCTGC AGAGCGGTCG GGGCGTTCAC
TGGAGCTGA
 
Protein sequence
MSAPESPAPH PGSAVAGSQF GLPDAGTLSP PATAAPLAPD VAVIAQLANA FFAALPNGPA 
PEPGAALGSA PLFVAEPLQN AIPGTPALTP SYKPNHNPGA ASPIPTVGAA RPSAPIFALD
PGLTPEPAAA RAPAQPQGLT VPLVTTPVAP PAQPSPPSAP SFTRDTDLAA LPGRLDDTRS
LAPRYDAPNA GPFGGSFAGA SEPLYFLAGN PALARATPAV APPSPPRVET LDLGALAAQH
RTDVAVGDLP GLRAFDANLF RRDFPILRET VNGRPLIWLD NGATTQKPQC VIDRLAYFYA
HENSNIHRAA HTLAARSTDA YEAARDKVRA FINAPQVADI VFVRGATEAI NLVAQAWGRR
NVSEGDEIVV SHLEHHANIV PWQQLAAEKG ARLRVAPVDD HGQIILEEYE KLLNPRTRIV
AFTQVSNALG TVTPVAEMTA LAHRHGAKVL VDGAQGVCHM PVDVQALDVD FYAFSGHKMF
APTGIGVLYG KADVLEAMPP WQGGGNMIAD VTFEKTVFQG APDRFEAGTG NIADAVGLGA
AIDYLSRIGM ANIAAHEHEL LAYGTQALLA VPGLKLIGTA REKAGILSFV LDGCRSEDVG
RALDREGIAV RAGHHCAQPI LRRFGLESTV RPSLALYNTT ADIDALVDAL KRLQSGRGVH
WS