Gene RPB_2990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2990 
Symbol 
ID3910789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3403472 
End bp3404758 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content63% 
IMG OID637884896 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_486603 
Protein GI86750107 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.398941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACT GCTGCGATCG CTCTCGAGTT GGAAGGCCTG ACATGGCGCA TCCCGCGGTT 
TCGAATGGGT CCTACGACGT CGCCAAAGTC CGCGAGGATT TTCCGGCGCT GGCGCTGAAG
GTTTACGGCA AGGATCTGGT GTATCTCGAC AACGCCGCCT CGGCGCAGAA GCCGCGCGCC
GTGCTGGAGC GGATGACCAA GGCGTATGAG AGCGAATACG CCAATGTGCA TCGCGGGCTG
CATTATCTCG CCAACGCGGC GACCGAAGCC TATGAGGGCG GTCGCACCCG CGTGCAGCAT
TTCCTCAACG CCAAGCGGCC GGAAGAGATC ATCTTCACCC GCAACGCCAC CGAGGCGATC
AATCTGGTGG CGTCGTCGTT CGGCGCGCCG AATATCGGCG AGGGCGACGA GATCGTGCTC
TCGATCATGG AGCACCATTC CAACATCGTG CCGTGGCACT TCTTGCGCGA ACGTCAGGGT
GCTGTTCTCA AATGGGCGCC GGTCGACGAC GACGGCAATT TCCTGATCGA CGAATTCGAG
AAGCTGCTGT CGCCGAAGAC CAAGCTGGTC GCGATCACGC AGATGTCGAA CGCGCTCGGC
ACCATCGTGC CGGTGAAAGA GGTGGTGAAG CTGGCGCACG ACCGCGGCAT TCCGGTGCTG
GTCGACGGCA GCCAGGGCGC GGTGCATCTC ACCATCGACG TCCAGGACAT CGACTGCGAT
TTCTACATCA TGACCGGCCA CAAGCTGTAC GGCCCGACCG GGATCGGCGT GCTGTACGGC
AAATACGACG TCCTCGCCAA GATGCGGCCG TTCAACGGCG GCGGCGAGAT GATTCGCGAA
GTGGCGCAGG ACTGGGTGAC CTACGGCGAC CCGCCGCACC GGTTCGAGGC CGGCACCCCG
GCGATCGTCG AGGCGGTCGG GCTCGGGGCG GCGATCGACT ACGTCAATTC GATCGGCAAG
GAGCGCATCG CCGCGCACGA ACACGATCTT TTGACGTATG CGGAACAGCG ATTGCGCGAG
ATCAATTCGC TGCGCATCAT CGGCACCGCC AAGGGCAAGG GGCCGGTGAT TTCCTTCGAG
ATGAAGGGCG CGCACCCGCA CGACATCGCC ACCGTGATCG ACCGCCAGGG CATCGCGGTG
CGGGCGGGAA CCCATTGCGT GATGCCGTTG CTGGAGCGGT TCCAGGTCAC GGCGACGTGC
CGGGCGTCGT TCGGCATGTA TAATACCCGT GAGGAAGTCG ACCAACTCGC TAATGCGCTG
ATCAAGGCGC GGGACCTGTT CGCATGA
 
Protein sequence
MRHCCDRSRV GRPDMAHPAV SNGSYDVAKV REDFPALALK VYGKDLVYLD NAASAQKPRA 
VLERMTKAYE SEYANVHRGL HYLANAATEA YEGGRTRVQH FLNAKRPEEI IFTRNATEAI
NLVASSFGAP NIGEGDEIVL SIMEHHSNIV PWHFLRERQG AVLKWAPVDD DGNFLIDEFE
KLLSPKTKLV AITQMSNALG TIVPVKEVVK LAHDRGIPVL VDGSQGAVHL TIDVQDIDCD
FYIMTGHKLY GPTGIGVLYG KYDVLAKMRP FNGGGEMIRE VAQDWVTYGD PPHRFEAGTP
AIVEAVGLGA AIDYVNSIGK ERIAAHEHDL LTYAEQRLRE INSLRIIGTA KGKGPVISFE
MKGAHPHDIA TVIDRQGIAV RAGTHCVMPL LERFQVTATC RASFGMYNTR EEVDQLANAL
IKARDLFA