Gene RPB_1753 details

Gene Information

Locus tag: RPB_1753
Symbol:
ID: 3909740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2005442
End bp: 2007061
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 68%
IMG OID: 637883647
Product: sulfite reductase
Protein accession: YP_485372
Protein GI: 86748876
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0369] Sulfite reductase, alpha subunit (flavoprotein)
TIGRFAM ID:
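
The coordinate and length fields above are mutually consistent if the coordinates are 1-based and inclusive and the protein length excludes the terminal stop codon. Both conventions are assumptions, since the record does not state them. A minimal Python sketch of that check, using hypothetical helper names:

```python
# Values copied from the Gene Information fields above.
START_BP = 2005442
END_BP = 2007061
REPORTED_GENE_LENGTH = 1620     # bp
REPORTED_PROTEIN_LENGTH = 539   # aa


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the final codon is a stop codon that is not counted."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("gene length is not a whole number of codons")
    return codons - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

If either comparison printed `False` for another record, the likely causes would be a different coordinate convention or a partial or pseudo gene.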


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.274813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0164644
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGA ATATGCCGCC GCCGATCCCC ATGCTGGTCC CGGAGACCGC GCCGTTCTCC 
GACGAGCAGC GCGCCTGGCT GAACGGTTTC TTCGCCGGCC TCGTCTCGCT CGATGACGCG
GGCGTCACCG CGCTATCGAG CGAACAGGCC GCCGCATTGC TGGCCGGCGG CCCGGCGCCC
ACCGCGGAGG ACGACGATGG CGGCGCGCCG TGGCACGACC AGACGCTGCC GATCGGGGAG
CGGATGCAGC TCGCCGACGG CAAGCCGTTA CGCTGGAAGC TGATGGCCGC GATGGCGCAG
CAGGATTGCG GCCAATGCGG CTACGATTGC CGCAACTACT CGGCGGCGAT CTTCGAAGGG
AAAGAGACGC GGCTGAATCT ATGCGCCCCT GGCGGCAAGG ACACCGCCCG CATGGTCAAG
ACGCTGGCCG AGCAGATCGG CAGCGCACCG AAGGCCGACA ACGCGCGATC GCTCGCGACC
GATGCGGCGC CCGCCGTGGC GCTGCCGCCG CGCGGCACCT CGCGCGACAA TCCGGCCACG
GCCAAAGTGC TGTCGCGCCG CAAGCTGAAC AAGGACGGCT CCGAGAAGGA AACCTGGCAC
ATCGAGTTCG ACCTCGAAGA CGGCCTCGCC TACGAGGTCG GCGATTCCTT CGGGCTGTTT
CCGGGCAACG ATCCCAGGCT GGTCGAGCTG GTACTGAAGG CGCTCGGCGC CCCCGCGACG
TTCCCGATCG GCGACCGCAC GCTGCGCGAG GCGCTGATCG ACAGCGTGTC GCTGGCGCCC
GCGCCCGACA TGCTGTTCCA GCTGATCAGC TACATCACCG GTGGCGACAA GCGGAAGAGA
GCCCGCGCGC TCGCCAATGG CGAGGATCCG GACGGCGACG CCGCGACGCT CGACGTGCTG
GCGGCGCTGG AGAAGTTTCC CGGCATCCGC CCCGATCCGG AAGCCTTCGT CGAGGCGCTC
GATCCGCTGC AGCCGCGGCT GTATTCGATC TCGTCGTCGC CGAAGACGAC TCCCGGCCGC
TTGTCGCTGA CGGTGGATTG CGTGCGCTAC ACCATCGGCA AGCGGCAACG GCTCGGCGTC
TGCTCGACCG GCCTCGCCGA ACGCGTGACG CCCGGCGACA CCGTGCGCGT CTATGTGCAG
AAGGCGCACA ATTTCGCGCT GCCGGCCGAT CCGAACCAGC CGATCATCAT GATCGGCCCC
GGCACCGGCG TCGCACCCTT CCGCGCCTTC CTGCACGAGC GGCAGGCGGT GGCCGCGCCC
GGCAAGAACT GGTTGTTCTT CGGCCATCAG CGCTCGGCCT GTGATTTCTT CTACGACGAC
GAACTCAACG CGATGAAGCG CAGCGGTCTC CTCACGCGAC TGTCGTTGGC ATGGTCGCGC
GACAGCGGCG AAAAGATCTA CGTGCAGGAC CGGATGCGCG AGGTCGGCCG CGATCTGTGG
AGCTGGCTCA CCGAAGGCGC GAACATCTAT GTCTGCGGCG ACGCCAAGCG GATGGCCAAG
GACGTCGAGC TAGCGCTGGT CGACATCGTC GCGCAGCACG GCGCGCGCAC GCCGGCGGAG
GCCACCGCCT TCGTCTCCGA GCTGAAGAAG CAGGGCCGCT ACCAGCAGGA CGTGTATTGA
 
Protein sequence
MSQNMPPPIP MLVPETAPFS DEQRAWLNGF FAGLVSLDDA GVTALSSEQA AALLAGGPAP 
TAEDDDGGAP WHDQTLPIGE RMQLADGKPL RWKLMAAMAQ QDCGQCGYDC RNYSAAIFEG
KETRLNLCAP GGKDTARMVK TLAEQIGSAP KADNARSLAT DAAPAVALPP RGTSRDNPAT
AKVLSRRKLN KDGSEKETWH IEFDLEDGLA YEVGDSFGLF PGNDPRLVEL VLKALGAPAT
FPIGDRTLRE ALIDSVSLAP APDMLFQLIS YITGGDKRKR ARALANGEDP DGDAATLDVL
AALEKFPGIR PDPEAFVEAL DPLQPRLYSI SSSPKTTPGR LSLTVDCVRY TIGKRQRLGV
CSTGLAERVT PGDTVRVYVQ KAHNFALPAD PNQPIIMIGP GTGVAPFRAF LHERQAVAAP
GKNWLFFGHQ RSACDFFYDD ELNAMKRSGL LTRLSLAWSR DSGEKIYVQD RMREVGRDLW
SWLTEGANIY VCGDAKRMAK DVELALVDIV AQHGARTPAE ATAFVSELKK QGRYQQDVY
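
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be processed. The following Python sketch shows one way to do that, to compute a GC fraction, and to translate the coding sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and it does not handle alternative start codons or ambiguous bases. The function names are hypothetical and not part of any database tool.

```python
# Compact standard codon table: bases in T, C, A, G order, one letter per codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above, pasted with its original spacing.
first_line = "ATGAGCCAGA ATATGCCGCC GCCGATCCCC ATGCTGGTCC CGGAGACCGC GCCGTTCTCC"
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Pasting the full gene sequence into `translate` and comparing the result with the cleaned protein sequence is a quick way to confirm that the two listings correspond.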