Gene RPB_1401 details


Gene Information

Locus tag: RPB_1401
Symbol:
ID: 3908351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1592161
End bp: 1593813
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 64%
IMG OID: 637883295
Product: sulfite reductase, hemoprotein subunit
Protein accession: YP_485022
Protein GI: 86748526
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0155] Sulfite reductase, beta subunit (hemoprotein)
TIGRFAM ID:
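
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function name `lengths_from_coordinates` is hypothetical and not part of any database API.

```python
def lengths_from_coordinates(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa).

    Assumes 1-based inclusive coordinates and a CDS whose final
    codon is a stop codon that is not counted in the protein.
    """
    gene_length = end_bp - start_bp + 1       # inclusive span
    protein_length = gene_length // 3 - 1     # codons minus the stop codon
    return gene_length, protein_length


gene_length, protein_length = lengths_from_coordinates(1592161, 1593813)
print(gene_length, protein_length)  # 1653 550
```

Both values agree with the Gene Length and Protein Length fields of this record.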


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0403368
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGCTT ATGACGAGAT CGACCAGACG CTGGTCAATG AACGGGTCAC CGAATTTCGC 
GACCAGGTGC GGCGCCGCCT CAGCGGTGAA CTGACCGAGG ACGAGTTCAA GCCGTTGCGG
CTGATGAACG GCGTCTATCT GCAGCTCCAC GCCTATATGT TCCGCGTCGC GATCCCTTAC
GGCACGCTGT CGTCGGCGCA GCTGCGCAAG CTCGCACACG TCGCCCGCAA ATACGATCGC
GGCTACGGCC ATTTCACCAC CCGGCAGAAC ATCCAGTTCA ATTGGATCGC GCTGAAGGAT
CTGCCCGACG CGCTCGCCGA TCTCGCCGAG GTCGGCATCC ACGCGATGCA GACCTCCGGC
AACTGCACGC GCAACGTCAC TGCCGATCAA TGGGCCGGTG TCGCGCCGGG CGAGGTCGAG
GATCCCCGCG TCTGGGCCGA AATCCTGCGC CAGCACACCG CGCTGCATCC GGAATTCTCG
TTTCTGCCGC GCAAGTTCAA ATTCGCCATC ACCGCAGCCG ATCACGACCG CGCGGCCATC
AAGGTGCACG ATATCGGCCT GAAGCTGATC AGGAACGAGC AGGGCGAGAC CGGTTTCGAG
GTGCTGGTCG GCGGCGGGCT CGGCCGTTCG CCGTTCATCG CCAAGACCAT CAAGCCGTTC
GTCGCCGGCC GCGACATCCT CAGCTATGTC GAGGCGATCC TGCGCGTTTA CAATCAGTAC
GGCCGCCGCG ACAACATCTA CAAGGCGCGC ATCAAGATCC TGGTCCACGA ACTCGGCATC
GAGAAATTCG CCGCCGAGGT CGAGCAGACG TGGCGCCAGA TCGCCGAAGG TCCGCTGACG
CTCGACGACG AGATGATCGA GGACATCCGC GCGCGCTTCG TCTATCCGGC CTATGAGAGG
TTGTCGGACG ACCCGGCCGA GTTGCGGGCC GCCGCCGATC CGAAATTCGA GGCGTGGCGC
AGCAATTCGG TCGCGCCGCA TCGCCAGCCC GGCTACGCCA TCGTCACGCT GTCGCTGAAG
CCGGTCGGTG GCCCGCCCGG CGACGCCACC GCCGATCAGA TGGACGCGAT TGCCGATCTC
GCCGACAGAT ACTCATTCGG CGAAATCCGC GTCGGCCACG AGCAGAATCT GGCGCTGCCG
CATGTCGCCC AGCGCGATCT GCCGGCGCTG TGGAGCGCGC TCGACGCGCT CGGCGTTGCC
ACGCCGAACG TCAATCTGGT CAGCGACATC ATCGCCTGCC CGGGGCTCGA CTATTGCTCG
CTCGCCAATG CCCGCTCGAT CCCGATCGCA CAGGAGCTGA CGCGGCGCTT TTCCAATCAC
GAGACCGCCA AGCTGATCGG CCGGCTGCAC GTCAACATCT CCGGCTGCAT CAACGCCTGC
GGCCATCACC ATGTCGGCCA TATCGGGATT CTCGGCGTCG AGAAGAACGA CCAGGAGGTC
TATCAGATCA CCATCGGCGG CCGCGCCGAC GAGCACGCCA GGCTGGGCGA ATTGATCGGG
CCCGCGGTGC CCTATGCCGA GGTCCCCGAC GTCATCGAAG ACATCGTCGA GGCCTATCTG
GCGTTGCGCG ACAAGCCGGA GGAGCTGTTC GTCGACACCG TCAAGCGGCT CGGCGTGCAG
CCTTTCAAGG AGCGGGTCTA TGCCACTCGT TAA
 
Protein sequence
MYAYDEIDQT LVNERVTEFR DQVRRRLSGE LTEDEFKPLR LMNGVYLQLH AYMFRVAIPY 
GTLSSAQLRK LAHVARKYDR GYGHFTTRQN IQFNWIALKD LPDALADLAE VGIHAMQTSG
NCTRNVTADQ WAGVAPGEVE DPRVWAEILR QHTALHPEFS FLPRKFKFAI TAADHDRAAI
KVHDIGLKLI RNEQGETGFE VLVGGGLGRS PFIAKTIKPF VAGRDILSYV EAILRVYNQY
GRRDNIYKAR IKILVHELGI EKFAAEVEQT WRQIAEGPLT LDDEMIEDIR ARFVYPAYER
LSDDPAELRA AADPKFEAWR SNSVAPHRQP GYAIVTLSLK PVGGPPGDAT ADQMDAIADL
ADRYSFGEIR VGHEQNLALP HVAQRDLPAL WSALDALGVA TPNVNLVSDI IACPGLDYCS
LANARSIPIA QELTRRFSNH ETAKLIGRLH VNISGCINAC GHHHVGHIGI LGVEKNDQEV
YQITIGGRAD EHARLGELIG PAVPYAEVPD VIEDIVEAYL ALRDKPEELF VDTVKRLGVQ
PFKERVYATR
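
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it assumes the forward-strand sequence is given exactly as listed, and it builds the codon table by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene starts with ATG, that difference is not handled here. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    Alternative start codons are not special-cased.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
print(translate("ATGTACGCTT ATGACGAGAT CGACCAGACG"))     # MYAYDEIDQT
print(translate("CCTTTCAAGG AGCGGGTCTA TGCCACTCGT TAA"))  # PFKERVYATR
```

The two fragments reproduce the first and last ten residues of the protein sequence in this record.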