Gene Rpal_4232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4232 
Symbol 
ID6411916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4542288 
End bp4543907 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content67% 
IMG OID642714114 
Productsulfite reductase 
Protein accessionYP_001993203 
Protein GI192292598 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0369] Sulfite reductase, alpha subunit (flavoprotein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGA ACCTGCCGCC GCCGATCCCG CTACTGGTGC CGGAGACCGC ACCGTTCTCC 
GAGGAGCAAC GCGCCTGGCT CAACGGCTTC TTCGCCGGCC TGGTGTCGCT CGATGGGCAA
GGCGTCACCG CGCTGTCGCC CGATCAGGCC GCGGCATTCA TCGCCGGCGG TCCCGCGCCA
GCGGCCGAAG ACGATGACGG CGGCGCACCG TGGCACGACC AGAGCCTGCC GATCGCCGAG
CGAATGCAGC TTGCGGAGGG CAAGCCGCTG CGCTGGAAGC TGATGGCCGC GATGGCACAG
CAGGATTGCG GTCAATGCGG CTACGACTGC CGCAACTACT CGGGCGCGAT CTTCGAAGGC
CAAGAGACGC GGCTGAACCT GTGCGCCCCT GGCGGCAAGG ACACCGCGCG GATGGTGAAG
GCGCTGGCCG AACAGATCGG TTCGGCCCCA ACGCTGGACA ACGCCCCCTC GCTCGCGGCC
GACACTGCTC CAGTCGCAGC GCTGCCCCCG CGCGGCACCT CGCGCGACAA TCCGGCAACC
GCCAAGGTGT TGTCGCGCAA GCGCCTCAAC AAGCCGGGCT CGGAGAAAGA GACCTGGCAT
GTCGAGTTCG AGCTAGAGGA TTGTCTGAGC TATGAGGCCG GCGATTCCTT CGGGCTGTTT
CCGACCAACG ATCCAGTGCT GGTCGACGCG GTGCTGCATG CGCTCGGCGC GCCGGGGGAG
TTTCCGATTG CGCAATCGAC GCTGCGGCAA ACGCTGCTCG ACAGCGTGTC GCTGTCGCCC
GCACCCGACA TGCTGTTCCA ATTGATCAGC TACATCACCG GCGGCGACAA GCGGAAGAAG
GCTCGCGCAC TCGCCAGCGG CGAGGATCCG GACGGCGACG CCGCGACGCT CGACGTGCTC
GCCGCGCTGG AGAAATTCCC CGGTCTCCGT CCCGATCCGG AAGCCTTCAT CGAAGCACTC
GATCCGCTGC AGCCGCGGCT GTACTCGATC TCGTCGTCGC CGAAGACCAC GCCCGGCCGG
CTGTCGCTGA CGGTCGATTG CGTGCGCTAC ACAATCGGCA AGCGGCAGCG GCTCGGCGTC
TGCTCGACCG GGCTCGCCGA GCGGGTCCGG CCGGGCGAGA CGTTGCGCGC CTATGTGCAG
AAGGCGCATC ATTTCGCGCT GCCGTCCGAT CCCAACCAGC CAATCATCAT GATCGGCCCC
GGCACCGGCG TCGCGCCGTT CCGCGCATTC CTGCACGAGC GCCAGGCAAT CCAGGCGCCG
GGCAAGAACT GGCTGTTCTT CGGCCATCAG CGCTCGGCCT CCGATTTCTT CTACGAGGAC
GAACTGAAGG CGATGAAGAA TGCCGGCCAT CTGACACGGC TGACGCTGGC GTGGTCGCGC
GACAGCGGCG AGAAGATCTA CGTCCAGGAC CGGATGCGCG AGGTCGGCCG CGACCTGTGG
AGCTGGCTCA CCGAAGGCGC CAGCCTGTAC GTCTGCGGCG ACGCCAAGCG CATGGCCAAG
GACGTCGAGC GCGCGCTGGT CGACATCGTC GCCCAGCACG GCGCCCGTTC AGCAGCGGAG
GCGACGGCCT TCGTGTCGGA GCTGAAAAAA CAGGGGCGCT ATCAGCAGGA CGTGTATTAG
 
Protein sequence
MSQNLPPPIP LLVPETAPFS EEQRAWLNGF FAGLVSLDGQ GVTALSPDQA AAFIAGGPAP 
AAEDDDGGAP WHDQSLPIAE RMQLAEGKPL RWKLMAAMAQ QDCGQCGYDC RNYSGAIFEG
QETRLNLCAP GGKDTARMVK ALAEQIGSAP TLDNAPSLAA DTAPVAALPP RGTSRDNPAT
AKVLSRKRLN KPGSEKETWH VEFELEDCLS YEAGDSFGLF PTNDPVLVDA VLHALGAPGE
FPIAQSTLRQ TLLDSVSLSP APDMLFQLIS YITGGDKRKK ARALASGEDP DGDAATLDVL
AALEKFPGLR PDPEAFIEAL DPLQPRLYSI SSSPKTTPGR LSLTVDCVRY TIGKRQRLGV
CSTGLAERVR PGETLRAYVQ KAHHFALPSD PNQPIIMIGP GTGVAPFRAF LHERQAIQAP
GKNWLFFGHQ RSASDFFYED ELKAMKNAGH LTRLTLAWSR DSGEKIYVQD RMREVGRDLW
SWLTEGASLY VCGDAKRMAK DVERALVDIV AQHGARSAAE ATAFVSELKK QGRYQQDVY