Gene RPD_4244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4244 
Symbol 
ID4024765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4712135 
End bp4713937 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content69% 
IMG OID637964450 
Productprotein-disulfide reductase 
Protein accessionYP_571362 
Protein GI91978703 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4232] Thiol:disulfide interchange protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.438007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACC GCATTGTCCG CATTGTTATC CTGCTCGTCG CGATCGCGGC GGGCGCCGAG 
GCAGCGTATG CCGCCAAGGC CGCGCCGTTC CAGCTCAGCG CCCAGCCCGG CGCCAACGGC
GTCGATCTGA CCTGGCGGAT CGCCTCCGGC GATTATCTCT ACCGCGACAA GATCGTCGTC
ACCACCGCCG ATGGCGCCCG CGTTGCGGTC CAGACCCCGA CTGGCGAGCT CAAGGACGAT
CCGAATTTCG GCATGACCGA GATCTATCAC CGCAGCGTGA CCGCGACGAT CCCTGCTGAC
GCGGTGAAGA GTGCGAGCCG CTTGATGGTG ACGTATCAGG GCTGCGCCGA GCGCGGCATC
TGCTATCCGC CGATGACGGC AAGTGTCGAT CTCGGCACCT ATCAGGTCTC AGCCGCGAGT
GGAGAAACGC CGAGCGCCGG CCAGGCCCGG ACATCGGACC TGCCCATTAT TCCGCAGCTC
GCCGAGCCGG CAGCCGAGCC GGCGACGGTC GCCGCATCGG TGCTGCCGTC GATGACGCAG
GGCTGGTTGC CGCTGCTGCT CGCCTTCGCC GGGTTCGGAC TGCTGCTGGC GTTGACACCC
TGCGTGCTGC CGATGATCCC GATCGTCGCC GGCATGCTGA CCCGGTCCGG CCCCGGCATC
TCGCCGGCGC GTGGCTTCGC GCTGGCCGCC ATCTACACCC TGGCGATGGC GTCGGCTTAC
GCCGCGCTGG GCGTCGCAGC GGCGTGGTCG GGGCAGAATC TACAAGGCGC GTTGCAGGCG
CCGCTGGCGC TGGCCGTGAT GGCGTCGATC TATGTTGCGC TGGCGCTGTC GAGCTTCGGC
TTGTTCGAGC TGCAATTTCC GGCCGGGTTC GGCGGCAACC TCGCCGGCCG GCTGAACGGC
CGCGCCGGAC CGTTGCTCGG CGCAGCGGCG CTCGGCTTCA CCTCCGCGCT GATCGTCGGA
CCATGCGTGA CTCCACCGCT CGCCGCGGCG CTGCTCTATG TCGCGCAGAC CGGCGATGCG
CCGCGCGGCG CATCGGCGCT GTTCGCGCTC GGCCTCGGCA TGGGATTCCC TTTGATCCTG
GTCGGCCTGT TCGGCGCCGG CGTGCTGCCG CGGTCCGGCC CCTGGCTGGT GACGATCCGC
AAGCTGTTCG GCTTCGTGTT TCTCGGCCTC GCGGTCGCGC TGATCTCGCG GGTATTGCCC
GGCGTGGTGA CGCTGCTGTT GTGGGCCGGC ATCGCGTTCG GTCTCGCCGC GTTTCTCGGC
GCATTCGATC AGCTCGACCG GCTCGGCGGC GCGCTCAGGC GTTCAGGCAA GGCGGCGGGT
CTCGCCGTCT TCGTTTACGG CGCGACGCTG ATCGTCGGCG CCGCCGGCGG CAGCGACGAT
CCGTTGCGGC CGCTCGCGGT GTTCGGCGCC GACCCGACGC CGGCGACCGC GATCGTCGCC
CGGACGGTGA CCTCGATGCC CGCGCTGGAC CAGGCGATCA GCGACGCGCG GGCGCGCGGC
AAGCCGATCA TGATCGACTT CACCGCGGAG TGGTGCACCT CGTGCAAGAC CATGGATCGC
AACGTGTTCG GCGATCCCGC CGTCCGGCAA CGCCTGAAGG ACGTCGCGCT GATCCGCGCC
GACGTCACCA AGACCAACGC CGACACCGCG GCGCTGATGA AGCGCTTCGA CGTCGTCGGC
CCGCCGACCG TGGTGTTTCT CGATCAGCGC GACGGCAGCG AAATCCCCTC CGCCCGCACC
ATTGGCGAAG TCTCCGCCGA CGCGTTCTTC CAGACGCTCC AGCGCGTCGG TTCGTCGTCC
TAA
 
Protein sequence
MADRIVRIVI LLVAIAAGAE AAYAAKAAPF QLSAQPGANG VDLTWRIASG DYLYRDKIVV 
TTADGARVAV QTPTGELKDD PNFGMTEIYH RSVTATIPAD AVKSASRLMV TYQGCAERGI
CYPPMTASVD LGTYQVSAAS GETPSAGQAR TSDLPIIPQL AEPAAEPATV AASVLPSMTQ
GWLPLLLAFA GFGLLLALTP CVLPMIPIVA GMLTRSGPGI SPARGFALAA IYTLAMASAY
AALGVAAAWS GQNLQGALQA PLALAVMASI YVALALSSFG LFELQFPAGF GGNLAGRLNG
RAGPLLGAAA LGFTSALIVG PCVTPPLAAA LLYVAQTGDA PRGASALFAL GLGMGFPLIL
VGLFGAGVLP RSGPWLVTIR KLFGFVFLGL AVALISRVLP GVVTLLLWAG IAFGLAAFLG
AFDQLDRLGG ALRRSGKAAG LAVFVYGATL IVGAAGGSDD PLRPLAVFGA DPTPATAIVA
RTVTSMPALD QAISDARARG KPIMIDFTAE WCTSCKTMDR NVFGDPAVRQ RLKDVALIRA
DVTKTNADTA ALMKRFDVVG PPTVVFLDQR DGSEIPSART IGEVSADAFF QTLQRVGSSS