Gene RPD_4054 details


Gene Information

Locus tag: RPD_4054
Symbol:
ID: 4024571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4504908
End bp: 4506350
Gene Length: 1443 bp
Protein Length: 480 aa
Translation table: 11
GC content: 66%
IMG OID: 637964257
Product: hypothetical protein
Protein accession: YP_571174
Protein GI: 91978515
COG category: [R] General function prediction only
COG ID: [COG3800] Predicted transcriptional regulator
TIGRFAM ID:
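
The coordinates and lengths in this record agree with each other, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; the function name cds_lengths is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes inclusive coordinates and a CDS that ends with one stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
    codons = gene_length // 3             # number of complete codons
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


# Values copied from the record: Start bp 4504908, End bp 4506350.
gene_length, protein_length = cds_lengths(4504908, 4506350)
print(gene_length, protein_length)  # 1443 480
```

Under these assumptions the result matches the listed Gene Length (1443 bp) and Protein Length (480 aa).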


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.568607
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0971132
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGGTG AATCCGGGAA AAAGCTGTTC GTCGGACCCC GGTTCCGCCG AATCCGTCAG 
CAGCTTGGCC TGTCGCAGAC CCAGATCGCC GAGGGACTGG GGATCTCGCC GAGCTATATC
AACCTGATCG AGCGGAACCA ACGGCCGGTG ACCGCACAGA TCCTGCTGCG ATTGGCGGAA
ACCTACGACC TCGATCTGCG CGACCTCGCC ACCGCCGACG AGGACCGGTT CTTCGCCGAG
CTCAACGAAA TCTTTTCGGA CCCGCTGTTC CGCCAGATCG ACCTGCCGAA GCAGGAGCTG
CGCGACCTCG CAGAGCTTTG CCCCGGCGTC ACCCATTCAC TCCAGCGGCT TTACGCCGCC
TACACCGAGG CGCGGCGCGG CGAAACGATG GTCGCGGCGC AGATGGCCGA CCGCGAACAG
ATCCGCTACG AGGCCAACCC GATCGAGCGC GTTCGCGATC TGATCGAGGC CAACCGCAAC
TATTTTCCGG AGCTGGAGCA GGCGGCGGAA GCGGTGCGCG ACGAACTCAA CGTCAGTTCG
CAGGATGTCT ACGGCGCGCT CGACGACCGC CTGCGCGAGC GCCACGCGAT CACAACCCGG
ATCATGCCGG TCGACGTGAT GCGGGAGACG CTGCGCCGGT TCGACCGCCA CCGCCGGCAA
TTGCTGATCT CGGAATTGAT CGACGGGCCG GGCCGCGCCT TCCAGATCGC GTTCCAGACC
GGCCTCAGCG AGCATGGCGG CGTGATCGAC GCCATCGTGC ACCGCTCCGG CGCGCTCGAC
GAGCCGGCGC GGCGGCTGTA CCGCATCACC CTCGGCAACT ACTTCGCCGC CGCGGTGATG
ATGCCCTACG CCGCGTTCCT CACCGCCGCG GAGCAGCTCA GCTACGACGT CAACGTGCTG
GCGCAGCGCT TCAACGCCGG TTTCGAACAG GTCTGCCACC GCCTCACTAC GCTGCAGCGG
CCGAACGCGC GCGGCGTGCC GTTCTTCCTG CTGCGGGTCG ACAATGCCGG TAACGTCTCC
AAGCGTTTCT CCTCCGGCAC CTTCCCGTTC TCGAAATTCG GCGGCACCTG CCCGTTGTGG
AACGTGCACT CGACCTTCGA CACGCCGGAT CGCCTCCTGA AGCAGGTGAT CGAACTGCCC
GACGGCAGCC GCTACTTCTC GATCGCCCAG ATGGTCCGCC GGCCGGTAGC GCCGCATCCG
CAACCGCAGC CGCGCTTCGC GCTCGGCCTC GGCTGCGAAA TCCGCCACGC CGCCAAACTG
ATCTACGCCG CCGGGATGGA TCTGGAGAAA GCCGAAGGCA CCCCGATCGG CGTCAACTGC
CGCCTCTGCG AACGCGAACA CTGCAGCCAG CGCGCCGAGC CGCCGATCAC CCGGACGCTG
ATCCTGGACG AGAACACAAG GCGAGCGAGC AGCTTTGCGT TCAGCAATGC GCGGGAGTTG
TGA
 
Protein sequence
MAGESGKKLF VGPRFRRIRQ QLGLSQTQIA EGLGISPSYI NLIERNQRPV TAQILLRLAE 
TYDLDLRDLA TADEDRFFAE LNEIFSDPLF RQIDLPKQEL RDLAELCPGV THSLQRLYAA
YTEARRGETM VAAQMADREQ IRYEANPIER VRDLIEANRN YFPELEQAAE AVRDELNVSS
QDVYGALDDR LRERHAITTR IMPVDVMRET LRRFDRHRRQ LLISELIDGP GRAFQIAFQT
GLSEHGGVID AIVHRSGALD EPARRLYRIT LGNYFAAAVM MPYAAFLTAA EQLSYDVNVL
AQRFNAGFEQ VCHRLTTLQR PNARGVPFFL LRVDNAGNVS KRFSSGTFPF SKFGGTCPLW
NVHSTFDTPD RLLKQVIELP DGSRYFSIAQ MVRRPVAPHP QPQPRFALGL GCEIRHAAKL
IYAAGMDLEK AEGTPIGVNC RLCEREHCSQ RAEPPITRTL ILDENTRRAS SFAFSNAREL
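
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the compact NCBI ordering (bases in the order T, C, A, G); translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG. The names CODON_TABLE and translate are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":      # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGGCGGGTG AATCCGGGAA AAAGCTGTTC"))  # MAGESGKKLF
```

The first ten residues produced this way match the start of the listed protein sequence, and the final codons of the gene (GCG CGG GAG TTG TGA) give AREL followed by the stop codon, which matches its end.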