Gene RPD_1756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1756 
Symbol 
ID4022238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1972381 
End bp1973571 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content67% 
IMG OID637961950 
Productlow temperature requirement A 
Protein accessionYP_568893 
Protein GI91976234 
COG category[S] Function unknown 
COG ID[COG4292] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.812736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGAAGG ATTCGACCAG CGCCGGAAAC CTGCTTCGCG TGCGAAAGCC GCACGAGCAC 
AGCCGCGTCA CTTACGTCGA ATTGTTCTTC GATCTGGTGT TCGTCTTCGC CGTTACCCAG
GTCTCGCATT TCCTGCTGGC GCATTTCACG CCGATCGGCG CGCTGCAGAC GCTGATCCTG
ATGCTCGCGA TCTGGTGGGT GTGGGTCTAC ACCTCCTGGA TCACCAACTG GCTCGACCCC
GAACGCACCA GCGTTCGGGT GATGCTGTTC ACGCTGATGA GCGCGGGCCT GTTGCTGTCG
ACCTCGATCC CGCACGCTTT CGAGAGCCGC GGCCTGGCGT TCGCACTGGC CTTCGTGGCG
ATGCAGGTCG GCCGCAGCGC GTTCGCGACC TTCACCACGC CGAAATCGGA CCCGCTGCGG
CTCAATCTGG CCCGCATCCT GGCCTGGCTC AGCGTCAGCG CGGTGTTCTG GATCGCCGGC
GGGCTGCTCG ACGGAAACAT CCGGATCGCG CTGTGGCTGA TCGCGCTGGC AATCGAATAT
GCCGGACCGT TCGCCGGCTT CTACGTGCCG CGGATCGGCG CCTCCACGAC GCAGGACTGG
AGCGTCGAAG GCGGCCACAT GGCGGAGCGC TGTGCGCTGT TCATCATCAT CGCGCTCGGC
GAATCGATCG TGGTGACGGG GACGACCTTT GCGGGCATCG AATGGACGAT CGTCGGCTTC
GCCGCCGTCG CGTCGGCGTT GCTCGGCGCA ATCGCGATGT GGTGGATCTA CTTCCACATC
GGCGTGCATC TCGGCTCGGA GCGTATCTCG CAGTCGAGCG ACCCGGGGCG GCTGGCGCGG
CTCGCTTACA CCTATCTGCA TTTGCCGATC GTTGCCGGCA TCGTCGTCAC TGCGGTCGGC
GACGAAATGC TGCTGGCGCA CCCCTCCGGC CACGCCGACG CGAAAATGAT CCTCGCCACG
GTCGGCGGGC CGGTGCTGTT CCTGATCGGC GTGTTGCTGT TCAAGCGGGC GGTGCGCGGG
CATTTGCAGC CGTCGCACAT GGTCGGCATC GCGGCCTTGC TCGCGCTGGC GCCGGTGGGC
CCGTTCGTCT CGCCGTTGGC GCTGTCCGCG GCCACGACGG CGACGCTGCT CGCGGTCGGC
TACTGGGAAG CCGTGTCGCT CGGCGCCGCC AGCCGATCGC CGGTGGAGTA G
 
Protein sequence
MAKDSTSAGN LLRVRKPHEH SRVTYVELFF DLVFVFAVTQ VSHFLLAHFT PIGALQTLIL 
MLAIWWVWVY TSWITNWLDP ERTSVRVMLF TLMSAGLLLS TSIPHAFESR GLAFALAFVA
MQVGRSAFAT FTTPKSDPLR LNLARILAWL SVSAVFWIAG GLLDGNIRIA LWLIALAIEY
AGPFAGFYVP RIGASTTQDW SVEGGHMAER CALFIIIALG ESIVVTGTTF AGIEWTIVGF
AAVASALLGA IAMWWIYFHI GVHLGSERIS QSSDPGRLAR LAYTYLHLPI VAGIVVTAVG
DEMLLAHPSG HADAKMILAT VGGPVLFLIG VLLFKRAVRG HLQPSHMVGI AALLALAPVG
PFVSPLALSA ATTATLLAVG YWEAVSLGAA SRSPVE