Gene RPD_2501 details

Gene Information

Locus tag: RPD_2501
Symbol:
ID: 4022992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2795473
End bp: 2796588
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 68%
IMG OID: 637962694
Product: hypothetical protein
Protein accession: YP_569632
Protein GI: 91976973
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
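
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2795473
end_bp = 2796588

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1116 371
```

Under these assumptions the computed values agree with the listed gene length (1116 bp) and protein length (371 aa).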


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0697497
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.340753
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCATC ACTGGACTTG GCACAGGGTG AAGGCGCTGG CGCGCGAAGA GTGGCGCGCG 
CTGGTCACCA TCAATCCGAG CGACCGGCCT TGGCAGATGC CGGCGTCCGT CGCGCTGGCC
GCGGGGGCGC CGATGCTGAT CGGCGCCTAT TTCGATCACC TCGACTACGG CCTGATCTCC
TCGCTCGGCG GCATGGCGTT TCTGTATCTG CCGCGCACGC CGCTGCATCA TCGCATGGTG
TGGATGATGG CGGCGGCGTT CGGCTTCCTC GCCTGCTACA CGGTCGGCCT GATCGTGCAT
CTGCTGCCTT GGCTGCTGGT GCCAGCGATC ACCCTCACCG CGATCATGGT GACGATGGTG
TGCCGGTTCT ACCGGGTCGG TCCGCCCGGC AGCCTGTTCT TCGTGATGGC GGCCTCGATC
GCCGCCTATA CGCCGGGCGA CCTGATGCAG GTGCCGCTGA AGGTCGGGCT ATTCGCGATG
GGCAGCCTGC TCGCGACGCT GATCGCCTTC GCCTACACCC TGTTCGTCTT GCGCATCCGC
GAGCCGCTGC CGATCGAGCC GCTGGCGCCG GCGGATTTCG AGATCGTCGT GCTCGACTCG
GTGCTGATCG GCTGTGCGGT CGGCGTCTCG CTGGCTCTGG CGCAGGCGCT GCAACTGGAA
CGCCCCTATT GGGTGCCGGT GAGCTGCCTC GCGGTGATCC AGGGCCTGTC GGTGCGCGCG
ATCTGGAACA GGCAGCTGCA TCGCATCCTC GGCACCGTGC TCGGGCTGGT GCTCGCCGCG
GCCTTGCTGG CGCTGCCGCT GGAGAAATGG AGCATCGCGC TGATGGTGCT CGGGCTCAGC
TTCGTGATCG AAACCGCGGT GATCCGGCAC TACGGCTTCG CGGTGATCTT CATCACGCCC
TTGACGATCT TCCTCGCCGA CGCCGCCACG CTCGGCCAGG AAGCCCCGAG CGCGATCATC
GAGGCGCGGC TGATCGACAC CCTGCTCGGC TGTCTGGTAG GCTTCATCGG CGGCATCCTG
CTGCACAACG CCGCGTTCCG CCGGCTGGTG CGGCCGGCGA TCCGCAAACT GACGCCGCTC
CGGCTGGTGC CGGATCGCGC GCCGCGGCAG CCGTGA
 
Protein sequence
MRHHWTWHRV KALAREEWRA LVTINPSDRP WQMPASVALA AGAPMLIGAY FDHLDYGLIS 
SLGGMAFLYL PRTPLHHRMV WMMAAAFGFL ACYTVGLIVH LLPWLLVPAI TLTAIMVTMV
CRFYRVGPPG SLFFVMAASI AAYTPGDLMQ VPLKVGLFAM GSLLATLIAF AYTLFVLRIR
EPLPIEPLAP ADFEIVVLDS VLIGCAVGVS LALAQALQLE RPYWVPVSCL AVIQGLSVRA
IWNRQLHRIL GTVLGLVLAA ALLALPLEKW SIALMVLGLS FVIETAVIRH YGFAVIFITP
LTIFLADAAT LGQEAPSAII EARLIDTLLG CLVGFIGGIL LHNAAFRRLV RPAIRKLTPL
RLVPDRAPRQ P
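
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the method used to produce this record. It assumes the sequences are printed in space-separated blocks of ten characters, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G). The amino-acid assignments
# below are those of the standard code, which table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in this record.
first_line = "ATGCGGCATC ACTGGACTTG GCACAGGGTG AAGGCGCTGG CGCGCGAAGA GTGGCGCGCG"
dna = clean(first_line)

print(len(dna))                    # 60
print(translate(dna))              # MRHHWTWHRVKALAREEWRA
print(round(gc_fraction(dna), 2))  # GC fraction of this 60-base fragment only
```

The translated fragment matches the first two blocks of the listed protein sequence. Passing the full gene sequence through the same functions would allow the complete protein and the overall GC content to be compared against the fields in this record.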