Gene RPD_0338 details

Gene Information

Locus tag: RPD_0338
Symbol:
ID: 4020798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 392283
End bp: 393380
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 65%
IMG OID: 637960517
Product: hypothetical protein
Protein accession: YP_567477
Protein GI: 91974818
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
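
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 392283
END_BP = 393380
STOP_CODON_NT = 3  # assumption: the CDS ends with exactly one stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of encoded amino acids, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_NT) // 3


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1098 365
```

Under these assumptions the computed values agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields.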


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.88474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.474349
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCCT CGGAGAACAG GTTCTTCATC TTGCTGCTGT TGGTAGCCTC GATCGGCTTC 
GGCTGGATCT TGCTGCCGCT TTACAGCGCA GTGCTGTGGG GCGTGGTGAT CGCGATCCTG
TTCGCGCCGC TGTACCGGCA ACTGAACCGC GCGTTCGGCT TCCGCCGCAA TCTGGCCGCG
CTGACGACGG TCGCGATCAT CGTCACCATG GTGATCCTTC CGATCTCGCT GATCGGCGCG
GCGCTGGCGC AGGAAGCCCG GGCGATGTTC CTCCGGATCG AGTCCGGCAA TCTGGACATG
CTTCAGTATC TCAAACAGGT TCTGGCGGAG CCGCCGGCCT GGATCAGCGG GGTGCTGGAG
CGATTCAGCA TCGGCAACCT CTCCGACCTT CAGGAACGGC TTTCCGCCGT GTTGATGCGC
GGCAGTCAGT ATCTCGCGGG ACAGGCCATC GGCATCGGTC AGGGAACGGT CGAGATCGTC
ATCAACCTGG GCGTGATGGT CTATCTGCTG TTCTTCCTGC TCCGCGACGG CGATCTGCTG
GCGGCGCGTA TCCGGAGGGC CGTGCCGCTG TCGGTCGAGC AGGAAAGCCA GTTGCTGCGC
AAGTTCACGG TCGTGATCCG CGCGACCGTC AAGGGCAACA TGCTGATCGC GCTGATCCAG
GGCGCGCTCG GCGGCGTGAT GTTCTATATC CTCGGCGTCA ACGGCGCGCT GCTGTGGGGC
GTGGTGATGG CGTTCCTGTC GCTGCTGCCC GCGGTCGGCG CCGGTCTGGT CTGGCTGCCG
GTGGCGCTCT ATTTGCTCGG CACCGGATCG ATCTGGCAGG GCGTTGCGCT GATCGCCTTC
GGCACGATGG TGATCGGCAC GGTCGACAAC GTGCTGCGTC CGGTGCTGGT CGGCAAGGAC
ACGCGGATGC CGGACTACGT CGTGCTGATC TCGACGCTCG GCGGCATCCA GGTTTTTGGC
CTCAACGGCT TCGTCATCGG GCCGGTGATC GCGGCGATCT TCATCGCGGC CTGGGATATC
TACTCCGCCT CGCGGGAAGC CGCGGCGGGG ATCGGCGAAA CGACCACCGG TGGCGTGCCG
ACCAGCGATG CGCCATGA
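
The gene sequence above is printed in space-separated blocks of 10 bases, so whitespace has to be stripped before any computation. The following is a minimal sketch of how a GC fraction could be computed from such text. The helper names are hypothetical, and the sketch is not necessarily how the database derived its GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Only the first two 10-base blocks of the gene sequence, for illustration.
first_blocks = "ATGCGCGCCT CGGAGAACAG"
print(f"{gc_content(first_blocks):.2f}")  # prints: 0.65
```

Passing the full gene sequence text to gc_content would give the value for the whole gene. The result shown here covers only the first 20 bases.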
 
Protein sequence
MRASENRFFI LLLLVASIGF GWILLPLYSA VLWGVVIAIL FAPLYRQLNR AFGFRRNLAA 
LTTVAIIVTM VILPISLIGA ALAQEARAMF LRIESGNLDM LQYLKQVLAE PPAWISGVLE
RFSIGNLSDL QERLSAVLMR GSQYLAGQAI GIGQGTVEIV INLGVMVYLL FFLLRDGDLL
AARIRRAVPL SVEQESQLLR KFTVVIRATV KGNMLIALIQ GALGGVMFYI LGVNGALLWG
VVMAFLSLLP AVGAGLVWLP VALYLLGTGS IWQGVALIAF GTMVIGTVDN VLRPVLVGKD
TRMPDYVVLI STLGGIQVFG LNGFVIGPVI AAIFIAAWDI YSASREAAAG IGETTTGGVP
TSDAP