Gene RPD_0285 details

Gene Information

Locus tag: RPD_0285
Symbol:
ID: 4020744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 330097
End bp: 331176
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 67%
IMG OID: 637960465
Product: hypothetical protein
Protein accession: YP_567426
Protein GI: 91974767
COG category: [S] Function unknown
COG ID: [COG4246] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
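
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields.
START_BP = 330097
END_BP = 331176


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                  # 1080 bp
    print(protein_length(length), "aa")  # 359 aa
```

Under those assumptions the computed values agree with the listed Gene Length (1080 bp) and Protein Length (359 aa).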


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.106647
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACTG GCGACGGCGG GGCCGCAGCG GGGCGGTTGC TCAGCCGCCG CCGCGTCGTC 
GCACTCGCCG CGGGTCTCGC TTTCGCGGCG CACCCGGCGC TGGCGCAGAG CCAAGCGCAG
CGCTTCGATC CGGAGGAATT CTCCACCCCG GCCCCCGAGC GCATCGAGGT GCGGGCGCGG
CCGATCGAGT CCTTCGATCT CCGCGACCGC GCCAGCCGAC GGTTCGGCGC GCTGCAGTTT
CGCAGCGGTC TGGTGCTGAC CTCGCCGTTC CGCGGCTTCG GCGGCCTGTC GGGGCTGAAG
CTCGATCCGA AAGGCGAGCG CTTCGTCGCG ATCAATGACC GCGGCGCCTG GATCACCGGC
CGCATCGTGT ACAGCGGCGC CGAGATGACC GGGCTCGCCG ACGTCGAGGC GGCGCCGCTG
CTCGGCCCCG ATGGACAGCC GCTGACGCGG CGCAAATGGT ACGATAGTGA ATCGCTCGCA
TTCGACGGCG GCACAGCCTA TGTCGGCTTC GAACGCGTCA ACCAGATCGT GAAATTCGAC
TTCGGCCGCG ACGGCGTCCG GGCGTTGGGG CAGCCGATCG CGGTGCCGCC GGCGCTGCGC
AAATTGCCGA ACAACAAGGG CATCGAATCG CTGGTGGCGG TGCCGAAGGG CCAGCCGCTG
GCCGGAACGC TGATTGCGAT CTCCGAGCGT GGTCTCGACG CCGATCGCAA CGTCATCGGC
TTTCTGATCG GCGGAAAAAC CCCTGGCCAG TTCGCGGTTC GCCGCACCGA GGATTTCGAC
ATCAGCGATG CGATGCTGCT ACCATCGGGC CAATTGCTGA TCCTCGAGCG CAAGTTCTCC
TGGATCCACG GCGTGCACAT CCGGATCAGG CGGATCGCGC TGTCGACGCT GACGCCCGGC
GCGATCGTCG ACGGCCCCGC GCTTTTCAAC GCCGATCTAG GCCACGAGAT CGACAACATG
GAAGGCATCG ACGCGCATCT CGACGCGTCC GGCGCCACCG TGCTGACGCT GGTCTCGGAC
GACAATTTTT CGATGCTGCA GCGGACGTTG CTGTTGCAGT TCACCCTCGT CGAGGATTGA
 
Protein sequence
MTTGDGGAAA GRLLSRRRVV ALAAGLAFAA HPALAQSQAQ RFDPEEFSTP APERIEVRAR 
PIESFDLRDR ASRRFGALQF RSGLVLTSPF RGFGGLSGLK LDPKGERFVA INDRGAWITG
RIVYSGAEMT GLADVEAAPL LGPDGQPLTR RKWYDSESLA FDGGTAYVGF ERVNQIVKFD
FGRDGVRALG QPIAVPPALR KLPNNKGIES LVAVPKGQPL AGTLIAISER GLDADRNVIG
FLIGGKTPGQ FAVRRTEDFD ISDAMLLPSG QLLILERKFS WIHGVHIRIR RIALSTLTPG
AIVDGPALFN ADLGHEIDNM EGIDAHLDAS GATVLTLVSD DNFSMLQRTL LLQFTLVED