Gene RPD_0468 details

Gene Information

Locus tag: RPD_0468
Symbol:
ID: 4020936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 539674
End bp: 541113
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 72%
IMG OID: 637960655
Product: hypothetical protein
Protein accession: YP_567607
Protein GI: 91974948
COG category: [S] Function unknown
COG ID: [COG4223] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
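
The lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(539674, 541113)
print(length_bp)                  # 1440
print(protein_length(length_bp))  # 479
```

Both values agree with the Gene Length and Protein Length fields of this record.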


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.252316
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTCAAGA ACAGGCCCGG ACACGAAAAT ACGCAGCACG AAAAGGACCC GCGCGAGAAC 
GCGGCGTCCG TTGGCGCAAG TGATGCGCAT CCGGAACAAG CCGCCGAGGT GGATGTGGAC
GCGCCGATCG AGGCGCTGGC CCTGAACCCC GTCGAGCCGA CCGCGGAGGA GCTTCGCGAC
GATCCGGCGC TGGAGCTATC CGCAGAGCAT CGGCAGCAGG ACGTGATCGA CGCCGAGCCG
CTGCCGGAGA CGGCGACTGA CGATCCTCCC GAGACCCGGT TCGCCTCCAC ATCGGACAAA
GCCGAGGAAG CCGCGGCGCG GTCTGCGGCC GCCGCCGAGC CGCGCCGGCC AGGCATCGTC
GCGGCGATGC TGCCGCCGGT GCTGGCGGTC GCGATCGCCG CCGCGGTGGT CGTCGGCGCC
GCCAAGACCG GGCTGCTGCC GCAGTTCCTG TCCTCGACCA GCGTGAGCGC GCCGGAGGGC
GACGTTGCGG CGATCGATGC GCTGAAGGCG CGGATCGCCG ACCTCGAAGC GCGGCCGACG
CCGACCGGTT CGAACACCGC CGCCGCAACG CCTGCCGCTG CAGCCGATCC GGCGCTGGCC
GGCAAGGTCG ACGCGCTGGA GAAGACCGTC GCCGCGCTGC GCGACGACCT CGCCACGCTC
CGCGACCGAT CCGAACAGCT CGCCAGCGCG TTGAAGGAGG TCAAGGCCGC GCCGTCCGAG
CCTACCGCGA CCGCAAGCGA ACCGCCGGCG ATGGCCTCGA CCGACAAGAC CCCGCCCGAC
AAGGCCGCTG CCGACAAGGC CGCGACCGAT TCGGCGGCGG CGCTGGCCGC GATCAACGCC
CGCTTGACCG AGCTCGAACA CGCCGCCAAG ACCGCGACCG AGGCCGCCCC GCAAGCGCCG
CAGCCCGCGG TCGTGTCCGA CGACGCGCCG CTGCGCCGTC TCGTCACCGC GACCATGCTC
GATCTGACGG TGAAGCAGGG CGCGCCCTAT GCGGCGATCC TGAAGGCCGC CGAGCCGCTC
GCCACCGAAA CGGGAGCGCT GAAGCCGCTG GAGCCGTTCG CGGCGACCGG GGTTCCCGCC
GCCGCCGCCC TCGGCCGCGA GCTGATCGCC CTGCTGCCGA AGCTGTTGTC GGGCGCCGAG
GGCGCCAGCA ACGCGAATTT CATCGACCGC TTCCAGTCCA ATGCGGAGCG GCTGATCCGG
ATCCAGCGTT CCGACGCCAC CGCCGGGATC GATCGCACCG CGATCGTCGG CCGCATTACC
GCGGCGGCGC AGCGCGGCGA CCTTGCCGAG GCGCGGCGCG AGCTGAAAGC GCTTGCGCCG
GCCGACCGCG CTCCTGTTCA ATCCTGGATC GACAAATCCG AGGCGCGCGA CCAAGCTCTC
GCCGCCTCGC ATTCCTTCGC CACCGCTGCG CTAGCCGCGC TGCAGAAACC GTCGCCATAG
 
Protein sequence
MVKNRPGHEN TQHEKDPREN AASVGASDAH PEQAAEVDVD APIEALALNP VEPTAEELRD 
DPALELSAEH RQQDVIDAEP LPETATDDPP ETRFASTSDK AEEAAARSAA AAEPRRPGIV
AAMLPPVLAV AIAAAVVVGA AKTGLLPQFL SSTSVSAPEG DVAAIDALKA RIADLEARPT
PTGSNTAAAT PAAAADPALA GKVDALEKTV AALRDDLATL RDRSEQLASA LKEVKAAPSE
PTATASEPPA MASTDKTPPD KAAADKAATD SAAALAAINA RLTELEHAAK TATEAAPQAP
QPAVVSDDAP LRRLVTATML DLTVKQGAPY AAILKAAEPL ATETGALKPL EPFAATGVPA
AAALGRELIA LLPKLLSGAE GASNANFIDR FQSNAERLIR IQRSDATAGI DRTAIVGRIT
AAAQRGDLAE ARRELKALAP ADRAPVQSWI DKSEARDQAL AASHSFATAA LAALQKPSP
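
The sequences above are printed in whitespace-separated blocks of ten characters. The sketch below shows one way to join such blocks and run basic sanity checks on a coding sequence (GC percentage, start codon, stop codon). The helper names are hypothetical, and the ATG-only start check is a simplifying assumption: translation table 11 also permits alternative start codons, which this sketch does not accept.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG,
    and it ends with a stop codon (assumption: ATG start only)."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGGCC GCGTAG")
print(toy)                  # ATGGCCGCGTAG
print(looks_like_cds(toy))  # True
print(gc_percent("ATGC"))   # 50.0
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a figure that can be compared with the GC content field listed under Gene Information.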