Gene RPD_1900 details


Gene Information

Locus tag: RPD_1900
Symbol:
ID: 4022382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2134256
End bp: 2135494
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 66%
IMG OID: 637962093
Product: radical SAM family protein
Protein accession: YP_569036
Protein GI: 91976377
COG category: [R] General function prediction only
COG ID: [COG4277] Predicted DNA-binding protein with the Helix-hairpin-helix motif
TIGRFAM ID:
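
The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2134256, 2135494)
print(length, protein_length_aa(length))  # prints "1239 412"
```

Both values match the Gene Length and Protein Length fields of this record.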


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.308107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTGC AGCGCAAGCT GGCCATTCTG GCGGATGCGG CGAAATACGA CGCCTCCTGC 
GCCTCGAGCG GGACGGAGAA ACGCGACAGC CGCGACGGCA AGGGGCTGGG CTCGACCGCG
CCCGGCATGG GGATCTGCCA TTCCTATGCG CCGGACGGCC GCTGCATCTC ACTGTTGAAG
GTACTGCTCA CCAACGCCTG CAACTATGAC TGCCTGTATT GCGTCAACCG CGCTTCGTCG
AACGTGCCGC GCGCGCGCTT CACCGTCGAC GAGGTGGTGC AGCTCACGCT CGACTTCTAT
CGCCGCAACT ACATCGAGGG GCTGTTCCTG TCGTCGGGCA TCATCCGCAG CGCCGACTAC
ACCATGGAGC AGATCGTCGA GGTCGCGCGC CGCTTGCGGG AGGAGCACCA TTTCCGCGGC
TACATCCATC TCAAGACAAT CCCGGAGGCC GACGACGCGC TGATCAGCAA AGCCGGCCGC
TATGCCGATC GCCTCAGCAT CAATATTGAA GTGCCTGAGG AGCAAAGCCT CGCCGCGCTG
GCGCCGGAGA AGAACGTCCG CGCCATCCGC CGCACTATGG GGCGGCTGCG GCTGAAGCTC
GACGAGGCCA AGGAGGCGCG CACCGCGCCG AGCCGCGCCA AGCCGCCGCG CTTCGCCCCG
GCCGGCCAGA GCACACAGAT GATCGTCGGC GCCGACGCCG CCACCGACCA GACCATCCTC
GACACCAGCG CCAATCTCTA CGGTTCCTAC AATCTCAAGC GGGTGTACTA CTCGGCGTTC
AGCCCGATTC CGGATTCCAG CCGCGCCCTG CCGCTGCAGG CTCCGCCGCT GATCCGCGAG
CACCGGCTGT ATCAGGCCGA CTGGCTGATG CGGTTCTACG GCTTCGACGC CGGCGAAATC
ATTGACCCGT CCGCAGGTAT GCTGTCGCTG GAGATCGACC CGAAGCTCGC CTGGGCGCTG
CGGCATCGCG AGCGCTTCCC GCTCGACGTC AACCGCGCCA GCCGCGAGGA TCTGCTTCGG
GTTCCGGGCT TCGGCCGCAA AGCCGTCGAG CGCATCATCG CAACGCGGCG ACACAGCGCG
ATCCGCAGCA TGGATCTCGC GCGCCTGCAC ATCCCGCGGA ACAAGGCGCT GCCGTTCATC
GTTCTCTCCG ACCACCGCCC GACGCCGCAT CTCCTCGACA GCGCGCGGCT GGCGGAACGG
TTCCGGCCGA AGGCGCAGCA ACTTGGATTT GGATTCTAA
 
Protein sequence
MDVQRKLAIL ADAAKYDASC ASSGTEKRDS RDGKGLGSTA PGMGICHSYA PDGRCISLLK 
VLLTNACNYD CLYCVNRASS NVPRARFTVD EVVQLTLDFY RRNYIEGLFL SSGIIRSADY
TMEQIVEVAR RLREEHHFRG YIHLKTIPEA DDALISKAGR YADRLSINIE VPEEQSLAAL
APEKNVRAIR RTMGRLRLKL DEAKEARTAP SRAKPPRFAP AGQSTQMIVG ADAATDQTIL
DTSANLYGSY NLKRVYYSAF SPIPDSSRAL PLQAPPLIRE HRLYQADWLM RFYGFDAGEI
IDPSAGMLSL EIDPKLAWAL RHRERFPLDV NRASREDLLR VPGFGRKAVE RIIATRRHSA
IRSMDLARLH IPRNKALPFI VLSDHRPTPH LLDSARLAER FRPKAQQLGF GF
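
The gene and protein sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block might be cleaned, translated, and checked for GC content in plain Python. All function names are hypothetical. The codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
# Standard codon assignments, enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGGACGTGC AGCGCAAGCT GGCCATTCTG"
print(translate(clean_sequence(first_blocks)))  # prints "MDVQRKLAIL"
```

The printed residues match the first block of the protein sequence. Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` and `translate` would allow the same kind of comparison against the GC content and protein sequence fields of this record.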