Gene RPD_0008 details

Gene Information

Locus tag: RPD_0008
Symbol:
ID: 4020462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 9267
End bp: 10505
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 60%
IMG OID: 637960184
Product: divergent AAA region
Protein accession: YP_567149
Protein GI: 91974490
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
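
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function name `expected_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Start bp and End bp as listed in the record.
gene_len, protein_len = expected_lengths(9267, 10505)
print(gene_len, protein_len)  # 1239 412
```

Under these assumptions the result agrees with the listed gene length of 1239 bp and protein length of 412 aa.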


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTTGTTC GCGAAATTGG CGCATGGCGG CTGGTCGGCG GCGAAACCGA CCGCATCGAG 
TGCAAGGCAG GTTTCCGGCT TCAGCCGGAG GATCGGTTCT CGAAGGCGCT TCGCGCGATC
GCTGGGTTGG CCAACAACAA GGGCGGCTAC ATCCTATTCG GCGTCACGGA CGGGACCTAC
CAGGCGGATG GACTTTCCGA CGACGTATTC ACAAAATCAG ACATCTCGCT TCTCAACAGA
ATCTTGGCGA GCGCTCTTGA CCCCGTTCCT CACGTCACAA AGGGCCTCAT CGAGCTCGGC
GGAAAGCAGG TGGGTGTTCT CTACGTGGAA AAGCACGATC ATGGCCCCGT CATTGCCGTC
AAGAACGTCA GTCAAGATGT GAAGGAAGGA GGCATCTATT TCCGGTACGT CGGAGAAACC
CGCCTGATCA AGCCTGGAGA GCTCAGGCAG ATCATCGCCG CGCGCGAACA GCGGGCGGTC
GCTGAATTCA GTGCCCGCAT GAATCGCGTC GCTGTTGGTA AGGAAGCTAC GATCGACCTC
GATTCCGGCG AGGTCGCCGG CACAAGCGGC AAATTCCTCA TCGACAAATC TTTGCTCTCC
AGCATTCAGT TCGTGCGCGA GGGCGAGTTC GACGAAAAAA AGGGAGCGCC TGCACTCAGA
CTGATTGGCG ACGTCGAGCC CGTTTCGGCG GTGGAAAGGG AGCGAACGAG GGTTATCCGC
GAGAACGTGA CCCCCGACGC GGTCGTCCGC AACTTCCTGC GGAACGAGAA GGTCGCGGAG
CCGACGCAGT ACATCCATTT CCAGGCTCAC TCCCAACGGA AGTGGTTTCC CGTATGGTTC
TACATAGATC AGACGCGGTC GACCGCCTCC GAGGTCGCCG AGGATCTGCG CAAACAGGTC
GCCACCTATC CGTCGTCGCG CGACGCGCTG GTCGACCGGC TCGCGGGAAA GGACGCAGCC
TTCCGCCAAT CCACCGGGAA GGCCGAAGCT CTGCGCGCGA AGTTAGCGCG GGGCGACATC
AAGGCCCCGA CCGACATCGA CGCAGACGTC GTTTTCGCTG GTGCTGTCCA AGCGCTGCCT
ACGACTATGA AGCCAAAAGA CCTTGAGAGC ATTAGGACGG CCCTGCTCGA TTGCCTGGAT
CGCGCGCAGG ACACCGACCC CCGCAGCAGC AATCGTCGCG GAGCCATCTA CCGGGCCGCA
TGCCGTCTCG ACGAGTTGCT TTACTCGAAG AAAAGGTGA
 
Protein sequence
MFVREIGAWR LVGGETDRIE CKAGFRLQPE DRFSKALRAI AGLANNKGGY ILFGVTDGTY 
QADGLSDDVF TKSDISLLNR ILASALDPVP HVTKGLIELG GKQVGVLYVE KHDHGPVIAV
KNVSQDVKEG GIYFRYVGET RLIKPGELRQ IIAAREQRAV AEFSARMNRV AVGKEATIDL
DSGEVAGTSG KFLIDKSLLS SIQFVREGEF DEKKGAPALR LIGDVEPVSA VERERTRVIR
ENVTPDAVVR NFLRNEKVAE PTQYIHFQAH SQRKWFPVWF YIDQTRSTAS EVAEDLRKQV
ATYPSSRDAL VDRLAGKDAA FRQSTGKAEA LRAKLARGDI KAPTDIDADV VFAGAVQALP
TTMKPKDLES IRTALLDCLD RAQDTDPRSS NRRGAIYRAA CRLDELLYSK KR
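
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the listed sequence is the coding strand read 5' to 3', and it relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    A trailing incomplete codon is ignored.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as displayed in the record.
first_row = "ATGTTTGTTC GCGAAATTGG CGCATGGCGG CTGGTCGGCG GCGAAACCGA CCGCATCGAG"
print(translate(first_row))  # MFVREIGAWRLVGGETDRIE
```

The printed residues match the first 20 residues of the protein sequence shown in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.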