Gene RPD_0652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0652 
Symbol 
ID4021123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp735384 
End bp737810 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content71% 
IMG OID637960840 
Producthypothetical protein 
Protein accessionYP_567791 
Protein GI91975132 
COG category[S] Function unknown 
COG ID[COG3002] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.66111 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCATC CCCAATCCCC GGCGATGGCC GATCTGAAAC GCCTCGAAGC CGCGGCGGAT 
CGCGCGGCGC GGGCGATTCC TCCGGTCTGG CCGCTGGCGT CGAGCGTCGC GGTCAATCCC
TTCCTCGGGC AGACCAGCGA GAGCCTCGCC ACGGCGGGCG CACGCCTTGC GCGGGTCGCG
GGCGTCGCCG TCACCATGCC GCGGCGCTGG TTTCACGACA GGATTTCAGC CGGCGTGATT
TCCGACGAGG ACCTGCTGGA GGCCTGGCTC TCGGCGCCGC GCGATCTCCG GCCGGCCGAT
CTGGCGGCGC TGAAGGCCGC CGCGGCGTCG GATACGCCCA AGCCGCGCGC CCTGCCGTCG
ATCGCCGATC TCGCGGCGGA CGTCTCCGGC GTCGACTGGC CGGGCCTGAT CGCCGAGCGG
TTCGGCGCCT GGGCGGCGGG CTATTTCGAC GAAGGCCAGG CGCTGTGGGC CGCGCCGCAT
GGCAAGGGCG CCTTTGCCGC CTGGCAGGCG GTGACGACGC ACGATCTGAC GCCGGAAATC
GCCGGGCTGC GTGGCTTCGC CTTCCATGTC TCGGAAGCTC CCGACAGCGC GCTCGCGGTG
ATCGCCCGCG CCGCCGAACG GCTCGGCCTC AAGCAGGCGG CCATGGACAG CTATTTCCAC
CAGATGCTCA TGACCCTCGG GGGCTGGGCG CAATATGCCC GCTACGCGCT CTGGCAGGCG
GAGCTCGCCG GCGGCTCGGA TCAGACAATC ACCGACCTGC TGGCGATCCG GCTGATCTGG
GAAGAGGCGC TGTGGCTTCG CTACGCGCCG CAGATCGCCG CCAGGTGGGC GAGCGTGTCC
GCGGCGCATG GCGCGCCGAT CGCGGCGACG CCCGACCTCG TGACCGACGC GATCCTTCAA
GAGGCGGCGG AGCGCGCGGC GCAGCGGGCG CTCGCGAACA CGCTGGCGAA GCCCGCCATT
GCGGCGATCG CTGATCGCCC GGCGCTGCAG GCCGCGTTCT GTATCGACGT CCGCTCGGAG
GTGTTCCGCC GCGCGCTCGA GAGCGTCAAT CCGAAGGTCC AGACGCTCGG CTTCGCCGGC
TTCTTCGGAC TGGCCACCGC GCACCGCCGC CTGGCGTCCG ATATCGACGA GCTGCGGCTT
CCGGTGCTGC TCAACCCCGC GCTCAGATCC TGCGCCGGCG GCCCCGATGT CGCATCGCGC
GATCGGTCCG AACGGGTCAA GGCCCGGGCG ACGCGGGCCT GGGGCCGGTT CAAATTCGCC
GCCGTCTCCT CTTTCGCCTT CGTCGAGGCG ACGGGTCCGA TCTATGTCGG CAAGCTCGTG
ACCGATGCGC TCGGACTGCG CCCCGCGCCG GCGGCGAACG ACCCCGCGCC GCTGCTCAGC
CCCGCACTCG ATCTCGCCGA CCGGACCCGG GCGGCCGCTG CCGTGTTGCG GGCGATGTCG
TTGACCGATC GCTTCGCGCG GCTGGTGGTG CTGGCGGGGC ACGGCGCCAA TGTCGTCAAC
AATCCGCACG CCAGCGGGCT GCAGTGCGGC GCCTGCGGCG GCTATTCGGG GGAGGTCAAC
GCCCGGCTGC TGGCGGCGCT GCTGAACGAT ACGAAGGTCA GGGCCGGGCT GACGCCGGAC
GGCATCGCGA TTCCCGCAGA CACGCTGTTC CTCGCGGCGC TGCACGACAC CACCACCGAC
GCGGTGACCC TCTACGCCGA CGATCATCCC TCCGCCGCGC ATCAGCACGA TATCAGCCAG
GCCCGGATCT GGTTCGCCGC GGCGGGCAAG CTCGCCCGGG GCGAGCGCGC GCTTCGGCTG
CCGCGGGCGG CTCATCAGGG CTCCGTCGCA AGACGCGGCC GCGACTGGGC CGAGACGCGC
CCCGAATGGT CGCTCGCCGG ATGCAAGGCG TTCATCGCCG CGCCCCGGAC CCGCACCACC
GGCAGGAGCC TCGACGGCCG CGCCTTCCTG CACGACTACG ACTGGAAGCA GGACACCAGC
TTCGGCGTAC TCGAACTGAT CCTGACCGCG CCGGTCGTGG TGGCGAGCTG GATCAGCCTG
CAATATTACG GATCGACCGT GGCGCCCGAA ATATTCGGGG CCGGCAACAA GCTGCTCCAC
AACGTCACCG GCGGAATCGG CGTCGTCGAA GGCAATGGCG GCCTGCTCAG GGCCGGCCTT
CCGTGGCAAT CGGTCCATGA CGGCGCAAGC TACGCACACG ACCCGTTGCG CCTGTCGGTC
TGCATCGAAG CGCCCCGCGA GGCGATCAGC GACGTGCTGA GCCGCCACGA CAATGTGCGG
GCGCTGTTCG ACAATGGCTG GCTGCATTTG TTCGCGCTCG ATGAGGCAGG ACGGATGGCC
TGGCGCTACG CGGGCGATCT GCAATGGAGC GCGATGAGTC CCGTCGAAGC CGCCGATCCG
CAGCCGCGGC TCAAGGTCGC GGTCTGA
 
Protein sequence
MSHPQSPAMA DLKRLEAAAD RAARAIPPVW PLASSVAVNP FLGQTSESLA TAGARLARVA 
GVAVTMPRRW FHDRISAGVI SDEDLLEAWL SAPRDLRPAD LAALKAAAAS DTPKPRALPS
IADLAADVSG VDWPGLIAER FGAWAAGYFD EGQALWAAPH GKGAFAAWQA VTTHDLTPEI
AGLRGFAFHV SEAPDSALAV IARAAERLGL KQAAMDSYFH QMLMTLGGWA QYARYALWQA
ELAGGSDQTI TDLLAIRLIW EEALWLRYAP QIAARWASVS AAHGAPIAAT PDLVTDAILQ
EAAERAAQRA LANTLAKPAI AAIADRPALQ AAFCIDVRSE VFRRALESVN PKVQTLGFAG
FFGLATAHRR LASDIDELRL PVLLNPALRS CAGGPDVASR DRSERVKARA TRAWGRFKFA
AVSSFAFVEA TGPIYVGKLV TDALGLRPAP AANDPAPLLS PALDLADRTR AAAAVLRAMS
LTDRFARLVV LAGHGANVVN NPHASGLQCG ACGGYSGEVN ARLLAALLND TKVRAGLTPD
GIAIPADTLF LAALHDTTTD AVTLYADDHP SAAHQHDISQ ARIWFAAAGK LARGERALRL
PRAAHQGSVA RRGRDWAETR PEWSLAGCKA FIAAPRTRTT GRSLDGRAFL HDYDWKQDTS
FGVLELILTA PVVVASWISL QYYGSTVAPE IFGAGNKLLH NVTGGIGVVE GNGGLLRAGL
PWQSVHDGAS YAHDPLRLSV CIEAPREAIS DVLSRHDNVR ALFDNGWLHL FALDEAGRMA
WRYAGDLQWS AMSPVEAADP QPRLKVAV