Gene RPD_2818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2818 
Symbol 
ID4023316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3139172 
End bp3140782 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content71% 
IMG OID637963016 
Producthypothetical protein 
Protein accessionYP_569947 
Protein GI91977288 
COG category[S] Function unknown 
COG ID[COG1376] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.746954 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGGG CAATCCAATC GAGGCAAGGC GTGGTGATTC GATCCAACAC TAAGCTGCAC 
GCCATGACGG ACCGGCGTTT CGGAATTGCG GCCGTCGCAA CGATCGCCGC CTTGGTCACG
TGGATCGCGC TGACCGGCGA CGCGCTCGCC AAGCGCGCGC GCCCGGCTCC GCCGGTCGAG
GCGACGGCGC CGCGCGAAGC GGGTGAGCCG ATCATGGCGA TCGTCTCGAT CAAGTCGCAG
CAGGTCACGC TGTACGATGC CGATGGCTGG ATCCTGCGCG CGCCGGTGTC GACCGGCACT
TCGGGGCGGG AGACGCCGGC CGGCGTGTTT GCGATCCTCG AGAAGCGCAA GGACCATCGC
TCCAGCATGT ATGACGACGC CTGGATGCCG AACATGCAGC GCATCACCTG GAACGGCGTC
GCGCTGCACG GCGGCCCGCT GCCGGGCTAC GCCGCCTCGC ATGGCTGCGT ACGGATGCCC
TACGGCTTTG CGGAGAAACT GTTCGACAGG ACCCGGATCG GGATGCGGGT GATCGTCTCG
CCGACCGATG CGGCGCCGGT CGATATCGCG CATCCCGCGC TGTTGACGCC GGACCCGGCC
GCGATCGCCG CCGTTCCGAC GCGCGCCGTG ACGCTGAGCC GCGAGGCCGA CGACGCGACC
AAGGCGGCGG ACGACGCAAG GAAGGCCGCC AAGGCCGCGG CCCGCGAAAT CGCGCCGCTC
AAGGCCACGC TCCGCGGGCT CGAACGGGCC AAGGCGCGCG CCGATGCCGA TCTGACGCGC
GCCGACCGGT TGCTCGCCGG GGCGAAAACC GACGAGGCCA AGGCGCGGGC CGAAGCGCTG
CAGCAGAACG CCGCGCAGCA GGCCGGGGAG GCCGCGGCGC GACTCGCCAC CGCCACGGCG
GACGCCGAGG CGAAGCGGGC GCTGGCGGAC GCGACCAGGG ACGCCGCCAA GGCGGCCGAG
GCCAAGAAGG CCAACACCGC CAAGGCGGCG CTGGACGCGA AGCTCGCGCA GGAGCCGGTC
TCTATCTACA TCAGCCGCGC GACTCAGAAG CTCTACGTCC GCCGCAACAC CCACAAGCCT
GCGCCGGACG GCGGCGGCGA GGTGTTCGAC GCGACCATCG AGGTTCCGGT CACGATCCGC
GAATCCGACA GACCGATCGG CACCCATGTG TTCACCGCGA TGGCGAAGAG CGACGCCGGC
CTGCGCTGGA GCGCGGTGAC CATCGACGGT CCCGACGACG CCAGAGAGGC GCTCGACCGC
GTCACCATTC CGCCGGAGGT GCTGGACCGG ATCGGCCCGA CCGCGTTGCC GCGCAGCTCG
ATCGTCATTT CCGACGAGCC GCTGAGCGCC GAAACCAACT ACCGCACCGA ATTCGTCGCG
GTGCTGAGCC ATCAGCCGCA GGGCGGCTTC ATCACCCGCA AGCCGACGCC CAACGTCGTC
GTCGCCGACC GCGACGACAA TTGGGACGGC GGTTTCGGTT CGTTCTTCTT CCCGCGCGCG
CCTGAGCCGC AGCCGCAACC GCGAAACCCG CGGCAGCAGC GCGGCGGCAG AGGCTATCCG
CAGCCGATGC AGCAGCAGGG CTGGCAGCCG GGGTGGCAAC CGAGCTGGTA G
 
Protein sequence
MRRAIQSRQG VVIRSNTKLH AMTDRRFGIA AVATIAALVT WIALTGDALA KRARPAPPVE 
ATAPREAGEP IMAIVSIKSQ QVTLYDADGW ILRAPVSTGT SGRETPAGVF AILEKRKDHR
SSMYDDAWMP NMQRITWNGV ALHGGPLPGY AASHGCVRMP YGFAEKLFDR TRIGMRVIVS
PTDAAPVDIA HPALLTPDPA AIAAVPTRAV TLSREADDAT KAADDARKAA KAAAREIAPL
KATLRGLERA KARADADLTR ADRLLAGAKT DEAKARAEAL QQNAAQQAGE AAARLATATA
DAEAKRALAD ATRDAAKAAE AKKANTAKAA LDAKLAQEPV SIYISRATQK LYVRRNTHKP
APDGGGEVFD ATIEVPVTIR ESDRPIGTHV FTAMAKSDAG LRWSAVTIDG PDDAREALDR
VTIPPEVLDR IGPTALPRSS IVISDEPLSA ETNYRTEFVA VLSHQPQGGF ITRKPTPNVV
VADRDDNWDG GFGSFFFPRA PEPQPQPRNP RQQRGGRGYP QPMQQQGWQP GWQPSW