Gene RPD_2063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2063 
Symbol 
ID4022545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2311136 
End bp2312281 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content68% 
IMG OID637962256 
Producthypothetical protein 
Protein accessionYP_569199 
Protein GI91976540 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.80257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.561686 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACA TGTTCGAGAC CGCGAGTTGC TTCCGCCGCC GGTTTGTGGC TGGCGCTGTG 
CTGTTCGCCG CTGTGCTTTT GGACATGGCG ACGCTTGGCT CCGGCGCGAG CGCGCAGCAG
GGCTATCCGC CTCCGGGCGG CGCGCCGCAA GCCGCGGTCA ATCCGGTCTG CCCGCGGCTC
GAGGCACAGC TCGCCTCGAT CGACCGCGGC GGCGCCGACC CGGCGCGCGC CGAGCAGATC
CGGCGCTACG AGGATTCCGT GAACCGCCAG CAGGCCGAGC TCGATCGGGT CACGATGCAG
GCGAAGCGGA TGGGCTGCGA CAGCTCCGGC TTCTTCTCGC TGTTCAACGG CCAGTCGGCG
CAATGCGGCC CGGTCAATAA TCAGATCCAG CAGATGCGCG GCAATCTCGA CCAGATGACC
TCCGGCCTCG AGCGGCTACG CGGCGGCGGC CCTGGCGGCG GCGAGCGCGA CAACCAGCGC
AGGTCGGTGC TGATGGCGCT GGCGCAGAAC AATTGCGGCC CGCAATATGC CGCGGCGGCG
CAAGGCGGCG GCGGTTTTCT CGATAATCTG TTCCGCGGCA ACCCTCAGGG CGCTCCCAGC
GCGCTGCCCG ATTTCAACAC CGACTCCGGC ACCTTCCGCA CGGTCTGCGT CCGCACCTGC
GACGGTTTCT ACTTCCCGAT CTCCTTCGCC ACCGTGCCGG CACGCTTCGC CGATGACGAG
AAGACCTGCA AGAACCTGTG CCCGGCGTCC GAAGCGGCGC TTTACGCTCA CCGCAATCCG
GGCCAGGACA TGAACCAGGC GGTATCCATC AACGGCCAGC CCTACACCTC GCTGCCGGCA
GCATTCCGCT ATCGTCAGGA GTTCAACCCG GCTTGCTCGT GCAAGGCGGC GAATCAGAGC
TGGGCGGACG CGCTGAAGGG CGTCGACGAT ACCTCTGCAC GCGAACACGG CGACATCATC
GTCACTGAAG AGAGCGCGAA GCGGATGGCG CTGCCGCCAG CGCAACGGGC GGCAGCCCAG
CGCAAGGGCA CCACCGCAGC GCCTGCGCCC GCAACTGGCG ACGCCAAGCC GCCGGCGACG
ACGGGGTCGT CCGATCCGAA CACGATCCGC TCAGTCGGCC CGACCTTCCT GCCGAAGATG
CAGTAA
 
Protein sequence
MPDMFETASC FRRRFVAGAV LFAAVLLDMA TLGSGASAQQ GYPPPGGAPQ AAVNPVCPRL 
EAQLASIDRG GADPARAEQI RRYEDSVNRQ QAELDRVTMQ AKRMGCDSSG FFSLFNGQSA
QCGPVNNQIQ QMRGNLDQMT SGLERLRGGG PGGGERDNQR RSVLMALAQN NCGPQYAAAA
QGGGGFLDNL FRGNPQGAPS ALPDFNTDSG TFRTVCVRTC DGFYFPISFA TVPARFADDE
KTCKNLCPAS EAALYAHRNP GQDMNQAVSI NGQPYTSLPA AFRYRQEFNP ACSCKAANQS
WADALKGVDD TSAREHGDII VTEESAKRMA LPPAQRAAAQ RKGTTAAPAP ATGDAKPPAT
TGSSDPNTIR SVGPTFLPKM Q