Gene RPD_2094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2094 
Symbol 
ID4022576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2343389 
End bp2345137 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content65% 
IMG OID637962287 
Producthypothetical protein 
Protein accessionYP_569230 
Protein GI91976571 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.429967 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAAG CTGTTTCCGC TCCGGCGTTC GGCAAGATCA TCTCGGTGCG CGGGTCGATG 
GCGCGCGCGG GCCTGTTGCC CGAAAAGCAC CTGTCGCCTG CCGCAATCCG GGCCACGGTC
GGCCGCTTCA TCAGCATCCG TACGGCGAGT TCCACCATCA TCGCGATCAT CACCGAAGTA
TCGTGTGAAG ACGTGGTCGG CGACCAGTAC AGCGCCAGCG CCTCGGTCGA TCTGCTCGGC
GAAATCCTGC CCGGTCCGGC GCGCGCGAAG TTTCAGCGCG GCGTCACCAA CTATCCGACG
ATCGGCGACG CGGTCGAAAT GATCACCAGC GAAGACCTGC GCGTCGTTTA TGCGCCGACC
GGCTCCGACC AGATCAATGT CGGCACGCTG CAGCAGGACC CGTCGGTGAT CGCCTATGTC
GACATCGAGG AAATGCTGTC GAAGCACTTC GCGGTACTCG GCTCGACCGG CGTCGGCAAA
TCGACCGGTG TATCGCTGCT GCTCAACGAG ATCCTGAAAT CTCGGCCGGC CTTGCGCGTC
TTCCTGCTCG ACGTTCACAA CGAGTACGGC CGTTGCTTCG GCGATCGCGC GCTGGTGCTC
AACCCGCGCA ACCTCAAGCT GCCGTTCTGG CTGTTCAACT TCGATGAAAT CGTCGACGTG
CTGTTCGCCG GCCGCCCGGG CGTTCCGGAA GAACTCGACG TCCTCGCCGA GGTGATCCCG
ATCGCGAAGG GCCTGTACGT GCAGTACACC AACGCCGACC GGATCGGGCT GAAGCGGATG
GATCCGAAGT CGGTCGGCTA TACCGCCGAC ACGCCGGTGC CGTATCGCCT TGTCGATCTG
ATCTCGCTGA TCGACGAGCG CATGGGCAAG CTCGAGAACC GCTCGTCGCG CATCATCTAT
CACAAGCTGA TCTCGCGCAT CGAGACCGTG CGCAACGACC CGCGCTACGC TTTCATGTTC
GACAACGCCA ATGTCGGCGG CGACACCATG GCGGAGGTGA TCAGTCACCT GTTCCGGCTG
CCCGCCAATG GTCGCCCGAT GACGATCATG CAGCTCGCCG GCTTCCCGGC CGAGGTCGTC
GACTCGGTGG TGTCGGTGCT GTGCCGGATG GCGTTCGATT TCGGGCTGTG GAGCGACGGC
GTCTCGCCGC TGCTGTTCGT CTGCGAGGAA GCGCATCGCT ACGCGGCCGC CGACCGTTCG
ATCGGTTTCG GCCCGACCCG CAAGGCGGTG TCGCGGATCG CCAAGGAAGG CCGCAAATAC
GGCGTCTATC TCGGCTTGGT GTCCCAGCGC CCGGCGGAAC TCGACGCGAC GATCCTGTCC
CAGTGCAACA CGCTGTTCGC GATGCGGCTC GCCAACGACC GCGACCAGTC GCTGCTGCGC
TCGGCGGTGT CGGACGCTGC CGCCAATCTG TTGTCGTTCG TGCCTTCGCT CGGAACCCGC
GAAGTGCTGG CGTTCGGCGA AGGCGTCGCG CTGCCGACCC GGCTGCGCTT CAAGGAAGTG
CCAGTGCAGC AATTGCCGCG TTCGGAAGCG GCGATCTCGA CCGTGCCGTC GGCGACCGCG
GGCCACGACA TGCATTTCGT CAGCGCGGTG CTGGAACGCT GGCGAGGCGC CACCTCGCAT
CGCGACATTC CGAACGATCC AGGCGTGGTC GAGCGGCCGC TGGCACGCAC CATGGACGCT
CCGATGCTGC AGCCCTCGCT CGGGCTCGAT CCCGACCGTT TCTCGCTGCT GAAGAAGCCG
CTGCGCTGA
 
Protein sequence
MAEAVSAPAF GKIISVRGSM ARAGLLPEKH LSPAAIRATV GRFISIRTAS STIIAIITEV 
SCEDVVGDQY SASASVDLLG EILPGPARAK FQRGVTNYPT IGDAVEMITS EDLRVVYAPT
GSDQINVGTL QQDPSVIAYV DIEEMLSKHF AVLGSTGVGK STGVSLLLNE ILKSRPALRV
FLLDVHNEYG RCFGDRALVL NPRNLKLPFW LFNFDEIVDV LFAGRPGVPE ELDVLAEVIP
IAKGLYVQYT NADRIGLKRM DPKSVGYTAD TPVPYRLVDL ISLIDERMGK LENRSSRIIY
HKLISRIETV RNDPRYAFMF DNANVGGDTM AEVISHLFRL PANGRPMTIM QLAGFPAEVV
DSVVSVLCRM AFDFGLWSDG VSPLLFVCEE AHRYAAADRS IGFGPTRKAV SRIAKEGRKY
GVYLGLVSQR PAELDATILS QCNTLFAMRL ANDRDQSLLR SAVSDAAANL LSFVPSLGTR
EVLAFGEGVA LPTRLRFKEV PVQQLPRSEA AISTVPSATA GHDMHFVSAV LERWRGATSH
RDIPNDPGVV ERPLARTMDA PMLQPSLGLD PDRFSLLKKP LR