Gene RPD_4253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4253 
Symbol 
ID4024774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4721690 
End bp4723387 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content64% 
IMG OID637964459 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_571371 
Protein GI91978712 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.417802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.174095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCTCGA GGCGGGAATT TCTGCAGGTG ACTGCGGCTG CGTCGGCGCT GACGCTCGCC 
GGCGGCCTCG GTCCGGTTGG ACGCGTTGCG GCACAGCAGC GGCTGACGCA GGCCGACATC
CTGAAATTCG ATCCGCTGGG CACGGTGACG CTGCTACACA TCACCGACCT CCACGCCCAA
CTGATGCCGC TGCATTTCCG CGAGCCGTCG GTCAATCTCG GAGTCGGCGA GGTCAAGGGC
AAGCCGCCGC ATCTCACCGA CGCGGAATTC CGCAACTACT TCCACATCGC TACCGGCTCT
CCGGACGCTC TCGCGCTGAC CGCCGATGAT TTCGTCGCGC TCGCCCGCAA CTATGGCCGG
ATGGGCGGCA TGGACCGGAT CGCGACGCTG GTCGGCGCGA TCCGCGCGGA GCGCGGCGAC
GACAAGGTGC TGCTGCTCGA CGGCGGCGAC GCATGGCAGG GAAGCTGGAC TTCGCTGCAG
ACCAAGGGCC AGGACATGAT CGACGTCCTG AGCGCGCTCA AGATCGACGC GATGACCGGC
CATTGGGAGT TCACCTACGG CGCCGATCGC GTCAAGCAGG TCGCCGAACA GGCGTCATTC
GCCTTTCTCG CGCAGAACGT CCGCGACAAC GAATGGCAGG AACCGGTGTT CGAGGCGCGC
AAGATGTATG AGCGCGGCGG CGTGAAGGTC GCCGTGATCG GACAGGCGCT GCCGCGCACC
GCGATCGCCA ATCCACGCTG GATGTTCCCG AAATGGGAGT TCGGCATCCG CGAAGAGGAC
ATGCAGAAGC AGGTCGACGA CGCGCGCGCC GAAGGCGCCG AGGTCGTGGT TCTGCTGTCG
CACAATGGCT TCGACGTCGA CCGCAAGCTC GCCGGGCGCG TGAAGGGCCT CGACGTCATC
CTCACCGGTC ACACCCACGA CGCGATGCCG GGCCTGGTCA AGGTCGGCGA CACCGTTCTG
GTGGCGTCGG GCTCGCACGG CAAATTCGTG TCGCGGCTCG ATATCGCGGT GAAGGACAAG
AAGGTCTCCG ATATCCGCTT CAAGCTGATG CCGGTGTTCG CCGACGCCAT CAAGCCGGAT
CCGGCGATGG CGCAACTGGT CGAGAAGCTG CGTGCGCCTT TTGCCAAGGA TCTCGCCCGC
GTCGTCGGCA AGACCGACTC GCTGCTGTAT CGCCGCGGCA ATTTCAACGG CACGTTCGAC
GATCTGATCT GCGAAGCGAT GTTGAAGCAG CGCGACACCG AGATCGCCCT GTCGCCCGGT
TTCCGCTGGG GCGGAACGCT GCTGCCGAAC GATGACATCA CCTGGGAAGC GATCACCAAC
GCCACTGCGA TCACCTATCC GAACTGCTAC CGCACCGAGA TGACCGGCGA GCAGCTCAAG
ATCGTGCTCG AGGACATTGC CGACAACATC TTCCATCCCG ATCCCTATTT CCAGGGCGGC
GGCGACATGG TGCGCACCGG CGGCATGGGC TATGCGATCG ACGTCGGCAA GGAGATCGGC
TCGCGGATCT CCAACATGAC GCATCTCAAG ACCGGCAAGC CGATCGAGGC GTCGAAGAAA
TACACGGTCT CCGGCTGGGC CAGCATCAAC GAAAACACCG AGGGCCCGCC GATCTGGGAG
GTGCTGTCCA AGCACGTCGC GCAGGCCGGT CCGGTGAAGA TCGATCCCAG CAGCGCGGTC
AAGGTTTCAG GAGCCTGA
 
Protein sequence
MISRREFLQV TAAASALTLA GGLGPVGRVA AQQRLTQADI LKFDPLGTVT LLHITDLHAQ 
LMPLHFREPS VNLGVGEVKG KPPHLTDAEF RNYFHIATGS PDALALTADD FVALARNYGR
MGGMDRIATL VGAIRAERGD DKVLLLDGGD AWQGSWTSLQ TKGQDMIDVL SALKIDAMTG
HWEFTYGADR VKQVAEQASF AFLAQNVRDN EWQEPVFEAR KMYERGGVKV AVIGQALPRT
AIANPRWMFP KWEFGIREED MQKQVDDARA EGAEVVVLLS HNGFDVDRKL AGRVKGLDVI
LTGHTHDAMP GLVKVGDTVL VASGSHGKFV SRLDIAVKDK KVSDIRFKLM PVFADAIKPD
PAMAQLVEKL RAPFAKDLAR VVGKTDSLLY RRGNFNGTFD DLICEAMLKQ RDTEIALSPG
FRWGGTLLPN DDITWEAITN ATAITYPNCY RTEMTGEQLK IVLEDIADNI FHPDPYFQGG
GDMVRTGGMG YAIDVGKEIG SRISNMTHLK TGKPIEASKK YTVSGWASIN ENTEGPPIWE
VLSKHVAQAG PVKIDPSSAV KVSGA