Gene RPD_4056 details


Gene Information

Locus tag: RPD_4056
Symbol:
ID: 4024573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4508065
End bp: 4509249
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 64%
IMG OID: 637964259
Product: extracellular ligand-binding receptor
Protein accession: YP_571176
Protein GI: 91978517
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
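
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4508065, 4509249)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Both values agree with the Gene Length and Protein Length fields of this record.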


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.423392
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.164952
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTCATC TTTCCATTGT TGCAGCCGCA GCCCTCACGC TGACCGCGAC GGTCGCCGCC 
CGAGCCGACG ACCTCAAGAT CGCGCTGATC TACGGCAAGA CGGGACCGCT CGAAGCCTAC
GCCAAGCAGA CCGAAACCGG CCTGATGATG GGCCTCGAAT ACGCGACCAA GGGCACGATG
ACGCTCGACG GCCGCAAGAT CAAGGTGATC ACCAAGGACG ATCAGAGCAA GCCCGACCTC
TCCAAGGCCG CGCTCGCCGA AGCCTATCAG GATGACGGCG CCGACATCGC GATCGGCACC
TCGTCGTCGG CGGCGGCGCT GGCGGACCTG CCGGTCGCCG AAGAGAACAA GAAAATCCTG
ATCGTCGAGC CCGCGGTCGC CGACCAGATC ACCGGCGAGA AGTGGAATCG CTACATCTTC
CGCACCGGCC GCAATTCCTC GCAGGACGCG ATCTCCAACG CGGTCGCAAT CGGCAAGCAA
GGCGTCACCA TCGCCACGCT GGCGCAGGAC TACGCGTTCG GCCGCGACGG CGTCGCCGCC
TTCAAGGAGG CGCTGACCAA GACCGGCGCG ACGCTCGCCG CCGAGGAATA TGTTCCGACC
ACCACCACCG ACTTCACCGC GGTCGGGCAG CGGTTGTTCG ACACGCTGAA AGACAAGCCC
GGCAAGAAGA TCATCTGGGT GGTCTGGGCC GGCGGCGGCG ATCCCTTGAC CAAGCTGCAG
GACATGGACC CGAAGCGCTA CGGCATCGAA CTGTCCACCG GCGGCAACAT CCTGCCGGCA
CTCGCCGCCT ACAAGCGACT GCCCGGCATG GAAGGCGCGA CCTATTACTA TTACGACATC
CCGAAGAACC CGATCAACGA CTGGCTGGTG ACCGAGCATC AGAAGCGCTT CAACGCGCCG
CCGGATTTCT TCACCGCCGG CGGTTTCTCC GCGGCGATGG CGGTGGTCAC CGCCGTGCAG
AAGGCGAAAT CGACCGACAC CGAGAAGCTG ATCGCGGCGA TGGAAGGCAT GGAGTTCGAC
ACGCCGAAGG GCAAGATGAT GTTCCGCAAG GAAGACCATC AGGCGCTGCA GAGCATGTAT
CACTTCAAGG TCAAGGCCGA TCCGAACCTC GCCTGGGCCG TGCTCGAGCC GGTGCGGGAG
CTGAAGATCG AGGACATGAC GATCCCGATC AAGAACAAGC GGTAA
 
Protein sequence
MRHLSIVAAA ALTLTATVAA RADDLKIALI YGKTGPLEAY AKQTETGLMM GLEYATKGTM 
TLDGRKIKVI TKDDQSKPDL SKAALAEAYQ DDGADIAIGT SSSAAALADL PVAEENKKIL
IVEPAVADQI TGEKWNRYIF RTGRNSSQDA ISNAVAIGKQ GVTIATLAQD YAFGRDGVAA
FKEALTKTGA TLAAEEYVPT TTTDFTAVGQ RLFDTLKDKP GKKIIWVVWA GGGDPLTKLQ
DMDPKRYGIE LSTGGNILPA LAAYKRLPGM EGATYYYYDI PKNPINDWLV TEHQKRFNAP
PDFFTAGGFS AAMAVVTAVQ KAKSTDTEKL IAAMEGMEFD TPKGKMMFRK EDHQALQSMY
HFKVKADPNL AWAVLEPVRE LKIEDMTIPI KNKR
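
The gene and protein sequences above can be related with a short script. This is a minimal sketch under stated assumptions, not the database's own method: it uses the standard codon assignments (which translation table 11 shares), reads an initial ATG, GTG, or TTG codon as methionine (a simplification of the table 11 start-codon set), and ignores the spaces and line breaks used in the listing. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only these initiator codons are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, dropping a trailing stop codon if present."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [s[i:i + 3] for i in range(0, len(s), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if residues and residues[-1] == "*":
        residues.pop()  # the stop codon encodes no residue
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # an initiator codon is read as methionine
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate_cds("GTGCGTCATC TTTCCATTGT TGCAGCCGCA"))  # MRHLSIVAAA
# Last 15 bases of the gene sequence, ending in the TAA stop codon.
print(translate_cds("AAGAACAAGC GGTAA"))  # KNKR
```

The two printed fragments match the start and end of the protein sequence listed above. Applied to the full gene sequence, `gc_percent` gives a value that can be compared with the GC content field of the record.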