Gene RPD_0796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0796 
Symbol 
ID4021270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp894262 
End bp895482 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID637960986 
Productextracellular ligand-binding receptor 
Protein accessionYP_567935 
Protein GI91975276 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTGC GCTCCACTTC CGTCTTCACG TCCGCAGGCG CTTTCGCCGC GGTTTTGCTC 
GCCGCCACAT CGGTGGCTGC GGCGGAGAAG AAGTACGATC CCGGCGCCAG CGATACCGAA
ATCAAGATCG GCCAGACTGT TCCGCATTCC GGTCCGGGTT CGCTGTACGG CGTGCTCGGC
CGCGTCGGCG AAGCCTATTT CCAGATGCTG AACGAGAAGG GCGGCATCAA CGGGCGCAAG
GTGAAGTTCC TGACCCTGGA CGATTCCTAC AGCTCCCCGA AGGCGGTGGA GGCGACCCGC
CGGCTGGTCG AGCAGGAAGA GGTGCTGGCG CTGTATGGCT CGCTCGGCAC CGCGCCGCAG
ACCGCTGTGC ACAAATACCT CAACAGCAAG AAGGTCCCGC AGCTTCTGCT GAACACCGGT
GCGTCGAAGT GGAACGACCC GAAGAACTTC AAATGGACCA TGGCGGGCCT GCCGCTGTAC
CCCACCGAGG CCCGCATCCT CGCCAAACAC GTCGTGAGCG TGAAGCCCGA CGCCAAGATC
GCGATTCTCT ACCAGAACGA CGATTTCGGT CGTGACTTCC TCGGACCGTT CAAGAAGGTT
CTGGAAGAAG CCGGCGGCAG GGCCAAGGTG GTCGCCGAGG CCAGCTACGA CCTGACCGAG
CCGACCATCG ATTCGCAGAT GATCAACCTG TCGAAGTCCG GCGCCGACGT GTTCTACAAC
ATCACCACCG GCAAGGCGAC GTCGCAGTCG ATCCGCAAAG TGGCCGAGCT CGGCTGGAAG
CCGCTGCAGT TGTTGTCGGC GGGCTCGACC GGCCGTTCGA TTCTCAACGC CGCCGGCATC
GAGAATGCGA CCGGTATCGT CGCGATCCGC TACTCCAAGG AGGTCGGTGT GCCGCGATTT
GAGAACGATC CGGACGTCAA GGCGTTCGAG GAGTTCCGCC AGAAGTATCT GCCGAACGTC
GACAAGGACA ACACCATCGC CTACGCCGGC TACGGCCAGG TGGTGACGAT GGCCGAGATC
CTGCGCCGCT GTGGCGACAA CCTCACCCGC GAGAACGTGC TGAAGCAGGC GACCTCGCTG
AAGGGCTTCC ACTCGCCGTA TTTCCTCGAC GGCATCGAAT ATAGCTACAC GTCGGACGAC
TACACGCCGA TGAAGACCCT CTACATCTCG ACCTTCAACG GCAAGGACTG GGACATCTCC
GACAAGCCGG TCACCGAATA A
 
Protein sequence
MNLRSTSVFT SAGAFAAVLL AATSVAAAEK KYDPGASDTE IKIGQTVPHS GPGSLYGVLG 
RVGEAYFQML NEKGGINGRK VKFLTLDDSY SSPKAVEATR RLVEQEEVLA LYGSLGTAPQ
TAVHKYLNSK KVPQLLLNTG ASKWNDPKNF KWTMAGLPLY PTEARILAKH VVSVKPDAKI
AILYQNDDFG RDFLGPFKKV LEEAGGRAKV VAEASYDLTE PTIDSQMINL SKSGADVFYN
ITTGKATSQS IRKVAELGWK PLQLLSAGST GRSILNAAGI ENATGIVAIR YSKEVGVPRF
ENDPDVKAFE EFRQKYLPNV DKDNTIAYAG YGQVVTMAEI LRRCGDNLTR ENVLKQATSL
KGFHSPYFLD GIEYSYTSDD YTPMKTLYIS TFNGKDWDIS DKPVTE