Gene RPD_1520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1520 
Symbol 
ID4022000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1694986 
End bp1696131 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content64% 
IMG OID637961715 
Productextracellular ligand-binding receptor 
Protein accessionYP_568658 
Protein GI91975999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.657552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAGGT TTCTGACAAC ATGTCTGGCG GCCGCATTTG GGCTCGGCCT GGCCGTGCAG 
GCCAAGGCGG CCGAGCCGAT CAAAATCGGC TCGGTGCTTT CGGTCACCGG CCCGGCCGCC
TTCCTCGGCG ATCCGGAGCT GAAGACCCTG CAGCTCTATG TCGAGAAGAT CAATCAGGAG
GGCGGCCTGC TCGGGCGGCC GGTGCAGCTC ATCCATTACG ATGACGGCTC CGACGCCACG
AAGGCGAATA GTTTCGGCAA GCGCCTGATC GAGGACGACA AGGTCGACGT GCTGATCGGC
GGCACCACCA CCGGCTCGAC GATGTCGATG GCGCCGCTGG TCGATCGCGC CGGAATCCCG
TTTATTTCGC TGGCGGGCGG CGTGGTGATC GTCGAGCCGG TGAAGAAATG GATGTTCAAG
ACGCCGCATA CCGACCGCAT GGCGGCGGAG CGGGTGTTCG GCGACATGAA GAAGCGCAAC
CTGACCAAGG TCGCGCTGTT GTCGGAGACC AGCGGCTTCG GCCAATCCGG CAAGAAGGAG
AGCGAGGCGG CGGCGGCGCG GCTCGGCATC ACGCTGGTCG CCAACGAGAC CTACGGTCCG
AAAGATACCG ACATGAGCCC GCAACTCACC AATATCAGGA GCACGGCGGG GGTGCAGGCG
CTGTTTATTT TCGGCCTCGG TCAGGGACCG GCGATCGCCA ACAAGAACGC CAAGATGCTC
GGGCTGAGCC TTCCGATCTA CCATGCGCAT GGCGTGGCGT CGGAGGAGTT CATCAAGCTG
TCGGATGGCG CCGCCGAGGG CATCCGCCTG CCGGCCGCCG CTCTGCTGGT GGCGAGCAAG
TTGCCGGACA ACGATCCGCA GAAGCCGATC GCGCTCGGCT ACGCCAAGGC CTACACCGAC
CGCTACAAGG AAGAGGTCTC GACCTTCGGC GGCCATGCCT ATGACGCGCT GATGATCATG
GTGTCGGCGA TCAAGCGCGC CGGCGACACC GACAAGAACA AGGTGCGCGA CGCGATCGAG
CAGACCAAGG ACCATATCGG CGCCGACGGC AAGTTCAACA TGTCGCCGAC CGACCATATG
GGCCTCGACC TGTCGGCGTT CCGGATACTG GAAGTCCGGA ACGGCGACTG GGTGCTGGTC
GATTGA
 
Protein sequence
MNRFLTTCLA AAFGLGLAVQ AKAAEPIKIG SVLSVTGPAA FLGDPELKTL QLYVEKINQE 
GGLLGRPVQL IHYDDGSDAT KANSFGKRLI EDDKVDVLIG GTTTGSTMSM APLVDRAGIP
FISLAGGVVI VEPVKKWMFK TPHTDRMAAE RVFGDMKKRN LTKVALLSET SGFGQSGKKE
SEAAAARLGI TLVANETYGP KDTDMSPQLT NIRSTAGVQA LFIFGLGQGP AIANKNAKML
GLSLPIYHAH GVASEEFIKL SDGAAEGIRL PAAALLVASK LPDNDPQKPI ALGYAKAYTD
RYKEEVSTFG GHAYDALMIM VSAIKRAGDT DKNKVRDAIE QTKDHIGADG KFNMSPTDHM
GLDLSAFRIL EVRNGDWVLV D