Gene RPD_1898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1898 
Symbol 
ID4022380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2131309 
End bp2132544 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content61% 
IMG OID637962091 
Productextracellular ligand-binding receptor 
Protein accessionYP_569034 
Protein GI91976375 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.193262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGCAC TCTCCCGATC AATCGCGACC CTGGCAACCG CTGCCCTTCT GTCCGCCGCT 
GGCGGCCAAG CGATAGCGCA GAAAAAATAC GGCCCCGGCG CCAGCGACAC CGAAGTCAAG
ATCGGCAACA TCGTGCCCTA TAGCGGCCCG GCTTCGGCCT ATGGCAGCGT CGGCCGGGCA
CAAGAAGCCT ATTTCAAGAT GATCAACGAC AAGGGCGGCA TCAACGGCCG CAAGATCGTC
TACATCTCCT ACGACGACGC CTATTCGCCG CCGAAGTCGG TCGAGCAGAC CCGCAAGCTG
GTCGAGAGCG ACGAAGTGCT GTTCATGTTC AGCCCGCTCG GCACGCCGTC CAACACGGCG
ATCCAGAAAT ATCTCAACGT CAAGAAGGTG CCGCATTTGT TCCTGGCGTC GGGCGCCACC
AAATGGAACG ACCCGAAGCA CTTCCCGTGG ACGATGGGCT GGCTGCCGAG CTACCAGAGC
GAAGGCCGGA TCTACGCCAA ATATCTGCTG AAGGAAAAGC CGGGCGCGAA GATCGCTGTG
CTGTATCAGG GCGACGATTT CGGCAAGGAC TATCTCAAGG GCCTGAGGGA TGGTCTCGGC
GACAAGGCGT CCTCGATCGT GGTCGAAGAC AGCTACGAAC TGACCGAGCC GACCGTCGAT
TCCCACATCG TCAAGATCAA GGCAGCGGCG CCCGACGTGC TGGTGATCTT CGCCACGCCG
AAATTCGCCG CGCAGACCAT CAAGAAGGTC GCTGAACTTG CCTGGAAGCC GATGATGATC
GTGCCGAACG TCTCGGCCTC GACCGGCAGC GTGATGAAAC CCGCCGGCTT CGAGAATGCC
CAGGGCATCG TCTCCGCCTC CTACGCCAAG GACGCCACCG ACAAGCAGTG GGAAAACGAC
CCCGGCATGA AGGAATACTA CGACTTCCTG GCGAAGCACG CGCCGCAGGC CAGCCGCGCC
GATTCGTCGT TCACCACCGG CTACAACATC GCCGAAACCG TCGCGATCCT GATCAAGCAG
TGCGGCGACG ATCTCACCCG CGAGAACGTG ATGAAACAGG CCGCCAACCT GAAGGACATT
CAGCTCGGCG GGCTGCTGCC GGGCATCAAG CTCAACACCA GCGCAACCGA TTTCTCACCG
ATCGAACAGC TGCAACTGAT GCGGTTCGAG GGCGAGAACT GGAAGCTGTT CGGCGACGTG
ATCGAAGGCG AAGTCGCCGC ACCGACCGGC GGCTAG
 
Protein sequence
MSALSRSIAT LATAALLSAA GGQAIAQKKY GPGASDTEVK IGNIVPYSGP ASAYGSVGRA 
QEAYFKMIND KGGINGRKIV YISYDDAYSP PKSVEQTRKL VESDEVLFMF SPLGTPSNTA
IQKYLNVKKV PHLFLASGAT KWNDPKHFPW TMGWLPSYQS EGRIYAKYLL KEKPGAKIAV
LYQGDDFGKD YLKGLRDGLG DKASSIVVED SYELTEPTVD SHIVKIKAAA PDVLVIFATP
KFAAQTIKKV AELAWKPMMI VPNVSASTGS VMKPAGFENA QGIVSASYAK DATDKQWEND
PGMKEYYDFL AKHAPQASRA DSSFTTGYNI AETVAILIKQ CGDDLTRENV MKQAANLKDI
QLGGLLPGIK LNTSATDFSP IEQLQLMRFE GENWKLFGDV IEGEVAAPTG G