Gene RPD_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0121 
Symbol 
ID4020577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp138971 
End bp140191 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content61% 
IMG OID637960298 
Productextracellular ligand-binding receptor 
Protein accessionYP_567262 
Protein GI91974603 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.455844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCC TGTATCGGGC CGCAGCCGCG GCTGTAGCTG CAATCGCTCT CACAGCCGCG 
CCCGCCGCGG CACAGAAGAA ATACGACACC GGCGCCACCG ACACCGAAAT CAAGATCGGC
CAGACCGTGC CGTTCTCCGG TCCCGCTTCG GCCTATGCCG GTATCGGCAA AACCCAGGCC
GCCTATATGC GGATGATCAA CGATTCCGGC GGAATCAATG GCCGCAAGAT CAACCTCATT
CAATATGACG ACGCCTATTC GCCGCCCAAG GCGGTCGAGC AGGTGCGCAA GCTGGTCGAA
GGCGACGAGG TTCTGCTGAC CTTCCAGATC ATCGGGACGC CGTCGAACGC CGCGGTGCAG
AAATATCTCA ACGGCAAGAA GGTGCCGCAG CTGCTCGCCT CGACCGGCGC GACCCGCTTC
ACCGATCCGA AGAGTTTCCC CTGGACGATG GGCTACAACC CGAACTACCA GACCGAAGCC
CGGATCTATG CGCGCTACAT CCTGAAGAAC CACCCCAATG CCAAGATCGG CATCATGTAC
CAGAACGACG ACCTGGGGCG TGATTACCTC GCCGGGCTGA AGGCGGGACT CGGCGACAAG
GCCGCCGCGA TGATCGTGGC GGAGACCTCC TACGAACTGT CCGACCCGAC CGTCGACTCG
CAGATCGTCA AGCTCAAGGC CGCCGGCGTC GACCTCTTCT TCAACGCCTC GACGCCGAAA
TTCGCCGCGC AGGCGATCAA GAAGGTCGCC GACCTCGACT GGCGCCCGAT CCACATTCTC
GACATCAATG CGAGCCCGGT GTCCTCGACG CTGAAACCGG CGGGCCTGGA CATCTCCAAG
GGCATCATCA GCGTCAATTA CGGCAAGGAC CCGGCCGACC CGCAATGGAA GGACGATCCC
GGCGTCGCGA AATATCTCGC CTTCATGGAC AAGTACTATC CGGAGGGTGA CAAGATGTCG
ACGATCAACA CCTACGGCTA CTCGACCGCG CAATTGCTGG TCACCATCCT GAAGCAATGC
GGCGACGACC TCACCCGCGA CAACGTCATG AAACAGGCGG CGAATCTCAA GAACGTGACC
GGCGACCTGT CGCTGCCGGG CATGGTGATC AACACCTCGC CGACCGATTA TCGCATCAAC
AAGCAGCTTC AGATGATGAA GTTCAACGGC GAGCGCTGGG AGCTGTTCGG CCAGATCATC
GAAGACGACC AAGCGATGTA A
 
Protein sequence
MKSLYRAAAA AVAAIALTAA PAAAQKKYDT GATDTEIKIG QTVPFSGPAS AYAGIGKTQA 
AYMRMINDSG GINGRKINLI QYDDAYSPPK AVEQVRKLVE GDEVLLTFQI IGTPSNAAVQ
KYLNGKKVPQ LLASTGATRF TDPKSFPWTM GYNPNYQTEA RIYARYILKN HPNAKIGIMY
QNDDLGRDYL AGLKAGLGDK AAAMIVAETS YELSDPTVDS QIVKLKAAGV DLFFNASTPK
FAAQAIKKVA DLDWRPIHIL DINASPVSST LKPAGLDISK GIISVNYGKD PADPQWKDDP
GVAKYLAFMD KYYPEGDKMS TINTYGYSTA QLLVTILKQC GDDLTRDNVM KQAANLKNVT
GDLSLPGMVI NTSPTDYRIN KQLQMMKFNG ERWELFGQII EDDQAM