Gene RPD_1889 details

Gene Information

Locus tag: RPD_1889
Symbol:
ID: 4022371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2119225
End bp: 2120382
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 64%
IMG OID: 637962082
Product: extracellular ligand-binding receptor
Protein accession: YP_569025
Protein GI: 91976366
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
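
The gene and protein lengths in the table follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2119225
end_bp = 2120382

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1158 385
```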


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.808445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.559234
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGT TCAAGCTGTC CGTCGCGGCG TTCGCCGTGG CGATCGCACT GCCGGCGCTG 
TCCGGCGCCG CGCTCGCAGA AACCAATGAA ATCACGGTGG GCATCAGCGT CACCACCACG
GGTCCTGCCG CCGCGCTCGG CATTCCGGAG CGCAATGCGC TGGAATTCGT GGTGAAGGAA
ATCAGCGGTC ATCCGATCAA GATCATCGTG CTCGACGATG GCGGCGACCC GACCGCCGCG
ACCACCAATG CGCGACGTTT CGTCACGGAA TCGAAGGCCG ACGTGATCAT GGGCTCGTCG
GTCACGCCGC CGTCGGTGGC GATCTCGAAC GTCGCCAACG AGGCGCAGAT TCCGCATATC
GCGCTCGCGC CGCTGCCGAT CACGCCGGAA CGCGCCAAGT GGTCGGTGGT GATGCCGCAG
CCGATCCCGA TCATGGGCAA GGTGCTCTAT GAGCACATGA AGAAGAACAA CGTGAAGACC
GTTGGCTACA TCGGCTATTC GGATAGCTAC GGCGACCTTT GGTTCAATGA TCTCAAGAAG
CAGGGCGAGG CGATGGGCCT CAAGATCGTC GGCGAGGAGC GCTTCGCGCG GCCCGACACG
TCGGTCGCGG GCCAGGCGCT GAAGCTCGTT GCCGCCAATC CCGACGCGAT CCTGGTCGGC
GCGTCCGGCA CCGCCGCAGC GCTGCCGCAG ACCACGCTGC GTGAGCGAGG TTATAACGGG
CTGATCTATC AGACCCACGG CGCCGCTTCG ATGGATTTCA TCCGGATCGC CGGCAAGTCC
GCGGAGGGCG TGCTGATGGC GTCGGGTCCG GTGATGGATC CGGAAGGACA GGACGACACG
GCGCTGACGA AGAAGCCCGG CATGGCGCTG GTGAAGGTCT ATGAGGAAAA GTACGGCCCG
AGCAGCCGCA GCCAGTTCGC GGGCCACTCC TACGACGCCT TCAAGGTGCT GGAGCGCGTG
GTCCCGGTTG CGCTGAAGAA GGCCAAGCCC GGCACGCAGG AATTCCGCGA GGCGCTCCGT
GAAGCGTTTC TCACTGAGAA GGACATCGCG GCGAGCCAGG GCGTGTACAA TTTCACCGAA
ACCGATCGCT ACGGCCTCGA CGACCGTTCG CGCATCCTGC TGACGGTGAA GAACGGCAAG
TATGTGATCG TGAAGTAA
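
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be measured. The following is a minimal Python sketch of that cleanup together with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and do not come from the source database. Applied to the full gene sequence, the result can be compared with the GC content listed in the Gene Information table.

```python
def clean_sequence(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as printed (six blocks of 10 bases).
first_row = "ATGACCAAGT TCAAGCTGTC CGTCGCGGCG TTCGCCGTGG CGATCGCACT GCCGGCGCTG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```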
 
Protein sequence
MTKFKLSVAA FAVAIALPAL SGAALAETNE ITVGISVTTT GPAAALGIPE RNALEFVVKE 
ISGHPIKIIV LDDGGDPTAA TTNARRFVTE SKADVIMGSS VTPPSVAISN VANEAQIPHI
ALAPLPITPE RAKWSVVMPQ PIPIMGKVLY EHMKKNNVKT VGYIGYSDSY GDLWFNDLKK
QGEAMGLKIV GEERFARPDT SVAGQALKLV AANPDAILVG ASGTAAALPQ TTLRERGYNG
LIYQTHGAAS MDFIRIAGKS AEGVLMASGP VMDPEGQDDT ALTKKPGMAL VKVYEEKYGP
SSRSQFAGHS YDAFKVLERV VPVALKKAKP GTQEFREALR EAFLTEKDIA ASQGVYNFTE
TDRYGLDDRS RILLTVKNGK YVIVK
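
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a simplified Python illustration, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in the set of permitted start codons. Because this gene starts with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order: first, second, and third
# base each cycle through T, C, A, G. "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq_text):
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence, copied as printed.
print(translate("ATGACCAAGT TCAAGCTGTC CGTCGCGGCG"))  # MTKFKLSVAA
print(translate("TATGTGATCG TGAAGTAA"))  # YVIVK
```

Both fragments start on a codon boundary, and their translations match the first ten and last five residues of the protein sequence above.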