Gene RPD_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4226 
Symbol 
ID4024747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4692917 
End bp4693870 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content62% 
IMG OID637964432 
ProductTRAP transporter solute receptor TAXI family protein 
Protein accessionYP_571344 
Protein GI91978685 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0963703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.823727 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCA GATTTTTGGG CATTGCCGTG GCGTCGGCCG CGGTCCTGTC GGTGTCGGCG 
CCGTTGGCCC ACGCCCAGCA ATTCGTCAAC GTGCTGACCG GCGGAACCTC GGGCGTGTAT
TATCCGCTCG GCGTCGCGAT CGCGAAAATT TACGGCGAGA AGATCCCCAA TGTGAAGTCT
CAGGTCCAGG CCACCAAGGC GTCGGTCGAG AATTTGAATC TGCTGCAGCA GGGCCGCGGC
GAGATCGCCT TCACCCTCGG CGACTCGCTC AAGGCCGCAT GGGACGGTGA CGCCGAAGCC
GGCTTCAAGG CCAAGCTCGA CAAGCTGCGG GTGATCGGCG CGATCTATCC GAACTACATC
CAGATCGTTT CGACCGCGGA GTCCGGGATC AAGACGCTTG CCGATCTCAA GGGCAAGAGC
CTGTCGGTCG GCGCGCCGAA GTCCGGCACC GAGCTGAATT CGCGGGCGAT CCTGAAGGCC
GCCGGAATGG AGTACAAGGA TCTCGGCAAG GTCGAGTATC TGCCGTTTGC CGAATCCGTC
GACCTGATGA AAAATCGCCA ACTCGCCGCC ACGCTGCAGT CGGCCGGCCT CGGCGTCGCT
TCGCTGAAGG ATCTCAGCAA TTCGTCCGAA GTCAATGTGG TGTCGGTGCC GAAGGAGATC
GTCGACAAGA TCGGCCCGCC GTTCATCGCC GAGACGATCC CGGCCGGCAC CTATAAGGGC
CAAGACAAGG ACGTCCCGAC CGCAGCGGTG GTGAACTATC TCGTCACCAG CAGCGCGGTG
TCCGACGACC TCGCTTATCA GATGACCAAG CTGGTCTATG ACGCGCTCCC GGAACTCGCC
AGCGCTCACT CCGCCGGCAA GGGCATCAAG CTCGAGACCG CCGCCACCGA CAGCCCGGTT
CCGTTGCACC CCGGTGCCAT CAAGTACTTC AAGGAAAAGG GCGTGCTGAA GTAG
 
Protein sequence
MKARFLGIAV ASAAVLSVSA PLAHAQQFVN VLTGGTSGVY YPLGVAIAKI YGEKIPNVKS 
QVQATKASVE NLNLLQQGRG EIAFTLGDSL KAAWDGDAEA GFKAKLDKLR VIGAIYPNYI
QIVSTAESGI KTLADLKGKS LSVGAPKSGT ELNSRAILKA AGMEYKDLGK VEYLPFAESV
DLMKNRQLAA TLQSAGLGVA SLKDLSNSSE VNVVSVPKEI VDKIGPPFIA ETIPAGTYKG
QDKDVPTAAV VNYLVTSSAV SDDLAYQMTK LVYDALPELA SAHSAGKGIK LETAATDSPV
PLHPGAIKYF KEKGVLK