Gene RPD_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2108 
Symbol 
ID4022590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2359074 
End bp2360099 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID637962301 
Productextracellular solute-binding protein 
Protein accessionYP_569244 
Protein GI91976585 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAACAA TAATCGCCAG ACTCGCCGCG ATGCTGTCGG CGCTGTTGCT CACCACCACG 
CTCGCCGTTG CGCAGAGCAA GGTGACGATC GCGATCGGCG GCGGGGCATG TCTTTGCTAT
CTGCCGACGG TGCTGGCCAA GCAACTCGGC GAATACGACA AGGCCGGGCT CAGCGTCGAA
CTCGTCGATC TCAAGGGTGG TTCGGATGCG CTGAAGGCCG TGCTCGGCGG CAGCGCCGAC
GTGGTGTCGG GCTATTTCGA TCACACGGTG AATCTCGCCG CCAAGAAGCA GGAGATGCAG
TCGTTCGTGG TCTATGATCG CTATCCGGGG CTGGTGCTGG CGGTTTCGCC TGGGCACACG
GCGGAGATCA AGTCGATCAA GGACCTCGCC GGCAAAAAGG TCGGCGTCAG CGCGCCGGGC
TCATCGACCG ATTTCTTTCT CAAGTATCTT TTGAAGAAGA ACGGCGTTGA TCCGAACAAC
GTGTCGGTGA TCGGCGTCGG CCTCGGCGCC ACCGCGGTGG CGGCGATGCA GCAGGGCCAG
ATCGACGCCG CGGTGATGCT CGATCCGGCG GTGACGATTC TGCAGAGCGC CCACGCTGAT
TTGCGTATCC TCAGCGATAC GCGGACCGAG CACGACACCC GCGAGGTGTT CGGCGGTGAC
TATCCCGGCG GTGCGCTGTA CGCGACGGTG GCCTGGATCA AGGCGCATCC GAAGGAGGCG
CAGGGACTGA CCAACGCCAT CCTGAATACG CTGGGCTGGA TTCACACGCA TTCGGCGGAC
GAGATCGCCG ACAGGATGCC GCCCAACATC GTCGGCAAGG ACAGGGCGCA ATATGTCGCC
GCGTTGAAAA ACACGATTCC GATGTATTCG ACCACCGGGT TGATGGACCC GAAGGGCGCC
GATGCGGTGC TCGCGGTGTT CAGCGTCGGC TCGCCCGAGG TCGCGAAAGC CAATATCGAC
GTGACCAAGA CCTACACCAA CGCTTTCGTC GAGCAAGCGG CGAAGACGTC GGGTGCGGCG
AAGTAA
 
Protein sequence
MKTIIARLAA MLSALLLTTT LAVAQSKVTI AIGGGACLCY LPTVLAKQLG EYDKAGLSVE 
LVDLKGGSDA LKAVLGGSAD VVSGYFDHTV NLAAKKQEMQ SFVVYDRYPG LVLAVSPGHT
AEIKSIKDLA GKKVGVSAPG SSTDFFLKYL LKKNGVDPNN VSVIGVGLGA TAVAAMQQGQ
IDAAVMLDPA VTILQSAHAD LRILSDTRTE HDTREVFGGD YPGGALYATV AWIKAHPKEA
QGLTNAILNT LGWIHTHSAD EIADRMPPNI VGKDRAQYVA ALKNTIPMYS TTGLMDPKGA
DAVLAVFSVG SPEVAKANID VTKTYTNAFV EQAAKTSGAA K