Gene Rsph17025_0526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0526 
Symbol 
ID5082794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp526397 
End bp528328 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content65% 
IMG OID640482081 
Productextracellular solute-binding protein 
Protein accessionYP_001166737 
Protein GI146276578 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0241638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.268494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGAG TCGCGGCGCG CACGGCGCAG GGCAGGGTCG CAACCTCGAG ACTTCCCGAC 
GTGCAGTCAT GGCTTCTGGG GGGGCTTGGC CTTCTGCTCG TCACGGCCGC GGCGCTTCCG
GTCCGCGCGC AGGAGGCTGA GACGATCATC CGCTCGCATG GCATCTCGAC CTTCGGCGAA
CTGAAATACC CGGCCGATTT CACGCATCTC GCCTATGTGA ACCCCGACGC GCCCAAGGGG
GGCGAGATCT CGGAATGGAC CTTCGGCGGG TTCGATTCGA TGAACCCCTA CTCGGTGAAG
GGCCGGGCCG CCGCGCTCTC GTCGATCATG TACGAATCGA TCCTCACCGG CACTGCCGAC
GAGATCGGCG CCTCCTATTG CCTGCTTTGC GAGACGCTGG AATATCCCGA GGATCGCAGC
TGGGTGATCT TCAACCTGCG CCCCGAGGCG AAGTTCTCGG ACGGAACGCC GGTGACGGCC
GAGGATGTGG TCTTTTCCTA CGAGACCTTC GTCACCAAGG GGCTGACCGA TTTCCGCACC
GTCTTTGCCC AGCAGGTCGA AAGCGCCGAG GCGCTCGACA GTCACCGGGT CAGGTTCACC
TTCAAGCCGG GCATCCCCAC CCGCGACCTG CCGCAGGACG TGGGCGGCCT GCCGGTCCTG
TCCAAGGCCC AGTACGAGCG CGAGGGGCTG GATCTGGAGG AGGGCAGCCT CAAGCCCTTC
CTCGGCTCGG GGCCCTATGT GCTCGACCGG ATGAACGTGG GCCAGACGGT CGTCTATCGC
CGCAATCCCG ACTACTGGGG CAATGATCTG CCGATCAACC GCGGGCGGGG CAATTTCGAC
ACGATCCGCA TTGAATATTA CGCCGATTAC AACGCGGCCT TCGAGGGCTT CAAGGGCGGC
AGCTACACCT TCCGCAACGA GGCCTCCTCG ATCCTCTGGG CCACGGGCTA CGACTTTCCT
TCGGTCGACG CAGGCCATGT GACGAAGGTC GAACTGCCCT CGGGCGCCAA GGCCACCGGG
CAGGGCTGGA TGATGAACCT GCGCCGCGAG AAGTTCCAGG ACCCCCGCGT GCGCGAGGCG
CTGGGCCTGA TGTTCAACTT CGAATGGTCC AACGCGACGC TGTTCTACGG CCTCTATACC
CGTGTCGATT CCTTCTGGGA AAACAGCTAC CTAGAGGCCG AGGGTTCGCC GTCCGAGGCC
GAGGTGGCGC TGATGAAGCC GCTGGTGGAC GAGGGGCTGC TGCCCGAGTC GATCCTGACG
GACCCTCCCG TCAGCCCCGC CGGATCGGGC GACCGGCAAC TCGACCGTGG CAACCTGCGC
GCGGCGAGCC GGCTTCTGGA CGAGGCCGGC TGGGCGGTGG GCGCCGACGG CCTGCGCCGC
AACGCCAGCG GCGAGGTGCT GCGGATCGAG TTCCTGAACG ACAGCCAGAC CTTCGACCGC
GTCATCAACC CGTTCATCGA GAACCTGCGG GCGCTCGGGA TCGACGCGCT GATGACGCGG
GTGGACAACG CCCAGATGGA AAGCCGCACC CGGCCTCCGA GCTACGATTT CGACATCACC
ACCGGGAACG CGCGCACGAA CTACATCTCC GGCTCGGAAC TCAAGCAGTA TTACGGGTCG
GAAACCGCCG ATGTCTCCAG CTTCAACATG ATGGGGCTCA AGAGCCCGGC GGTGGACCGG
ATGATCGAGA TGGTGCTGGC GGCCCATACC TCGGACGAGC TTGAGGTGGC GACCAAGGCG
CTGGATCGTG TGCTGCGGCT TCAGCGGTTC TGGGTGCCGC AATGGTACAA GGCCAGCCAC
ACCGTCGCCT ATTACGACAT GTACGAGCAT CCCGAGGAAC TTCCGCCCTA TGCGCTGGGA
GAGCTGGACT TCTGGTGGTT CAACCCCGAC AAGGCCGAGG CCTTGCGCGC GGCGGGCGCG
CTGAGACGCT AA
 
Protein sequence
MGGVAARTAQ GRVATSRLPD VQSWLLGGLG LLLVTAAALP VRAQEAETII RSHGISTFGE 
LKYPADFTHL AYVNPDAPKG GEISEWTFGG FDSMNPYSVK GRAAALSSIM YESILTGTAD
EIGASYCLLC ETLEYPEDRS WVIFNLRPEA KFSDGTPVTA EDVVFSYETF VTKGLTDFRT
VFAQQVESAE ALDSHRVRFT FKPGIPTRDL PQDVGGLPVL SKAQYEREGL DLEEGSLKPF
LGSGPYVLDR MNVGQTVVYR RNPDYWGNDL PINRGRGNFD TIRIEYYADY NAAFEGFKGG
SYTFRNEASS ILWATGYDFP SVDAGHVTKV ELPSGAKATG QGWMMNLRRE KFQDPRVREA
LGLMFNFEWS NATLFYGLYT RVDSFWENSY LEAEGSPSEA EVALMKPLVD EGLLPESILT
DPPVSPAGSG DRQLDRGNLR AASRLLDEAG WAVGADGLRR NASGEVLRIE FLNDSQTFDR
VINPFIENLR ALGIDALMTR VDNAQMESRT RPPSYDFDIT TGNARTNYIS GSELKQYYGS
ETADVSSFNM MGLKSPAVDR MIEMVLAAHT SDELEVATKA LDRVLRLQRF WVPQWYKASH
TVAYYDMYEH PEELPPYALG ELDFWWFNPD KAEALRAAGA LRR