Gene Rsph17029_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2359 
Symbol 
ID4897098 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2495499 
End bp2497436 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content65% 
IMG OID640112955 
Productextracellular solute-binding protein 
Protein accessionYP_001044233 
Protein GI126463119 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0844875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.182874 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAAG TCACGGCGCG CACGGCGCAG GGCAGGGTCG CAGTCTCGAG ACTTCCCGAC 
GTGCGGTCAT GGCTTCTGGG GGGGCTCGGC CTGCTGGCCG CCGCCGCCGC GGTGCTGCCC
GCCCACGCGC AGGACGCGCC GAAGATCATC AAGGCGCACG GCATCTCGAC CTTCGGCGAT
CTGAAATATC CGGCCGACTT CACCCATCTC GATTACGTCA ATCCCGACGC ACCCAAGGGC
GGCGAGATCT CGGAATGGAC CTTCGGCGGC TTCGATTCGA TGAACCCCTA TTCGGTGAAG
GGCCGGGCCG CGGCCCTCTC GTCGATCATG TATGAATCGA TCCTCGCGGG CACGGCCGAC
GAGATCGGCG CGGCCTACTG CCTGCTCTGC GAGACGCTCG AATATCCCGA GGACCGCAGC
TGGGTGATCT TCAACCTGCG TCCCGAGGCG AAATTCTCGG ACGGCACCCC CGTCACCGCA
GAGGACGTGG TCTTTTCCTA CGAGACCTTC GTGGCCAAGG GTCTCACCGA TTTCCGCACC
ATCTTCGCCC AGCAGGTCGA GGGGGCCGAG GCGCTCGACA CGCATCGGGT GAAGTTCACC
TTCAAGAAGG GCATCCCCAC CCGCGATCTG CCGCAGGACG TGGGCGGGCT GCCGGTCCTG
TCCAAGGCGC AGTATGAGCG TGAGGGGCTC GACCTCGAGG AGGGAAGCCT GAAGCCCTTC
CTCGGCTCGG GCGCCTATGT GCTCGACGAG AGCCGGATGA AGGTGGGCCA GACGGTCGTC
TACCGCCGCA ATCCCGACTA CTGGGGCAAG GACCTGCCGC TCATGCGCGG CACCGGAAAT
TTCGACGCGA TCCGCATCGA ATATTACGCC GACTACAATG CGGCCTTCGA GGGCTTCAAG
GGCGGCAGCT ACACCTTCCG CAACGAGGCC TCCTCGATCC TCTGGGCCAC GGGCTACGAC
TTCCCGGCCG TGCAGACCGG CCATGTGGTG AAGGTCGAGC TGCCCTCGGG CGCCAAGGCC
ACGGGGCAGG GCTGGATGCT GAACCTCCGG CGCGAGAAGT TCCAGGACCC GAAGGTGCGC
GAGGCGCTGA ACCTCATGTT CAACTTCGAA TGGTCGAACC AGACGCTGTT CTACGGCCTC
TATACCCGCG TCGATTCCTT CTGGGAAAAC AGCTACCTCG AGGCGGAGGG CGCGCCCTCC
GAGGCCGAGG CGGCGCTTCT GAAGCCGCTC GTCGATGAGG GCCTGCTTCC GGCCTCGATC
CTCACCGAGC CCCCGGTCAG CCCGCCCGTC TCTGGCGAAC GGCAGCTCGA CCGCAGGAAC
CTGCGGGCGG CCAGCAAGCT CTTGGACGAG GCGGGCTGGA CCGTGGGCTC GGACGGGATG
CGCCGCAACG CCAAGGGCGA GGTGCTGCGC GTCGAATTCC TCAACGACAG CCAGACCTTC
GACCGGGTTA TCAGCCCCTT CGTCGAGAAC CTGCGCGCGC TGGGCGTGGA TGCGCTGATG
ACGCGCGTGG ACAATGCCCA GATGGAAAGC CGCACCCGGC CGCCGAGCTA CGATTTCGAC
ATCACCACCG GCAATGCGCG CACCAACTAC ATCTCGGGCG CCGAGTTGAA GCAGTATTAC
GGGTCGGAGA CCGCCGACAT CTCGGCCTTC AACATCATGG GCCTGAAGGA CAAGGCGGTG
GACCGGATGA TCGAGGTGGT TCTGGCCGCC AAGACCTCCG AGGAGCTCGA GGTGGCGACG
AAGGCGCTCG ACCGGGTGCT GCGGCTGCAG CGGTTCTGGG TGCCGCAATG GTACAAGGCC
AGCAACACCG TCGCCTATTA CGACATGTTC GAGCATCCCG AGACCCTGCC GCCCTATGCG
CTGGGCGAGC TGGACTTCTG GTGGTTCAAC CCCGACAAGG CCCAGGCGCT GCGTGACGCG
GGCGCCTTGA GACAGTAA
 
Protein sequence
MGEVTARTAQ GRVAVSRLPD VRSWLLGGLG LLAAAAAVLP AHAQDAPKII KAHGISTFGD 
LKYPADFTHL DYVNPDAPKG GEISEWTFGG FDSMNPYSVK GRAAALSSIM YESILAGTAD
EIGAAYCLLC ETLEYPEDRS WVIFNLRPEA KFSDGTPVTA EDVVFSYETF VAKGLTDFRT
IFAQQVEGAE ALDTHRVKFT FKKGIPTRDL PQDVGGLPVL SKAQYEREGL DLEEGSLKPF
LGSGAYVLDE SRMKVGQTVV YRRNPDYWGK DLPLMRGTGN FDAIRIEYYA DYNAAFEGFK
GGSYTFRNEA SSILWATGYD FPAVQTGHVV KVELPSGAKA TGQGWMLNLR REKFQDPKVR
EALNLMFNFE WSNQTLFYGL YTRVDSFWEN SYLEAEGAPS EAEAALLKPL VDEGLLPASI
LTEPPVSPPV SGERQLDRRN LRAASKLLDE AGWTVGSDGM RRNAKGEVLR VEFLNDSQTF
DRVISPFVEN LRALGVDALM TRVDNAQMES RTRPPSYDFD ITTGNARTNY ISGAELKQYY
GSETADISAF NIMGLKDKAV DRMIEVVLAA KTSEELEVAT KALDRVLRLQ RFWVPQWYKA
SNTVAYYDMF EHPETLPPYA LGELDFWWFN PDKAQALRDA GALRQ