Gene Rsph17029_3587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3587 
Symbol 
ID4898155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp679707 
End bp680744 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content67% 
IMG OID640114196 
Productextracellular solute-binding protein 
Protein accessionYP_001045450 
Protein GI126464337 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID[TIGR01096] lysine-arginine-ornithine-binding periplasmic protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0949603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGA CACGCGCGAC CGGCCTATCC GTGCTTGCGG CCCTCCTGCT CGCCGGAGCG 
GGCACCGCCG CCTTCGCCCA GTCGGGCGAC ACGCTGGCCG CGGTGAAGGA GCGGGGCGAG
GTCCTCTGCG GCGTCCATCC CGCGCGCCAC GGCTTCGCGG CCCCCGACAG CCAGGGCAAA
TGGAGCGGCT TCGAGGTCGA TTTCTGCCAC GCCATCGCGG CGGCGGTGTT CGGCGATGCG
AACAAGGTTC GGTTCGTGGC GCTCTCCTCG CAGCAGCGCT TCCCGGCGAT CCAGTCGGGC
GAGGTCGATG TGCTGGCGCG CAACGTGACC GCCACGCTCA GCCGCGACAC GGCGCTCGGG
CTGAACTTTG CGCCGCCGAT CTTCTACACC GGCACGGGCT TCCTCGTGCG CGCGGCCGAC
GGGATCGAGA AGGTCGAGGA TCTGGACGGC GCGGCGATCT GCATGGCGCC CGGCTCCACC
ACCGAGCGCA ACGTGGCGCA GATCTTCGCT GCCCGCGGGC TCAGCTACAC GCCCGTCGTG
ATCGAGAACA ACAAGCAGCT GGTCGATGCC TATGTCACGG GGCGCTGCGA CGCGCTGACC
AAGGACAAGG CGGCGCTTCC GGGCGTGCGG GCCTTCGACA CCGAGGTTCC CGGAGACCAT
GTGCTGCTGC CCGGCATCTA TTCCAAGGAG CCGCTCGCCA TGGCCGTGCG TCAGGGCGAC
GACAAATGGT ACGATCTGGT GAAATGGGTG ACCTACGCCA CCTTCAACGC CGAGGAACTG
GGCGTGACCC AGGCCAATGT CGACGAGATG AAAGCCTCGG ACGATCCCGA CATCCAGACC
CTTCTGGGCG TGATCGGCGA CAACGGCACG AAGCTCGGGG TGCCCAACGA CTGGGCCTAT
GCGATCGTGA AGCAGGTCGG CAACTACGAA GACATCTACA TGCGCCATTT CGGGCCCGAC
ACCCCCGTGG CGCTCGACCG CGACCAGAAC CAGCTCTGGA CCGAGGGCGG GCTGCTCTAC
GGCTTCCCGA TGCACTGA
 
Protein sequence
MNQTRATGLS VLAALLLAGA GTAAFAQSGD TLAAVKERGE VLCGVHPARH GFAAPDSQGK 
WSGFEVDFCH AIAAAVFGDA NKVRFVALSS QQRFPAIQSG EVDVLARNVT ATLSRDTALG
LNFAPPIFYT GTGFLVRAAD GIEKVEDLDG AAICMAPGST TERNVAQIFA ARGLSYTPVV
IENNKQLVDA YVTGRCDALT KDKAALPGVR AFDTEVPGDH VLLPGIYSKE PLAMAVRQGD
DKWYDLVKWV TYATFNAEEL GVTQANVDEM KASDDPDIQT LLGVIGDNGT KLGVPNDWAY
AIVKQVGNYE DIYMRHFGPD TPVALDRDQN QLWTEGGLLY GFPMH