Gene Rsph17025_0763 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0763 
Symbol 
ID5083527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp775121 
End bp776116 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content64% 
IMG OID640482321 
ProductABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein 
Protein accessionYP_001166974 
Protein GI146276815 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4521] ABC-type taurine transport system, periplasmic component 
TIGRFAM ID[TIGR01729] taurine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0103291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC GAAGCGTCCT CGCAGGCGCA GTGTCCGCCC TTGGCCTTCT CGCGGCCCAG 
GCCCAGGCCG AAGCGCCCGG CGAGATCACC GTCGCCTACT TCCTCGAATG GCCGCTGCCC
TTCGAGGCCG CCAAGAGCCT TGGCACCTTC GAGAAGGAAC TGGGCGTCAA GATCAACTGG
CGCGCCTTCG ACACGGGCAC CGCCATGTCG GCGGCCTTCG CCTCGGGCGA CGTGCAATAC
GCCATCAGCC AAGGCGTTCC GCCCTTCGTC ACCGCGGTCT CGGCCGGTCA GGATCTGATC
GTGATCGATG CGGCCGTAGG CTACGCCGAG AACGACAACT GCGTGGTGCG ATCGGATCTG
GAAATCACGC GAGACAACGC GAAGGATCTG GAAGGCAGGA AGGTCGGCCT GCCCATCGGC
ACCGCGCCGC ATTTCGGTTT CCTGAAGGTG GCCGAGCATC TGGGCATCGA CGTCTCGAAG
GTGCAGGTGG TGGACATCGC GCCGGCCGAG GCGGCGGCCG CCTTCGCGGC AGGCAATTTC
GACGCGGTCT GCGGCTGGGG CGGCCCGCTC AGGCGCATGA AGGAACATGG CAACGTGCTG
CTGACCGGCG CTGAAAAGGA AGCCATCGGG ATCAAGATCT TCGATGTCGT CTCGACCACC
TCGGCCTTTG CCAATGAACA CCCGGATCTG ACCGCGAAGC TCCTGAAGGT CATCAACGAC
GCCAATGACG CATGGAAGTC CGACCCCGAC AGCCTGCTGC CGCTGATCGC GAAGGACAGC
GGTCTGGACG AACAGGCGGC GAAGGACAAC CTTGCCACCT TCACCCTGCT GCCGATCGAC
GAGAAGCTGG GGCCGGACTG GCTGGGCGGC GGCGTCCAGA CCTATCTGAA GGAGGTGGCC
GATTTCTTCG TCTCGATCGG CAACATCCCT CAGGCCCTGC CCAGCTATGA CGCTGTGGTG
AAATCCGAAT TCCTTGCCGC AGCGGCCAAG GACTGA
 
Protein sequence
MKIRSVLAGA VSALGLLAAQ AQAEAPGEIT VAYFLEWPLP FEAAKSLGTF EKELGVKINW 
RAFDTGTAMS AAFASGDVQY AISQGVPPFV TAVSAGQDLI VIDAAVGYAE NDNCVVRSDL
EITRDNAKDL EGRKVGLPIG TAPHFGFLKV AEHLGIDVSK VQVVDIAPAE AAAAFAAGNF
DAVCGWGGPL RRMKEHGNVL LTGAEKEAIG IKIFDVVSTT SAFANEHPDL TAKLLKVIND
ANDAWKSDPD SLLPLIAKDS GLDEQAAKDN LATFTLLPID EKLGPDWLGG GVQTYLKEVA
DFFVSIGNIP QALPSYDAVV KSEFLAAAAK D