Gene Rsph17029_3889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3889 
Symbol 
ID4898333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1020468 
End bp1021475 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content67% 
IMG OID640114493 
Productnitrate/sulfonate/bicarbonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001045740 
Protein GI126464627 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.475299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.125658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC ACAAGACACT GACGCCCTGG GCGGCGGCGC TTCTGCTCGC CTCGACGGCC 
TGGCCCGCGC TGGCCGAGGT TTCGACGGTG CGCCTCGCCA AGCAGTTCGG CATCGGCTAC
CTGCCGCTCA CGCTGGTCGA GGAGCTGGAC CTGCTCGAGA AGCACGCGGC CGCCTCGGGC
CACGAGATCA CGACCGAATG GCTGCGCTTC ACCGGCGGCT CGGGCATGAA CGAGGCGCTG
CTGTCGGGCA ACCTCGATCT GGCGGCGGGC GGCACCGGGC CGCTCTTCAC CATCTGGGCC
CGCACGCGCG AGAACCTGAA GATCAAGGGC GTGGCGGCGC TGGCCTCGAT GCCGCTGCAT
CTGATGACCT CGAACCCCGA GGTGAAGACG CTGGCCGACT TCGGACAGGG CGACAAGATC
GCCCTGCCCG CCGTCAAGAC CTCGATCCAG GCCGTCACGC TGCAGATGGC CTCCAAGCAG
GCCTTCGGGG CGGACAAGGC CACCGCCATG GATGCCTTCA CCGTTTCGAT GGGCCATCCC
GACGCGCAGC TCGCGCTGAC CGGCGGGCAG TCCGAAGTGA CGGCGCATTT CGGCTCGCCG
CCGTTCCAGA ACCTCGAGGC CAAGGTCGAG GGCATCCACA AGGTGCTCGA CAGCTATGAC
GTGCTCGGCG GCTCGCACAC CTTCACCGTG GTCTGGGCGG CCGACAAGTT CATCTCGGAG
AACCCCGAGA TCACCAAGGC CTTCATGGCG GCGCTCGAGG AAAGCATGGA GCTGATCCGC
ACCGACCCCG AGAAGGCGGC CGAGATCTGG ATGGCGGCCG AGCGCAGCCC TCTGAGCCGG
GAAGAGGTCG TGGCGCTGAT CCAGGACGAG CAGACCGTCT GGACCACCAC GCCCGAGCGC
ACCCTGCCCT ATGTCGAGTT CCTGAGCGAG TCCGGCCTCA TCAAGACCTC GGCCGAGGAC
TGGAGCGAGA TCTTCTTCGA CACGATGTCG GGCAAGGAGG GAAGCTGA
 
Protein sequence
MKIHKTLTPW AAALLLASTA WPALAEVSTV RLAKQFGIGY LPLTLVEELD LLEKHAAASG 
HEITTEWLRF TGGSGMNEAL LSGNLDLAAG GTGPLFTIWA RTRENLKIKG VAALASMPLH
LMTSNPEVKT LADFGQGDKI ALPAVKTSIQ AVTLQMASKQ AFGADKATAM DAFTVSMGHP
DAQLALTGGQ SEVTAHFGSP PFQNLEAKVE GIHKVLDSYD VLGGSHTFTV VWAADKFISE
NPEITKAFMA ALEESMELIR TDPEKAAEIW MAAERSPLSR EEVVALIQDE QTVWTTTPER
TLPYVEFLSE SGLIKTSAED WSEIFFDTMS GKEGS