Gene Rsph17029_3465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3465 
Symbol 
ID4898119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp541083 
End bp542129 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content65% 
IMG OID640114062 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001045330 
Protein GI126464217 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA TGCTGACGTC AGCGGCGCTG GCGCTGTCGG TTCTGGCCGC GCCCGCCGTG 
GCTCAGGACA TCGTCGACGT GTCCAAGGTC AACAAGGAGC TGATCACGAC GAAGGACGGC
AAGGAATATT CCATCGCCAC CGTCGTGAAG GTGGACGGCA TCGCCTGGTT CGACCGGATG
CGCGACGGGA TCACCCAGTT CCAGAAGGAC ACCGGCCACG ACGTCTGGAT GGTGGGCCCG
AGCCAGGCCG ACGCGGCGGC GCAGGTGCAG CTGATCGAGA ACCTGATCGC GCAGGGCGTG
GATGCGATCT GCGTGGTGCC CTTCTCGGTC GAGGCGGTGG AGCCGGTGCT GAAGAAGGCG
CGCGACCGGG GCATCGTGGT CATCACCCAC GAGGCGTCGA ACATCCAGAA CACCGACTTC
GACCTCGAGG CCTTCGACAA CCTCGCCTAT GGCGCGAACC TGATGAAGGA ACTCGCCAAA
TCCATGGGCG AGAAGGGCAA ATACGTCGCC ACCGTCGGCT CGCTCACCTC GAAGAGCCAG
ATGGAATGGA TCGACGGCGC GGTGGCCTAC CAGAAGGAGC ATTACCCCGA GATGAGCCTC
GTGGGCGACC GTCTGGAGAC CGCCGACGAC GCGGCCATCG ACTATACCAA GCTCAAGGAA
GCGATGACCA CCTATCCCGA CATCACGGGG ATCCTCGGCG CGCCGATGCC GACCTCGGCC
GGGGCCGGTC GCCTGATCGC CGAGAGCGGG CTGAAGGACA AGGTCTTCTT CGCGGGCACG
GGCCTGCCGT CGGTCGCGGG CGAATATCTG CAGAACGGCG ACATCCAGTA TATCCAGTTC
TGGGATCCGG CGGTCGCGGG CTATGCGATG AACATGCTGG CCGTGGCGGC CCTCGAGGGC
AAGAAGGACG AGATCAAGCC GGGCCTGAAC CTCGGCCTCG CGGGCTACGA GGAGTTGACC
GCGCCGGACG CGGCCAACCC GCATCTGCTC TATGGCGCAG GCTGGGTCGG CGTGACCAAG
GACAACATGG CCGACTACGA CTTCTGA
 
Protein sequence
MKIMLTSAAL ALSVLAAPAV AQDIVDVSKV NKELITTKDG KEYSIATVVK VDGIAWFDRM 
RDGITQFQKD TGHDVWMVGP SQADAAAQVQ LIENLIAQGV DAICVVPFSV EAVEPVLKKA
RDRGIVVITH EASNIQNTDF DLEAFDNLAY GANLMKELAK SMGEKGKYVA TVGSLTSKSQ
MEWIDGAVAY QKEHYPEMSL VGDRLETADD AAIDYTKLKE AMTTYPDITG ILGAPMPTSA
GAGRLIAESG LKDKVFFAGT GLPSVAGEYL QNGDIQYIQF WDPAVAGYAM NMLAVAALEG
KKDEIKPGLN LGLAGYEELT APDAANPHLL YGAGWVGVTK DNMADYDF