Gene Rsph17025_3355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3355 
Symbol 
ID5085846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp234339 
End bp235385 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content65% 
IMG OID640484924 
Producthypothetical protein 
Protein accessionYP_001169541 
Protein GI146279383 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0675521 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTC TGCTGACGTC GGCGGCGCTG GCGCTGACGG CTCTGGCCGC TCCGGCCGTG 
GCGCAAGACA TCGTGGATGT GTCGAAGGTC AACCAGGACC TGATCGCCAC CGCCGACGGC
AAGGAATACA GCATTGCCAC CGTGGTGAAG GTGGACGGCA TCGCCTGGTT CGACCGGATG
CGCGACGGCA TCGACCAGTT CAAGGGCGAC ACCGGCCATG ACGTCTGGAT GGTCGGCCCG
AGCCAGGCGG ACGCCGCGGC GCAGGTGCAG CTGATCGAGA ACCTGATCGC GCAGGGGGTC
GATGCGATCT GCGTGGTGCC CTTCTCGGTC GAGGCGGTGG AGCCGGTGCT GAAGAAGGCG
CGTGACCGCG GCATCGTGGT CATCACCCAC GAGGCCTCGA ACATCCAGAA CACCGACTTC
GACCTCGAGG CGTTCGACAA CCTCGCCTAT GGCGCGAACC TGATGAAGGA ACTCGCCAAA
TCCATGGGCG AGAAGGGTCA GTATGTCGCC ACCGTCGGCT CGCTCACCTC GAAGAGCCAG
ATGGAATGGA TCGACGGCGC GGTGGCCTAC CAGAAGGAGA ACTACCCCGA GATGAGCCTC
GTGGGTGATC GTCTGGAAAC CGCCGACGAT GCGGCCATCG ACTACACCAA GCTCAAGGAA
GCGATGACCA CCTACCCCGA CATCACCGGG ATCCTCGGCG CGCCGATGCC GACCTCGGCC
GGGGCGGGCC GCCTGATCGC CGAGAGCGGG CTGAAGGACA AGGTCTTCTT TGCCGGCACC
GGCCTGCCGT CGGTGGCGGG CGAATACCTC CAGAACGGCG ACATCCAGTA CATCCAGTTC
TGGGATCCGG CGGTTGCGGG CTATGCGATG AACATGCTGG CCGTGGCGGT GCTCGAGGGC
CGGAAGGACG AGATCAAGCC GGGCCTGAAC CTCGGCCTCA CCGGCTATGA GGATCTCACC
GCGCCGGACG AGGCCAACCC GCATCTGCTC TATGGCGCGG GCTGGGTCGG CGTGACGAAG
GACAACATGG CCGACTACGA CTTCTGA
 
Protein sequence
MKILLTSAAL ALTALAAPAV AQDIVDVSKV NQDLIATADG KEYSIATVVK VDGIAWFDRM 
RDGIDQFKGD TGHDVWMVGP SQADAAAQVQ LIENLIAQGV DAICVVPFSV EAVEPVLKKA
RDRGIVVITH EASNIQNTDF DLEAFDNLAY GANLMKELAK SMGEKGQYVA TVGSLTSKSQ
MEWIDGAVAY QKENYPEMSL VGDRLETADD AAIDYTKLKE AMTTYPDITG ILGAPMPTSA
GAGRLIAESG LKDKVFFAGT GLPSVAGEYL QNGDIQYIQF WDPAVAGYAM NMLAVAVLEG
RKDEIKPGLN LGLTGYEDLT APDEANPHLL YGAGWVGVTK DNMADYDF