Gene Rsph17025_3352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3352 
Symbol 
ID5085843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009429 
Strand
Start bp230757 
End bp231767 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content69% 
IMG OID640484921 
Producthypothetical protein 
Protein accessionYP_001169538 
Protein GI146279380 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.913341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.18018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCGG ATACCGAAAC CCGCCCCCTG CTGAGCCGCC TTCTCGGGCG CGACGCGGCG 
GTGGCGCAGC TCATCGTCAT CACGCTGCTG ATCTTCGTCG CGATGACCGC GCTGAACCCC
GACAAGTTCC TGCGCTACTA CAATTTCGAG TCGCTCACCT ACATCGCGCC GGAACTCGGC
ATCCTCTCCA TCGCCATGAT GATCGCGATG CTGACCGGGG GGATCGACCT TTCCATCGTG
GGGGTGGCGA ACCTCGCGGC CATCGTGGCG GGGGTCTATT TCCGCCTCCC GGCCGTGAGC
GAGGCCGCCG CGGCGGGCGG TGCGGGGCTC GTGCTGCACA CCTCGGCCGC CGTGCTGATC
GCGCTGACGG TCGGGCTCGC CGCAGGGGGG CTGAACGGGG TGCTGATCGC GCGCCTCCGC
ATCATCCCGA TCCTCGCGAC GCTCGGCACG GGCCAGATCT TCGCGGGCCT CGCGCTCGTC
CTGACGGGGG GGCCGGCCAT CACGGGCTTC CCCGAGACCT GGGCCTGGAT CGGGACCGGC
AAGATCCTGG GCCTGGCCAC CCCGCTCTGG GTGCTGATCG TGGTGGCGGG GCTGGTGGCG
GTCCTGCTCG CGCGGACCAC GCTGGGGGTG AACCTGATGC TGATGGGCAC CAACCCGCGC
GCCGCCGTCT TTGCCGGCAT CCGGTCGGGG CGGATGATCC TCTACAGCTA CATGCTGACG
GGGATGCTGG CCGCCGTGGC GGGGGTGCTC CTGTCGGGGC GGACCAACTC GGCCAAGGCG
GATTTCGGCG CCTCCTACCT GCTGCAGGCG GTGCTGATCG CGGTCCTGGG CGGCACCAAT
CCCGCCGGCG GCAAGGGCCG GGTGCTGGGC GTGCTGCTCG CGCTCGTGGC GCTCATGCTG
CTCTCGTCGG GCCTTCAGAT CATGCGGGTG TCGAACTTCC TGATCGACGC CGTCTGGGGC
GCCTTCCTCG TCATCGTCAT TGCCATCAAC TATCTGAGGT CACGGAAATG A
 
Protein sequence
MASDTETRPL LSRLLGRDAA VAQLIVITLL IFVAMTALNP DKFLRYYNFE SLTYIAPELG 
ILSIAMMIAM LTGGIDLSIV GVANLAAIVA GVYFRLPAVS EAAAAGGAGL VLHTSAAVLI
ALTVGLAAGG LNGVLIARLR IIPILATLGT GQIFAGLALV LTGGPAITGF PETWAWIGTG
KILGLATPLW VLIVVAGLVA VLLARTTLGV NLMLMGTNPR AAVFAGIRSG RMILYSYMLT
GMLAAVAGVL LSGRTNSAKA DFGASYLLQA VLIAVLGGTN PAGGKGRVLG VLLALVALML
LSSGLQIMRV SNFLIDAVWG AFLVIVIAIN YLRSRK