Gene Rsph17025_4079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4079 
Symbol 
ID5086252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp131710 
End bp133167 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content69% 
IMG OID640485642 
Productsulfate ABC transporter, periplasmic sulfate-binding protein 
Protein accessionYP_001170236 
Protein GI146280079 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase 
TIGRFAM ID[TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.237943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0279439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGAC TGTCTTCTGT CAAGAGCTGG CTCTACCATC TGGGCGATAT AGACGCCCGG 
CGCGCGCAGG TGATCGGCGC CTCGAACGCC GATCTCGTCG TGACCGAATG GGCCAGCTAC
CGGGACGGCG AGGCCCCCTA CGGGCGCGCC CTGCTCGACC GGATGCGCGG CGGCGATCCC
GACCGGCTGA TCGTCAGCTA TCTCTCGATC GGCGAGGCCG AGGATTACCG CTACTACTGG
AAGGACAGCT GGGCGAAGAC GCCGCCGCAC TGGCTCGGGG CGGAAAATCC CGAATGGGCC
GGCAACATGA AGGTCCGCTA CTGGGAGGCG GGCTGGCAAA AAATCGTGCT CGGCTATCTC
GACCGGATCA TCGACCGGGG CTTCGACGGC GTCTATCTCG ACATCATCGA CGCCTTCGAG
TTCTGGGAGG AGACGGCGCC CCGCTCGGGG ATCGACTACC GTCAGGAAAT GGCGGATTTC
GTCCTGCTGC TGCGCACCCA TGCGCTCGAA CGGCTGGCAA AGGTCGACCC GGACCGGGAT
TTCGTGATCC TCGGGCAGAA CGGGCTTGAT CTGATCGGCA ATGCCACCTA CCGGGCCGCG
GTCGATGGCG TGGCCGCCGA GGATGTGCGC TTCCATTATC CTAACGGCCG GCCGAAGAGC
TTCACGCCCC AGGACGATGG CGAGGCCGCT TGGGCGCTGC AGCAGCTTCG GCGCGCCGAG
CGGGCGGGGA TCGAGACCTT CGTGGTGGAA TATGTTCCGC CCGCCGCGCG GGCCGCGGCC
GCGGGGCCGC TGGCGGGTCT GGCGAGCGAA ATGACGACGA TGGGCAGCCG GCTGTTCGTG
GCGGCCAACC GGGATCTGGA CGGGCTGCCG GCCCAGCCCC GGGCGGCCTT TGGCGGGCTT
TTTCCGACCT TCGGACCGGA GGCACCCGAT CCTAGGCCGC TTTCGGGCAC CTCCCGGCCC
GACCGGCTGA CCGGCGGTGC GGGCCCCGAG AGGATCTCGG GGGGGGCGGG CCACGACAGG
CTGGACGGCC GCGGCGGCCG GGACGTGCTG CAGGGCGGCA CGGGCGACGA CCTCCTGCGC
GGAGGACCGG GGGACGACGG GCTGTTCGGC GGTGCGGGCC GCGATCGGCT GGAGGGTGGG
GCGGGCCACG ACAGGCTTTA TGGCGGGGCC GGTAACGACG TGCTTCTCGG CGGTGCTGGA
GATGATGTGC TGCGTGGTCA TGCGGGGGGC GACCGGCTTC ACGGCGGTGC GGGGGCCGAT
GTCTTCGTCT ACCGCAGGGG GGACGGCGGG GATCTGATCC TCGACTTCAA CCGCGCCCAT
GACCTGATCG ACCTGCCGCT GCATCTGGAT CACCGGATGC GCGCCGTCGC GGGGGACACG
CTGATCGACT TCGGCGGCGG GGACCGGCTG ACGGTGCGCG GGATCCTGCC CGACGCCCTC
GACGATTTCC TGATCTGA
 
Protein sequence
MSRLSSVKSW LYHLGDIDAR RAQVIGASNA DLVVTEWASY RDGEAPYGRA LLDRMRGGDP 
DRLIVSYLSI GEAEDYRYYW KDSWAKTPPH WLGAENPEWA GNMKVRYWEA GWQKIVLGYL
DRIIDRGFDG VYLDIIDAFE FWEETAPRSG IDYRQEMADF VLLLRTHALE RLAKVDPDRD
FVILGQNGLD LIGNATYRAA VDGVAAEDVR FHYPNGRPKS FTPQDDGEAA WALQQLRRAE
RAGIETFVVE YVPPAARAAA AGPLAGLASE MTTMGSRLFV AANRDLDGLP AQPRAAFGGL
FPTFGPEAPD PRPLSGTSRP DRLTGGAGPE RISGGAGHDR LDGRGGRDVL QGGTGDDLLR
GGPGDDGLFG GAGRDRLEGG AGHDRLYGGA GNDVLLGGAG DDVLRGHAGG DRLHGGAGAD
VFVYRRGDGG DLILDFNRAH DLIDLPLHLD HRMRAVAGDT LIDFGGGDRL TVRGILPDAL
DDFLI