Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_4079 |
Symbol | |
ID | 5086252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009430 |
Strand | + |
Start bp | 131710 |
End bp | 133167 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640485642 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_001170236 |
Protein GI | 146280079 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase |
TIGRFAM ID | [TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.237943 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.0279439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGAC TGTCTTCTGT CAAGAGCTGG CTCTACCATC TGGGCGATAT AGACGCCCGG CGCGCGCAGG TGATCGGCGC CTCGAACGCC GATCTCGTCG TGACCGAATG GGCCAGCTAC CGGGACGGCG AGGCCCCCTA CGGGCGCGCC CTGCTCGACC GGATGCGCGG CGGCGATCCC GACCGGCTGA TCGTCAGCTA TCTCTCGATC GGCGAGGCCG AGGATTACCG CTACTACTGG AAGGACAGCT GGGCGAAGAC GCCGCCGCAC TGGCTCGGGG CGGAAAATCC CGAATGGGCC GGCAACATGA AGGTCCGCTA CTGGGAGGCG GGCTGGCAAA AAATCGTGCT CGGCTATCTC GACCGGATCA TCGACCGGGG CTTCGACGGC GTCTATCTCG ACATCATCGA CGCCTTCGAG TTCTGGGAGG AGACGGCGCC CCGCTCGGGG ATCGACTACC GTCAGGAAAT GGCGGATTTC GTCCTGCTGC TGCGCACCCA TGCGCTCGAA CGGCTGGCAA AGGTCGACCC GGACCGGGAT TTCGTGATCC TCGGGCAGAA CGGGCTTGAT CTGATCGGCA ATGCCACCTA CCGGGCCGCG GTCGATGGCG TGGCCGCCGA GGATGTGCGC TTCCATTATC CTAACGGCCG GCCGAAGAGC TTCACGCCCC AGGACGATGG CGAGGCCGCT TGGGCGCTGC AGCAGCTTCG GCGCGCCGAG CGGGCGGGGA TCGAGACCTT CGTGGTGGAA TATGTTCCGC CCGCCGCGCG GGCCGCGGCC GCGGGGCCGC TGGCGGGTCT GGCGAGCGAA ATGACGACGA TGGGCAGCCG GCTGTTCGTG GCGGCCAACC GGGATCTGGA CGGGCTGCCG GCCCAGCCCC GGGCGGCCTT TGGCGGGCTT TTTCCGACCT TCGGACCGGA GGCACCCGAT CCTAGGCCGC TTTCGGGCAC CTCCCGGCCC GACCGGCTGA CCGGCGGTGC GGGCCCCGAG AGGATCTCGG GGGGGGCGGG CCACGACAGG CTGGACGGCC GCGGCGGCCG GGACGTGCTG CAGGGCGGCA CGGGCGACGA CCTCCTGCGC GGAGGACCGG GGGACGACGG GCTGTTCGGC GGTGCGGGCC GCGATCGGCT GGAGGGTGGG GCGGGCCACG ACAGGCTTTA TGGCGGGGCC GGTAACGACG TGCTTCTCGG CGGTGCTGGA GATGATGTGC TGCGTGGTCA TGCGGGGGGC GACCGGCTTC ACGGCGGTGC GGGGGCCGAT GTCTTCGTCT ACCGCAGGGG GGACGGCGGG GATCTGATCC TCGACTTCAA CCGCGCCCAT GACCTGATCG ACCTGCCGCT GCATCTGGAT CACCGGATGC GCGCCGTCGC GGGGGACACG CTGATCGACT TCGGCGGCGG GGACCGGCTG ACGGTGCGCG GGATCCTGCC CGACGCCCTC GACGATTTCC TGATCTGA
|
Protein sequence | MSRLSSVKSW LYHLGDIDAR RAQVIGASNA DLVVTEWASY RDGEAPYGRA LLDRMRGGDP DRLIVSYLSI GEAEDYRYYW KDSWAKTPPH WLGAENPEWA GNMKVRYWEA GWQKIVLGYL DRIIDRGFDG VYLDIIDAFE FWEETAPRSG IDYRQEMADF VLLLRTHALE RLAKVDPDRD FVILGQNGLD LIGNATYRAA VDGVAAEDVR FHYPNGRPKS FTPQDDGEAA WALQQLRRAE RAGIETFVVE YVPPAARAAA AGPLAGLASE MTTMGSRLFV AANRDLDGLP AQPRAAFGGL FPTFGPEAPD PRPLSGTSRP DRLTGGAGPE RISGGAGHDR LDGRGGRDVL QGGTGDDLLR GGPGDDGLFG GAGRDRLEGG AGHDRLYGGA GNDVLLGGAG DDVLRGHAGG DRLHGGAGAD VFVYRRGDGG DLILDFNRAH DLIDLPLHLD HRMRAVAGDT LIDFGGGDRL TVRGILPDAL DDFLI
|
| |