Gene Information |
Locus tag | RoseRS_0937 |
Symbol | |
ID | 5207883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 1155799 |
End bp | 1157559 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640594551 |
Product | extracellular solute-binding protein |
Protein accession | YP_001275296 |
Protein GI | 148655091 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
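The Start bp, End bp, Gene Length, and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. Both are assumptions here, since the record does not state its coordinate convention. A minimal sketch of that arithmetic, using illustrative variable names that are not part of the record:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1155799
end_bp = 1157559

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced | No")
# and ends with exactly one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1761
print(protein_length_aa)   # 586
```

Under these assumptions the computed values match the listed Gene Length of 1761 bp and Protein Length of 586 aa.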
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATAAGT CATCGTCGTC AATCCATACC CCTCGTTCTC TCAGACAGTT GTTCCAGTTG CTCAGCGCCA GCGTCTGGGT CGCTGCGACG ATACTTGTGC TGCTTGCAGC TTGCGGCGCG CCAGACCAGC CGGTTGCGAC CGGTGAAGCC TCGCCTACTT CTGCGCCCAC ATCAGCGCCC CCGACTCCTA CCGCGCTGCC GCGTGGCGGA AATCTCACCA TCCGTCTGTC CGACGATAGT GGCACATTGC AACCCTGGCA TCCGCGCACA CGCGGCGAAG AGCAGATCAT CGGGTTGATC TACAGTGGTC TGATGCGCCT GGACGCAACC CTGGCGCCGC AACCGGACCT GGCGACGGGT TGGGTAGCCT CAGCCGATGG ACGGGTCATC ACGTTCACCC TGCGCAGCGA TGCCGTCTGG CACGATGGGC GTCCGGTCAC GGTTGATGAT GTTGTCTTTA CGCTCGATGC ATTGCGCGCA CTGCCGCCGT CTACAGCGTT GCTCGCCGGG TTGCGGCGAA TTGTCGAAGT GACGGCGCCG GAGAGCGATA CGATTGTCAT CCGCCTCGAT GAACGCTACG CACCCATCTT CAGCCTGCTT ACAACGCCGG TGCTGCCGCG TCACCTGCTG ATCGGCAGAA ACCTGGCAGA ATTCAACGCC TGGGATGTGC CGTTTGGCAG CGGTCCCTTC CGTTTTGAGC AGCGCGAGCC GGGCGTGGCA ATCACACTGG CGGCGAATCA GGCATTCTAC CGGGGTGCGC CGCTCCTTGA TCGGGTGGTG TTCGTCATTG CGCCCGATGC ACAGGTGGCA GCGTCGGCGT TACAGGATGA ACGTTTGCTG CTGGCTGAAC TTCCCTGGAG CGTCGGCAGC GTCATAACCG AAACATCGCC GATGCTGCAA TGGGGCGCAT ATGCTGAGAA CGGGTATTAT TTTCTCGCCT TCAACCTGCG TTCCGACCGA ATCTTCAGCG ATCCGAGGCT GCGCGAGGCG CTGGCAGCGA CGATCGATCT GCCGCGCATT GTGCAGGAGG TGACCGATGG GCAGGGGATG CTCATCGGCA GCAGCGCCGC GCCGGGTTCC TGGGCAGATC TGACGCCGCC GCCAGCCGGA ACGGTCGATC TGGATCGCGC GCGGGCGCTG CTGGACGAAG CCGGATGGCG TCTGCCGCCG GAAGGCGCCA TTCGCCAGCG CGACGGCGTG CCGCTGACGG TGCAACTCTT CGTGCGCGCC GACGATCCGC GTCGGGTGCG CGCCGCCGAG TTGATTGCCA GCGCTGCCGA ACAGATCGGG ATGGATATTG TGGTGCAGCC TGCCGATTTT GCCACCGTCA TTCGCTCGAA ATATGCGCCG CCCTACGACT TCGATCTGCT GATCGGCAGT TGGGTCAATG GCGTCGCCGA TCCGGATTTC GCCGACTACG CTTTCTACGA CCCGGACGAT TTTGCGCTCT TTCATTCGAG TCAGATCAAC CAGGGGCTGG CGGATACACG TCCGACGCTC AACTTTGTCG GGTTCAGCGA TCCGATCTAC GATAATCAGG CGGGTGCGGC ACGGCAGTTA TACGATCTGA GCGAACGGGC GCAGGCGATC CAGCGTGCCC AGGAACGGGT GGCGCTTCTA CGCCCATATC TGTTTCTCTG GGCGGATCGA ATAGCGGTAG TGTGCAATCC GCGGGTAAAA ACGCCGGACG GACCGGTCAC CCTGATGACG CCCAACTATA TGTGGAATAT CGAGCGCTGG TATGTCGAGG TGGAAGGTTG A
Protein sequence | MNKSSSSIHT PRSLRQLFQL LSASVWVAAT ILVLLAACGA PDQPVATGEA SPTSAPTSAP PTPTALPRGG NLTIRLSDDS GTLQPWHPRT RGEEQIIGLI YSGLMRLDAT LAPQPDLATG WVASADGRVI TFTLRSDAVW HDGRPVTVDD VVFTLDALRA LPPSTALLAG LRRIVEVTAP ESDTIVIRLD ERYAPIFSLL TTPVLPRHLL IGRNLAEFNA WDVPFGSGPF RFEQREPGVA ITLAANQAFY RGAPLLDRVV FVIAPDAQVA ASALQDERLL LAELPWSVGS VITETSPMLQ WGAYAENGYY FLAFNLRSDR IFSDPRLREA LAATIDLPRI VQEVTDGQGM LIGSSAAPGS WADLTPPPAG TVDLDRARAL LDEAGWRLPP EGAIRQRDGV PLTVQLFVRA DDPRRVRAAE LIASAAEQIG MDIVVQPADF ATVIRSKYAP PYDFDLLIGS WVNGVADPDF ADYAFYDPDD FALFHSSQIN QGLADTRPTL NFVGFSDPIY DNQAGAARQL YDLSERAQAI QRAQERVALL RPYLFLWADR IAVVCNPRVK TPDGPVTLMT PNYMWNIERW YVEVEG
| |
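The gene sequence above is printed in space-separated blocks of ten bases, begins with GTG, and ends with TGA, while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal, and plant plastid code), in which GTG is an accepted alternative start codon and is read as methionine when it initiates translation. A minimal sketch for cleaning such a blocked sequence, computing its GC percentage, and checking the start and stop codons follows. The function names are hypothetical, and the start-codon set is only the commonly used subset of table 11 initiators, not the full list.

```python
# Commonly used start codons under translation table 11 (a subset, by assumption).
COMMON_TABLE11_STARTS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks between blocks and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Return the percentage of bases that are G or C."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(blocked: str) -> bool:
    """Check length divisible by 3, a common start codon, and a terminal stop codon."""
    seq = clean_sequence(blocked)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_TABLE11_STARTS
        and seq[-3:] in STOP_CODONS
    )


# Usage with a short made-up sequence; paste the full "Gene sequence" field
# from the record in its place to compare against the listed GC content.
toy = "GTGAATAAGT GA"
print(clean_sequence(toy))
print(looks_like_complete_cds(toy))
```

Passing the full gene sequence field to `gc_percent` gives a value that can be compared with the GC content listed in the record, and `looks_like_complete_cds` offers a quick sanity check that a pasted sequence was not truncated.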