Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0907 |
Symbol | |
ID | 5207853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 1122166 |
End bp | 1123416 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640594521 |
Product | sterol 3-beta-glucosyltransferase |
Protein accession | YP_001275266 |
Protein GI | 148655061 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.499452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00108162 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCCGGA CGATCACGCT CCTGGTGAGC GGGACGCTGG GCGATGTCCG ACCGCTGGTT GCGCTCGGCG TCGGGTTGCG TAACGCCGGT TATGTCGTTC GAGTGGCGAC TCACGCGCAC TATGCGCCAC TCGCTCAGGC GCACGGTCTG CTGTGGAGGT GCGTCGAAGG CAATCCCAGC GATCTGCTCC GCTCCGACGA TGCGGCGCTG ACTCTCGATC GGGGGGCGCT GCGCGGCGCT GCTGCGACGC TGCGGTACAT TTGCCGGGCA CAGGCAGTGT ATGCGCGCAT GATCGATTCG GCAACCGAAG CGTGCCGCGA AAGTGATGCG TTGATCGTGT CGCTGGCAAG TTGCTGGGGG CAACTGATCG CCACCGCACT CGAACTGCCG TGCATCTGGG CGCCGCTGCA ACCGATCACG TCCACTGCAC GCTTTTCGTC GCCACTGTTG CCCATACACC ATCGTCTGGC GCGCCTGAGT TATTCCATCG TCGAACTGAC CACGTGGCTC CCGTGGCGCA CTGTGCTGCG CCGATGGCAA TTGCGCGCGC CCGGTCCGCG CCACGCGCCG CTCGACCCCT TCGCGCAGGC GCGGCAGTCG CGCGCACCGT TCATCTACGG CTTCAGTCCC AACGTTGTGC CGACGCCCGA TGACTGGTCA CCGCACCATA CCGTTGCCGG CTACTGGTTT CTCGACGATC CCAACGAACG CCTGTCGTCT GAAATTGCCG ACTTCCTGAC GAATGGCGAT CCGCCAGTTG CTATCGGTTT TGGCAGCATG AGCGGGCGGC GACCGCATGA CGACGCTGTT CTGGCGATAA CGGCGCTGAC CCTGGCACAA CGGCGTGGCA TTCTGATTGG CGCACCAGAA GCAGTGCGCC TGGTAACCGG TCGCCGCGAC ATCCTGGTTG TGCCGTATGT GCCGCACCAT CTGCTCTTCC CGCACGTCGC CGTCGCCGTC CACCACGGCG GCGCTGGCGC GACCGCCGCC AGTTTGCGCG CCGGTGTCCC AACCGTAACG ATACCGGTCG GCATCGACCA GTTTTTCTGG GGGAGGCGTG TCGCCGCACT GGGAGCAGGA CCGCCACCGC TGCCACGTCG CCGCGCAACG CCAGACCGCC TGGCATCAGC GCTTGTCGCC GCAACAGACG ACGCGATCCG GGTGCGCGCC GCCGCGCTTG GGCGCCTGAT CCGCGCCGAA CAGGGCGTGA CGCGCGCCGT TGAAACGATC AGCGCCTGTC TGGGGTGGTA G
|
Protein sequence | MRRTITLLVS GTLGDVRPLV ALGVGLRNAG YVVRVATHAH YAPLAQAHGL LWRCVEGNPS DLLRSDDAAL TLDRGALRGA AATLRYICRA QAVYARMIDS ATEACRESDA LIVSLASCWG QLIATALELP CIWAPLQPIT STARFSSPLL PIHHRLARLS YSIVELTTWL PWRTVLRRWQ LRAPGPRHAP LDPFAQARQS RAPFIYGFSP NVVPTPDDWS PHHTVAGYWF LDDPNERLSS EIADFLTNGD PPVAIGFGSM SGRRPHDDAV LAITALTLAQ RRGILIGAPE AVRLVTGRRD ILVVPYVPHH LLFPHVAVAV HHGGAGATAA SLRAGVPTVT IPVGIDQFFW GRRVAALGAG PPPLPRRRAT PDRLASALVA ATDDAIRVRA AALGRLIRAE QGVTRAVETI SACLGW
|
| |