Gene RoseRS_0907 details

Gene Information

Locus tag: RoseRS_0907
Symbol:
ID: 5207853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 1122166
End bp: 1123416
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 66%
IMG OID: 640594521
Product: sterol 3-beta-glucosyltransferase
Protein accession: YP_001275266
Protein GI: 148655061
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
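
The start, end, gene length, and protein length values in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start and end values taken from the record.
length = gene_length(1122166, 1123416)
print(length, protein_length(length))  # prints "1251 416"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.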


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.499452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00108162
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCCGGA CGATCACGCT CCTGGTGAGC GGGACGCTGG GCGATGTCCG ACCGCTGGTT 
GCGCTCGGCG TCGGGTTGCG TAACGCCGGT TATGTCGTTC GAGTGGCGAC TCACGCGCAC
TATGCGCCAC TCGCTCAGGC GCACGGTCTG CTGTGGAGGT GCGTCGAAGG CAATCCCAGC
GATCTGCTCC GCTCCGACGA TGCGGCGCTG ACTCTCGATC GGGGGGCGCT GCGCGGCGCT
GCTGCGACGC TGCGGTACAT TTGCCGGGCA CAGGCAGTGT ATGCGCGCAT GATCGATTCG
GCAACCGAAG CGTGCCGCGA AAGTGATGCG TTGATCGTGT CGCTGGCAAG TTGCTGGGGG
CAACTGATCG CCACCGCACT CGAACTGCCG TGCATCTGGG CGCCGCTGCA ACCGATCACG
TCCACTGCAC GCTTTTCGTC GCCACTGTTG CCCATACACC ATCGTCTGGC GCGCCTGAGT
TATTCCATCG TCGAACTGAC CACGTGGCTC CCGTGGCGCA CTGTGCTGCG CCGATGGCAA
TTGCGCGCGC CCGGTCCGCG CCACGCGCCG CTCGACCCCT TCGCGCAGGC GCGGCAGTCG
CGCGCACCGT TCATCTACGG CTTCAGTCCC AACGTTGTGC CGACGCCCGA TGACTGGTCA
CCGCACCATA CCGTTGCCGG CTACTGGTTT CTCGACGATC CCAACGAACG CCTGTCGTCT
GAAATTGCCG ACTTCCTGAC GAATGGCGAT CCGCCAGTTG CTATCGGTTT TGGCAGCATG
AGCGGGCGGC GACCGCATGA CGACGCTGTT CTGGCGATAA CGGCGCTGAC CCTGGCACAA
CGGCGTGGCA TTCTGATTGG CGCACCAGAA GCAGTGCGCC TGGTAACCGG TCGCCGCGAC
ATCCTGGTTG TGCCGTATGT GCCGCACCAT CTGCTCTTCC CGCACGTCGC CGTCGCCGTC
CACCACGGCG GCGCTGGCGC GACCGCCGCC AGTTTGCGCG CCGGTGTCCC AACCGTAACG
ATACCGGTCG GCATCGACCA GTTTTTCTGG GGGAGGCGTG TCGCCGCACT GGGAGCAGGA
CCGCCACCGC TGCCACGTCG CCGCGCAACG CCAGACCGCC TGGCATCAGC GCTTGTCGCC
GCAACAGACG ACGCGATCCG GGTGCGCGCC GCCGCGCTTG GGCGCCTGAT CCGCGCCGAA
CAGGGCGTGA CGCGCGCCGT TGAAACGATC AGCGCCTGTC TGGGGTGGTA G
 
Protein sequence
MRRTITLLVS GTLGDVRPLV ALGVGLRNAG YVVRVATHAH YAPLAQAHGL LWRCVEGNPS 
DLLRSDDAAL TLDRGALRGA AATLRYICRA QAVYARMIDS ATEACRESDA LIVSLASCWG
QLIATALELP CIWAPLQPIT STARFSSPLL PIHHRLARLS YSIVELTTWL PWRTVLRRWQ
LRAPGPRHAP LDPFAQARQS RAPFIYGFSP NVVPTPDDWS PHHTVAGYWF LDDPNERLSS
EIADFLTNGD PPVAIGFGSM SGRRPHDDAV LAITALTLAQ RRGILIGAPE AVRLVTGRRD
ILVVPYVPHH LLFPHVAVAV HHGGAGATAA SLRAGVPTVT IPVGIDQFFW GRRVAALGAG
PPPLPRRRAT PDRLASALVA ATDDAIRVRA AALGRLIRAE QGVTRAVETI SACLGW
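
The sequences in this record are printed in space-separated blocks of ten characters. Below is a minimal sketch of how that layout could be normalized into a contiguous string and how a GC fraction (the quantity reported in the GC content field) could be computed from it. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record, as formatted on the page.
first_line = (
    "ATGCGCCGGA CGATCACGCT CCTGGTGAGC GGGACGCTGG GCGATGTCCG ACCGCTGGTT"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence would give a value to compare against the reported GC content.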