Gene Rcas_3770 details


Gene Information

Locus tag: Rcas_3770
Symbol:
ID: 5541272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4942487
End bp: 4943746
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 66%
IMG OID: 640895880
Product: sterol 3-beta-glucosyltransferase
Protein accession: YP_001433827
Protein GI: 156743698
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
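
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon. Both assumptions are consistent with the numbers in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumed not counted).
    return gene_len_bp // 3 - 1


length = gene_length_bp(4942487, 4943746)
print(length, protein_length_aa(length))  # prints: 1260 419
```

Applied to this record, the sketch reproduces the listed values of 1260 bp and 419 aa.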


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.190341
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCCAA CCATCACGAT CCTGGCGAGC GGCACGCTGG GGGACGTGCG CCCACTGGCG 
GCGCTCGGCA AAGGTTTGCA CGATGCTGGC TTCGCTGTTG CGCTTGCAAC CCATCCTCAG
TTTGCGCCTC TGGTTCAGGC GCAGGGTCTG GCGTTTCGCA GCATCGGGGG CAATCCCAGT
GATCTGCTTC TTCATGATGA TGCGGCGCTG ACCTTCGATG GCGGAGTAGG GCGTGGCGTG
GTCGCAACAC TCCGTTATAT TCGGTCGGCG CAGGCGATCT ATGCTCGCAT GCTGGACGCG
GCAGCGACCG CATGTTACGG GAGCGCCCTG ATCATTGTGT CGCTGGCAAG TTGCTGGGGG
CAACTTATTG CGACGACGTT CGGCATACCC TGTGTCTGGG CGCCGTTGCA GCCCGTCACG
CTAACGATCC GCTTTCCATC GCCGCTGCTG CCGGTGACAT TAAGCCTGGG CGCGCGCGCC
CGCCGTCTGA GTTATACGGC TGTCGAACTG GCGACCTGGC TGCCATGGCG AACCGTATTC
CATCGCTGGC GGGCGCGCGC GCCTGGTCCG CGCCACATGT CGCTCGACCC CTTTGCTCTA
GCGTGCACAT CGAGTGCGCC CTTCGTCTAC GGGTTCAGCC CGCATGTCGT GCCACCGCCT
GACGACTGGC CGCCACATCA TATGGTGACC GGCTACTGGT TCCTCGACCA TCCGGCTGAA
CGCCTGGCGC CGGAGATTGA GTCGTTCCTT GCAGCTGGCG ATCCGCCGAT TGTCATCGGT
TTTGGCAGCA TGGGCGGTCG GCGACCGCGC GATGATGCGG CGCTGGCGCT GGAAGCGCTG
CGCCTGGCGC AGCGCCGCGG CATTCTCTTC GGTTCAGCCG ACGTCGCGCG CCTGGCAGCC
GGTCGCCGTG ATGTGCTCGT CGTGCCATAC GCGCCCCATC GCCTGCTCTT CCCACGTGTC
GCCGTCGCCG TTCACCATGG CGGCGCCGGA ACAACCGCCG CCAGTTTGCG CGCCGGTATC
CCGACGATGA CGGTTCCGGT CGGGATCGAT CAACCCTTCT GGGGGATGCG CGTCGCCGCA
ATTGGCGCGG GACCGCCGCC GCTGCCGCGG CGACGCGCAA CGCCGGATCG CCTGGCACCC
GCCATTATGG CGGCGACTGA TGACCTCATC CGGGTGCGTG CTGCGGCGAT AGGGCGGTTG
ATCGGCGCCG AGGAGGGCGT TGCGCGAGCG GTGGAGGTCG TTGCGCGCGT AATGCCATGA
 
Protein sequence
MRPTITILAS GTLGDVRPLA ALGKGLHDAG FAVALATHPQ FAPLVQAQGL AFRSIGGNPS 
DLLLHDDAAL TFDGGVGRGV VATLRYIRSA QAIYARMLDA AATACYGSAL IIVSLASCWG
QLIATTFGIP CVWAPLQPVT LTIRFPSPLL PVTLSLGARA RRLSYTAVEL ATWLPWRTVF
HRWRARAPGP RHMSLDPFAL ACTSSAPFVY GFSPHVVPPP DDWPPHHMVT GYWFLDHPAE
RLAPEIESFL AAGDPPIVIG FGSMGGRRPR DDAALALEAL RLAQRRGILF GSADVARLAA
GRRDVLVVPY APHRLLFPRV AVAVHHGGAG TTAASLRAGI PTMTVPVGID QPFWGMRVAA
IGAGPPPLPR RRATPDRLAP AIMAATDDLI RVRAAAIGRL IGAEEGVARA VEVVARVMP
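
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the method this database uses. It assumes the sequence is pasted as text in the 10-letter blocks shown above and contains only A, C, G and T. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. The codon assignments are the standard ones, which NCBI translation table 11 shares with table 1 (the two tables differ only in permitted start codons).

```python
# Standard codon-to-amino-acid assignments, in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    # Ignore any trailing partial codon; a non-ACGT base raises KeyError.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGCCCAA CCATCACGAT CCTGGCGAGC"))  # prints: MRPTITILAS
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison for the GC content and the complete protein.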