Gene Information |
Locus tag | Rcas_3770 |
Symbol | |
ID | 5541272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 4942487 |
End bp | 4943746 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640895880 |
Product | sterol 3-beta-glucosyltransferase |
Protein accession | YP_001433827 |
Protein GI | 156743698 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
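The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a sanity check, not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. All variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 4942487
end_bp = 4943746
reported_gene_length = 1260      # bp
reported_protein_length = 419    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1260 0 419
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length, and the gene length is a whole number of codons.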
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.190341 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCCAA CCATCACGAT CCTGGCGAGC GGCACGCTGG GGGACGTGCG CCCACTGGCG GCGCTCGGCA AAGGTTTGCA CGATGCTGGC TTCGCTGTTG CGCTTGCAAC CCATCCTCAG TTTGCGCCTC TGGTTCAGGC GCAGGGTCTG GCGTTTCGCA GCATCGGGGG CAATCCCAGT GATCTGCTTC TTCATGATGA TGCGGCGCTG ACCTTCGATG GCGGAGTAGG GCGTGGCGTG GTCGCAACAC TCCGTTATAT TCGGTCGGCG CAGGCGATCT ATGCTCGCAT GCTGGACGCG GCAGCGACCG CATGTTACGG GAGCGCCCTG ATCATTGTGT CGCTGGCAAG TTGCTGGGGG CAACTTATTG CGACGACGTT CGGCATACCC TGTGTCTGGG CGCCGTTGCA GCCCGTCACG CTAACGATCC GCTTTCCATC GCCGCTGCTG CCGGTGACAT TAAGCCTGGG CGCGCGCGCC CGCCGTCTGA GTTATACGGC TGTCGAACTG GCGACCTGGC TGCCATGGCG AACCGTATTC CATCGCTGGC GGGCGCGCGC GCCTGGTCCG CGCCACATGT CGCTCGACCC CTTTGCTCTA GCGTGCACAT CGAGTGCGCC CTTCGTCTAC GGGTTCAGCC CGCATGTCGT GCCACCGCCT GACGACTGGC CGCCACATCA TATGGTGACC GGCTACTGGT TCCTCGACCA TCCGGCTGAA CGCCTGGCGC CGGAGATTGA GTCGTTCCTT GCAGCTGGCG ATCCGCCGAT TGTCATCGGT TTTGGCAGCA TGGGCGGTCG GCGACCGCGC GATGATGCGG CGCTGGCGCT GGAAGCGCTG CGCCTGGCGC AGCGCCGCGG CATTCTCTTC GGTTCAGCCG ACGTCGCGCG CCTGGCAGCC GGTCGCCGTG ATGTGCTCGT CGTGCCATAC GCGCCCCATC GCCTGCTCTT CCCACGTGTC GCCGTCGCCG TTCACCATGG CGGCGCCGGA ACAACCGCCG CCAGTTTGCG CGCCGGTATC CCGACGATGA CGGTTCCGGT CGGGATCGAT CAACCCTTCT GGGGGATGCG CGTCGCCGCA ATTGGCGCGG GACCGCCGCC GCTGCCGCGG CGACGCGCAA CGCCGGATCG CCTGGCACCC GCCATTATGG CGGCGACTGA TGACCTCATC CGGGTGCGTG CTGCGGCGAT AGGGCGGTTG ATCGGCGCCG AGGAGGGCGT TGCGCGAGCG GTGGAGGTCG TTGCGCGCGT AATGCCATGA
|
Protein sequence | MRPTITILAS GTLGDVRPLA ALGKGLHDAG FAVALATHPQ FAPLVQAQGL AFRSIGGNPS DLLLHDDAAL TFDGGVGRGV VATLRYIRSA QAIYARMLDA AATACYGSAL IIVSLASCWG QLIATTFGIP CVWAPLQPVT LTIRFPSPLL PVTLSLGARA RRLSYTAVEL ATWLPWRTVF HRWRARAPGP RHMSLDPFAL ACTSSAPFVY GFSPHVVPPP DDWPPHHMVT GYWFLDHPAE RLAPEIESFL AAGDPPIVIG FGSMGGRRPR DDAALALEAL RLAQRRGILF GSADVARLAA GRRDVLVVPY APHRLLFPRV AVAVHHGGAG TTAASLRAGI PTMTVPVGID QPFWGMRVAA IGAGPPPLPR RRATPDRLAP AIMAATDDLI RVRAAAIGRL IGAEEGVARA VEVVARVMP
|
| |
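The gene and protein sequences can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in the coding orientation (it starts with ATG even though the strand is "-"), and it relies on translation table 11 assigning the same amino acids to sense codons as the standard code. The helper name `translate` and the variable names are hypothetical. Only the first three and the last nine bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
# Assumption: for sense codons, translation table 11 matches the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCGCCCAA CCATCACGAT CCTGGCGAGC"
# Last nine bases of the gene sequence listed above.
last_codons = "ATGCCATGA"

print(translate(first_blocks))  # MRPTITILAS
print(translate(last_codons))   # MP
```

The two outputs match the first ten residues (MRPTITILAS) and the last two residues (MP) of the listed protein sequence. Passing the full gene sequence to the same function would extend the check to all 419 residues.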