Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3043 |
Symbol | |
ID | 7267258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 3701068 |
End bp | 3702327 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643567863 |
Product | Sterol 3-beta-glucosyltransferase |
Protein accession | YP_002464337 |
Protein GI | 219849904 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000372483 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGAATCT TCATGTGTAC CTACGGCAGT CGCGGTGATG TTCAACCGTT TGTGGCGCTC GGGAAGGCAC TCCGCGCTGC CGGTCACACC CCGATCCTCG CTGCACCGGC CCGCTTTACC ACCTTTGCTG CGGCGTATGG AATTGACTTT GTCCCACTAC CGGGTGATGT CGAAACACTG GCACGGCAGA TCGCCGACGA AGCCCGCAAT CGTCCGTTAC GCTTAATCGG CATCGTCTAC CGCTTTGCCC TGCCGCTCGG CGTCGAAGTG GCGCGCCGGC TGCAACGCGT GGCAGCCAAC GCCGATCTGA TTATCCATAC GTTTTTGACG GTTGCCATCG GTCATCTCTA CGCTCAGCAA TACGGCATTA GCGAATGGGC CGTTGATCTC TTTCCCTTTT TCGATCCACC CGATGAGATT GCCAATATTA TGTGGCCGCA CACCTCGATG GGACCACGCC GGCGCCGACT GAGCCATCGT TTTGCCCACA CCGTGTTTCA ATATAGCCAG CAGTTCACCT ATCGCATGCT CCGCCGACGC GCGCCGGACA TTGGTCCACC CCGCCTATCG TGGGCGATGC CCGGTCGTCA GATTCCATTG TTGCTCGCCT ACAGTTCGGC GCTTGTCCCG CCGGGAACAG GACCGCTCAC GGTACAAACG GGTTCGTGGC ATCTCGAGCA CACTGACTGG CAACCGCCAC CCGATCTCCG TGCCTTTCTC GCCAGCGGTC CACCGCCGGT CGTAGTCAAT TTTGGCAGTA TGGCAACGCG CAACGCACCT CAACTGATGC ATATCGTACT GTCGGCGCTA CGTCAGACCG GCCAGCGGGG CATTATCCAA CGCGGTTGGG CACGGCTCGA ACTCACCGAT CGGCCCAACG ACATCTATCT TGCCGATGAA CTTCCGCACG ATTGGCTTCT CCCGCAGGCC GCCGCTATGA TCCACCACGG TGGCGCAGGG ACAACCGCTA CCGCACTCCG CGCCGGCATC CCCGCTATTA TCATACCGTT TGCCGCCGAT CAACCGTTTT GGGCTTGGCG TGCCCATCTG ACCGGCGCCA ATCCACCACC GATCCCGCCG AGCGAATTAT CGGTTGCACG TTTGTGCCAC GCACTTGAGC AGGCCCTGTC ACCGGAACAA CGTCAGCGTG CTGCCAACAT AAGCGCACAA ATGCAACTTG AAAGAGGAGT AGTTGCCGCC GTCGAGCAAA TTGAACGCCG GAACTTTGAA ACCGAACGAG AGACACGCTT CACAAGATAA
|
Protein sequence | MRIFMCTYGS RGDVQPFVAL GKALRAAGHT PILAAPARFT TFAAAYGIDF VPLPGDVETL ARQIADEARN RPLRLIGIVY RFALPLGVEV ARRLQRVAAN ADLIIHTFLT VAIGHLYAQQ YGISEWAVDL FPFFDPPDEI ANIMWPHTSM GPRRRRLSHR FAHTVFQYSQ QFTYRMLRRR APDIGPPRLS WAMPGRQIPL LLAYSSALVP PGTGPLTVQT GSWHLEHTDW QPPPDLRAFL ASGPPPVVVN FGSMATRNAP QLMHIVLSAL RQTGQRGIIQ RGWARLELTD RPNDIYLADE LPHDWLLPQA AAMIHHGGAG TTATALRAGI PAIIIPFAAD QPFWAWRAHL TGANPPPIPP SELSVARLCH ALEQALSPEQ RQRAANISAQ MQLERGVVAA VEQIERRNFE TERETRFTR
|
| |