Gene Cagg_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3043 
Symbol 
ID7267258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3701068 
End bp3702327 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content59% 
IMG OID643567863 
ProductSterol 3-beta-glucosyltransferase 
Protein accessionYP_002464337 
Protein GI219849904 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000372483 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGAATCT TCATGTGTAC CTACGGCAGT CGCGGTGATG TTCAACCGTT TGTGGCGCTC 
GGGAAGGCAC TCCGCGCTGC CGGTCACACC CCGATCCTCG CTGCACCGGC CCGCTTTACC
ACCTTTGCTG CGGCGTATGG AATTGACTTT GTCCCACTAC CGGGTGATGT CGAAACACTG
GCACGGCAGA TCGCCGACGA AGCCCGCAAT CGTCCGTTAC GCTTAATCGG CATCGTCTAC
CGCTTTGCCC TGCCGCTCGG CGTCGAAGTG GCGCGCCGGC TGCAACGCGT GGCAGCCAAC
GCCGATCTGA TTATCCATAC GTTTTTGACG GTTGCCATCG GTCATCTCTA CGCTCAGCAA
TACGGCATTA GCGAATGGGC CGTTGATCTC TTTCCCTTTT TCGATCCACC CGATGAGATT
GCCAATATTA TGTGGCCGCA CACCTCGATG GGACCACGCC GGCGCCGACT GAGCCATCGT
TTTGCCCACA CCGTGTTTCA ATATAGCCAG CAGTTCACCT ATCGCATGCT CCGCCGACGC
GCGCCGGACA TTGGTCCACC CCGCCTATCG TGGGCGATGC CCGGTCGTCA GATTCCATTG
TTGCTCGCCT ACAGTTCGGC GCTTGTCCCG CCGGGAACAG GACCGCTCAC GGTACAAACG
GGTTCGTGGC ATCTCGAGCA CACTGACTGG CAACCGCCAC CCGATCTCCG TGCCTTTCTC
GCCAGCGGTC CACCGCCGGT CGTAGTCAAT TTTGGCAGTA TGGCAACGCG CAACGCACCT
CAACTGATGC ATATCGTACT GTCGGCGCTA CGTCAGACCG GCCAGCGGGG CATTATCCAA
CGCGGTTGGG CACGGCTCGA ACTCACCGAT CGGCCCAACG ACATCTATCT TGCCGATGAA
CTTCCGCACG ATTGGCTTCT CCCGCAGGCC GCCGCTATGA TCCACCACGG TGGCGCAGGG
ACAACCGCTA CCGCACTCCG CGCCGGCATC CCCGCTATTA TCATACCGTT TGCCGCCGAT
CAACCGTTTT GGGCTTGGCG TGCCCATCTG ACCGGCGCCA ATCCACCACC GATCCCGCCG
AGCGAATTAT CGGTTGCACG TTTGTGCCAC GCACTTGAGC AGGCCCTGTC ACCGGAACAA
CGTCAGCGTG CTGCCAACAT AAGCGCACAA ATGCAACTTG AAAGAGGAGT AGTTGCCGCC
GTCGAGCAAA TTGAACGCCG GAACTTTGAA ACCGAACGAG AGACACGCTT CACAAGATAA
 
Protein sequence
MRIFMCTYGS RGDVQPFVAL GKALRAAGHT PILAAPARFT TFAAAYGIDF VPLPGDVETL 
ARQIADEARN RPLRLIGIVY RFALPLGVEV ARRLQRVAAN ADLIIHTFLT VAIGHLYAQQ
YGISEWAVDL FPFFDPPDEI ANIMWPHTSM GPRRRRLSHR FAHTVFQYSQ QFTYRMLRRR
APDIGPPRLS WAMPGRQIPL LLAYSSALVP PGTGPLTVQT GSWHLEHTDW QPPPDLRAFL
ASGPPPVVVN FGSMATRNAP QLMHIVLSAL RQTGQRGIIQ RGWARLELTD RPNDIYLADE
LPHDWLLPQA AAMIHHGGAG TTATALRAGI PAIIIPFAAD QPFWAWRAHL TGANPPPIPP
SELSVARLCH ALEQALSPEQ RQRAANISAQ MQLERGVVAA VEQIERRNFE TERETRFTR