Gene Cagg_1828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1828 
Symbol 
ID7267740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2239940 
End bp2241910 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content55% 
IMG OID643566666 
ProductCellulose synthase (UDP-forming) 
Protein accessionYP_002463161 
Protein GI219848728 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATACA CGCGGACAAT CTTATTTCGG GTTGGCTGGG AACCGGCGGA ATGGGCGGCA 
TTCCGCCGCG GTCTGATCAA GATTCTGGTC ATCACCAACC TCATCCTCGG CGCCTACTAT
CTCCTCTGGC GGGGAACAGT ATCGCTTAAT TGGCATGCGT GGTGGTTTGC CCTGGCATTG
TTCGGAGCCG AACTCTACAG TTACATCGGT AGTTTTCTCT TTGGATTAAC CGTGTTTCGG
CTCAAAGAAC GGGGTGAGCC ACCGCCGACG CCGCCGGGAT TGCGCGTTGA TGTCTACATT
ACCTGCTACA ACGAGCCGGT GGAATTGGTG CGCAAGACGG TGCGAGCCGC CATCGCTATC
CGCTATCCGC ATCAGACGTA TCTGCTCGAT GATGGTAATA GTCCGGCGAT GTGGGCAATG
GCCGCCGAAG AGGGTTGCGG ATATATCACC CGTTCACCGG TATGGCAGGG ATTTGACCGT
CATGCCAAAG CCGGCAATCT CATCAATGCC CTTGAACATA CGACCGGGGA TTTTATTCTG
ATCCTCGATG CCGATCAGGT ACCGCTGCCA ACCATTCTCG ACCGTACCCT CGGCTATTTT
ACCGATCCGT TGGTCGCGTT GGTGCAAACA CCACAGTATT TCATCAATGT CCCGCCAGGC
GATCCGTTTG GCAGCCAGGC ACCGCTGTTT TACGGTCCGA TTCAGCAAGG CAAAGACGGC
TGGAACGCAG CATTTTTCTG TGGCTCGAAT GCCGTGCTAC GCCGCGAGGC GCTGGCCCGG
ACCGGTGTTC GCTTCTTTGT ACGCGATGTC GAGCGACGTA TTCGGCGTGC TTTACGTGAA
GCTGACACTG TCGTCAAGCG AGCTGAACGA ACGTTGACTA AAGCAGACCG TCATCGCATT
GCACCGGCGT TGCGGGCATT GCGACAGGCG GTGCGGCAGG CCCGGCACGA ACTTAACCGC
GGTGATACCT TCCAAGAAGT AACCGAACGC TTTCAGCAGC GGGCAGAAGC GGCGGCACGG
TTGATAGTAG CAGCGGATTT ACAGCAAATA ATGGCCGATT TAGCCGAGAT CCAAGCCGCT
GAAGCGGCAG AGATTATGCG GTCGCTGAAT GACGAAACGA TCTTGGCCGA GCTTGCAGCC
CGCGACCGCT CACCGTTGGC TGCCATTGAG ACGGTGCGTC AACTGATACC GCTGCTCGTC
AATGAAGCGA TGGATTATCT CCCGATTTCA ACCATCTCGG TGACCGAAGA TATGTCAACA
TCGATGCGCC TCCACGCATG TGGTTGGCGT TCGGTATACC ACGATGAGAT TCTGGCCCAC
GGATTAGCAC CTGAAGATTT GCGTAGTGCG TTGCAACAAC GCCTCCGTTG GGCGCAGGGG
ACGATACAGG TTATGCTGCG GGAAAACCCC TTGACGCTTC CAGGGTTAAG CTGGGGGCAG
CGTCTGGCCT ATTTTGATAC GATGTGGAGT TATTTGTCAG GCGTGCCGAC GATTATCTAC
CTTATATCAC CGCCACTGTT TCTGTTGTTT GGCCTCTTGC CGGTGAATGC GTTATCGAAT
GAGTTTTTTT GGCGCTTAAT CCCGTACTTA CTGATCAACC AGCTCTTGTT TGCCGTCATC
AGTTGGGGGC GACCAACATG GCGTGGGCAG CAGTATAGTT TGGCGCTTTT CCCACTCTGG
ATCAAAGCAG TGGTAACGGC GGCTGCAAAC GTCTGGTTTG GCAAGAAGTT AGGATTTATC
GTCACTCCCA AAACCCGGCA GGCCGGGCGG TATTTTGGCC TGGTGCGCTG GCAGTTAGCG
ATGATGATCC TCTTAGCAGT CTCGATTGTC GTAGGACTAC TGCAACTAAC GCTGGGCTGG
CGCAATGACG GACTTCCCGT GCTTGTCAAT GTCTTTTGGG CGGTGTACGA TCTGGTTATG
CTCAGTGTGG TCATCGATGC TGCCTTGTAC CAACCCACCG AGGAGCAATA A
 
Protein sequence
MQYTRTILFR VGWEPAEWAA FRRGLIKILV ITNLILGAYY LLWRGTVSLN WHAWWFALAL 
FGAELYSYIG SFLFGLTVFR LKERGEPPPT PPGLRVDVYI TCYNEPVELV RKTVRAAIAI
RYPHQTYLLD DGNSPAMWAM AAEEGCGYIT RSPVWQGFDR HAKAGNLINA LEHTTGDFIL
ILDADQVPLP TILDRTLGYF TDPLVALVQT PQYFINVPPG DPFGSQAPLF YGPIQQGKDG
WNAAFFCGSN AVLRREALAR TGVRFFVRDV ERRIRRALRE ADTVVKRAER TLTKADRHRI
APALRALRQA VRQARHELNR GDTFQEVTER FQQRAEAAAR LIVAADLQQI MADLAEIQAA
EAAEIMRSLN DETILAELAA RDRSPLAAIE TVRQLIPLLV NEAMDYLPIS TISVTEDMST
SMRLHACGWR SVYHDEILAH GLAPEDLRSA LQQRLRWAQG TIQVMLRENP LTLPGLSWGQ
RLAYFDTMWS YLSGVPTIIY LISPPLFLLF GLLPVNALSN EFFWRLIPYL LINQLLFAVI
SWGRPTWRGQ QYSLALFPLW IKAVVTAAAN VWFGKKLGFI VTPKTRQAGR YFGLVRWQLA
MMILLAVSIV VGLLQLTLGW RNDGLPVLVN VFWAVYDLVM LSVVIDAALY QPTEEQ