Gene Information |
Locus tag | Cagg_1828 |
Symbol | |
ID | 7267740 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2239940 |
End bp | 2241910 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643566666 |
Product | Cellulose synthase (UDP-forming) |
Protein accession | YP_002463161 |
Protein GI | 219848728 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
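
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS includes its stop codon. Neither assumption is stated in the record, and the function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2239940
END_BP = 2241910


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1971 656"
```

Under those assumptions the result agrees with the Gene Length (1971 bp) and Protein Length (656 aa) fields.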
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATACA CGCGGACAAT CTTATTTCGG GTTGGCTGGG AACCGGCGGA ATGGGCGGCA TTCCGCCGCG GTCTGATCAA GATTCTGGTC ATCACCAACC TCATCCTCGG CGCCTACTAT CTCCTCTGGC GGGGAACAGT ATCGCTTAAT TGGCATGCGT GGTGGTTTGC CCTGGCATTG TTCGGAGCCG AACTCTACAG TTACATCGGT AGTTTTCTCT TTGGATTAAC CGTGTTTCGG CTCAAAGAAC GGGGTGAGCC ACCGCCGACG CCGCCGGGAT TGCGCGTTGA TGTCTACATT ACCTGCTACA ACGAGCCGGT GGAATTGGTG CGCAAGACGG TGCGAGCCGC CATCGCTATC CGCTATCCGC ATCAGACGTA TCTGCTCGAT GATGGTAATA GTCCGGCGAT GTGGGCAATG GCCGCCGAAG AGGGTTGCGG ATATATCACC CGTTCACCGG TATGGCAGGG ATTTGACCGT CATGCCAAAG CCGGCAATCT CATCAATGCC CTTGAACATA CGACCGGGGA TTTTATTCTG ATCCTCGATG CCGATCAGGT ACCGCTGCCA ACCATTCTCG ACCGTACCCT CGGCTATTTT ACCGATCCGT TGGTCGCGTT GGTGCAAACA CCACAGTATT TCATCAATGT CCCGCCAGGC GATCCGTTTG GCAGCCAGGC ACCGCTGTTT TACGGTCCGA TTCAGCAAGG CAAAGACGGC TGGAACGCAG CATTTTTCTG TGGCTCGAAT GCCGTGCTAC GCCGCGAGGC GCTGGCCCGG ACCGGTGTTC GCTTCTTTGT ACGCGATGTC GAGCGACGTA TTCGGCGTGC TTTACGTGAA GCTGACACTG TCGTCAAGCG AGCTGAACGA ACGTTGACTA AAGCAGACCG TCATCGCATT GCACCGGCGT TGCGGGCATT GCGACAGGCG GTGCGGCAGG CCCGGCACGA ACTTAACCGC GGTGATACCT TCCAAGAAGT AACCGAACGC TTTCAGCAGC GGGCAGAAGC GGCGGCACGG TTGATAGTAG CAGCGGATTT ACAGCAAATA ATGGCCGATT TAGCCGAGAT CCAAGCCGCT GAAGCGGCAG AGATTATGCG GTCGCTGAAT GACGAAACGA TCTTGGCCGA GCTTGCAGCC CGCGACCGCT CACCGTTGGC TGCCATTGAG ACGGTGCGTC AACTGATACC GCTGCTCGTC AATGAAGCGA TGGATTATCT CCCGATTTCA ACCATCTCGG TGACCGAAGA TATGTCAACA TCGATGCGCC TCCACGCATG TGGTTGGCGT TCGGTATACC ACGATGAGAT TCTGGCCCAC GGATTAGCAC CTGAAGATTT GCGTAGTGCG TTGCAACAAC GCCTCCGTTG GGCGCAGGGG ACGATACAGG TTATGCTGCG GGAAAACCCC TTGACGCTTC CAGGGTTAAG CTGGGGGCAG CGTCTGGCCT ATTTTGATAC GATGTGGAGT TATTTGTCAG GCGTGCCGAC GATTATCTAC CTTATATCAC CGCCACTGTT TCTGTTGTTT GGCCTCTTGC CGGTGAATGC GTTATCGAAT GAGTTTTTTT GGCGCTTAAT CCCGTACTTA CTGATCAACC AGCTCTTGTT TGCCGTCATC AGTTGGGGGC GACCAACATG GCGTGGGCAG CAGTATAGTT TGGCGCTTTT CCCACTCTGG ATCAAAGCAG TGGTAACGGC GGCTGCAAAC GTCTGGTTTG GCAAGAAGTT AGGATTTATC GTCACTCCCA AAACCCGGCA GGCCGGGCGG TATTTTGGCC TGGTGCGCTG GCAGTTAGCG 
ATGATGATCC TCTTAGCAGT CTCGATTGTC GTAGGACTAC TGCAACTAAC GCTGGGCTGG CGCAATGACG GACTTCCCGT GCTTGTCAAT GTCTTTTGGG CGGTGTACGA TCTGGTTATG CTCAGTGTGG TCATCGATGC TGCCTTGTAC CAACCCACCG AGGAGCAATA A
|
Protein sequence | MQYTRTILFR VGWEPAEWAA FRRGLIKILV ITNLILGAYY LLWRGTVSLN WHAWWFALAL FGAELYSYIG SFLFGLTVFR LKERGEPPPT PPGLRVDVYI TCYNEPVELV RKTVRAAIAI RYPHQTYLLD DGNSPAMWAM AAEEGCGYIT RSPVWQGFDR HAKAGNLINA LEHTTGDFIL ILDADQVPLP TILDRTLGYF TDPLVALVQT PQYFINVPPG DPFGSQAPLF YGPIQQGKDG WNAAFFCGSN AVLRREALAR TGVRFFVRDV ERRIRRALRE ADTVVKRAER TLTKADRHRI APALRALRQA VRQARHELNR GDTFQEVTER FQQRAEAAAR LIVAADLQQI MADLAEIQAA EAAEIMRSLN DETILAELAA RDRSPLAAIE TVRQLIPLLV NEAMDYLPIS TISVTEDMST SMRLHACGWR SVYHDEILAH GLAPEDLRSA LQQRLRWAQG TIQVMLRENP LTLPGLSWGQ RLAYFDTMWS YLSGVPTIIY LISPPLFLLF GLLPVNALSN EFFWRLIPYL LINQLLFAVI SWGRPTWRGQ QYSLALFPLW IKAVVTAAAN VWFGKKLGFI VTPKTRQAGR YFGLVRWQLA MMILLAVSIV VGLLQLTLGW RNDGLPVLVN VFWAVYDLVM LSVVIDAALY QPTEEQ
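
The GC content field in the Gene Information table can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string that may contain the spaces and line breaks shown above. The helper name `gc_content` is hypothetical, and the record does not say how its 55% figure was rounded or which region it covers.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Illustration on the first two 10-base blocks of the gene sequence only.
first_blocks = "ATGCAATACA CGCGGACAAT"
print(f"{gc_content(first_blocks):.2f}")  # prints "0.45"
```

Passing the full gene sequence to the same function gives the value to compare against the reported GC content.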