Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2259 |
Symbol | |
ID | 7266671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2757802 |
End bp | 2758878 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643567089 |
Product | Cellulase |
Protein accession | YP_002463575 |
Protein GI | 219849142 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0943649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000415481 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | GTGGATGCAA TGTTGCAACT CTTTGGCGAT CTCATTGCTG CACCCGGCCC CTCCGGTTAT GAAGGGGCGG TACGAGAGGT GATGCGCCGC TACCTTGAAC CGATCGGCGA GATCGAGATC GATTATCTAG GTAGCATCAT CGCACATCGC ACGGGTAAAC CCGACGGCCC ACGGGTGGCA CTCGCTGCTC ATCTTGACGA GATCGGACTG CTCGTGACCC GCATCACCGA CGATGGCTTT CTCAAGTTTC AACCGCTGGG CGGCTGGTGG GATCATGTGC TGTTGGGCAT GCGGGTCGAG GTGATTGGTC GGAACGGTCC TATTATCGGC GTGATCGGGG CCAAACCCCC ACACATTTTG AGCAACGACG AACGCAGCCG CTTGGTTGAA AAGAAAACAA TGTACATCGA CATTGGGGCC ACGTCGCGTG ATGAAGTAGT TGCGTGGGGA GTACGACCCG GTGATCCGGT GGTGCCGGTC GGGCCACTTA CCCCAATGCG CAATCCCGAT TTACTGATGG CCAAAGCGAT CGATAACCGT GTCGGCTGTG CGATTGTCGT CGAAACCCTA CGCAGATTAG TCGGCGTCAC CCACCCTAAT ATCATCTTCG GGGTTGGAAA TGTGCAAGAA GAGGTTGGTT TACGTGGCGC CGCAACCACT ACTTATACGA TTCAACCCGA CATCGGGATT ACCATCGATA CCGCTATCGC CGGCGATACA CCAGGGGTTG GCCCCGATGA CGCGATGAGC CGTCTGGGAC AAGGTCCGGC CTTACTCTTG ATCGACGGAT CACTGATCGC ACACGCGACA CTCCGCCATC TGGTGATCGA TGTCGCTGCT GAAGAGGGCA TCCCGCTCCA ATTCGATCTG ATGCCTGGGG GTGGTACTGA TGGTGGCCGG ATGCACATCT TTGGCAAGGG CGTACCAACC GTTGTGATCG GTCCACCGGT GCGCTACATC CATTCAGCAT CAGCGATTGT TCACCGCCGT GACATCGAAC AAACAGTGCA GCTCCTCCTG GCGCTGATCC AGCGCCTTAA CAGTGAAACG GTGCGTCAGA TTCGGCAGGG GATGTAG
|
Protein sequence | MDAMLQLFGD LIAAPGPSGY EGAVREVMRR YLEPIGEIEI DYLGSIIAHR TGKPDGPRVA LAAHLDEIGL LVTRITDDGF LKFQPLGGWW DHVLLGMRVE VIGRNGPIIG VIGAKPPHIL SNDERSRLVE KKTMYIDIGA TSRDEVVAWG VRPGDPVVPV GPLTPMRNPD LLMAKAIDNR VGCAIVVETL RRLVGVTHPN IIFGVGNVQE EVGLRGAATT TYTIQPDIGI TIDTAIAGDT PGVGPDDAMS RLGQGPALLL IDGSLIAHAT LRHLVIDVAA EEGIPLQFDL MPGGGTDGGR MHIFGKGVPT VVIGPPVRYI HSASAIVHRR DIEQTVQLLL ALIQRLNSET VRQIRQGM
|
| |