Gene Cagg_2259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2259 
Symbol 
ID7266671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2757802 
End bp2758878 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content57% 
IMG OID643567089 
ProductCellulase 
Protein accessionYP_002463575 
Protein GI219849142 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0943649 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000415481 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
GTGGATGCAA TGTTGCAACT CTTTGGCGAT CTCATTGCTG CACCCGGCCC CTCCGGTTAT 
GAAGGGGCGG TACGAGAGGT GATGCGCCGC TACCTTGAAC CGATCGGCGA GATCGAGATC
GATTATCTAG GTAGCATCAT CGCACATCGC ACGGGTAAAC CCGACGGCCC ACGGGTGGCA
CTCGCTGCTC ATCTTGACGA GATCGGACTG CTCGTGACCC GCATCACCGA CGATGGCTTT
CTCAAGTTTC AACCGCTGGG CGGCTGGTGG GATCATGTGC TGTTGGGCAT GCGGGTCGAG
GTGATTGGTC GGAACGGTCC TATTATCGGC GTGATCGGGG CCAAACCCCC ACACATTTTG
AGCAACGACG AACGCAGCCG CTTGGTTGAA AAGAAAACAA TGTACATCGA CATTGGGGCC
ACGTCGCGTG ATGAAGTAGT TGCGTGGGGA GTACGACCCG GTGATCCGGT GGTGCCGGTC
GGGCCACTTA CCCCAATGCG CAATCCCGAT TTACTGATGG CCAAAGCGAT CGATAACCGT
GTCGGCTGTG CGATTGTCGT CGAAACCCTA CGCAGATTAG TCGGCGTCAC CCACCCTAAT
ATCATCTTCG GGGTTGGAAA TGTGCAAGAA GAGGTTGGTT TACGTGGCGC CGCAACCACT
ACTTATACGA TTCAACCCGA CATCGGGATT ACCATCGATA CCGCTATCGC CGGCGATACA
CCAGGGGTTG GCCCCGATGA CGCGATGAGC CGTCTGGGAC AAGGTCCGGC CTTACTCTTG
ATCGACGGAT CACTGATCGC ACACGCGACA CTCCGCCATC TGGTGATCGA TGTCGCTGCT
GAAGAGGGCA TCCCGCTCCA ATTCGATCTG ATGCCTGGGG GTGGTACTGA TGGTGGCCGG
ATGCACATCT TTGGCAAGGG CGTACCAACC GTTGTGATCG GTCCACCGGT GCGCTACATC
CATTCAGCAT CAGCGATTGT TCACCGCCGT GACATCGAAC AAACAGTGCA GCTCCTCCTG
GCGCTGATCC AGCGCCTTAA CAGTGAAACG GTGCGTCAGA TTCGGCAGGG GATGTAG
 
Protein sequence
MDAMLQLFGD LIAAPGPSGY EGAVREVMRR YLEPIGEIEI DYLGSIIAHR TGKPDGPRVA 
LAAHLDEIGL LVTRITDDGF LKFQPLGGWW DHVLLGMRVE VIGRNGPIIG VIGAKPPHIL
SNDERSRLVE KKTMYIDIGA TSRDEVVAWG VRPGDPVVPV GPLTPMRNPD LLMAKAIDNR
VGCAIVVETL RRLVGVTHPN IIFGVGNVQE EVGLRGAATT TYTIQPDIGI TIDTAIAGDT
PGVGPDDAMS RLGQGPALLL IDGSLIAHAT LRHLVIDVAA EEGIPLQFDL MPGGGTDGGR
MHIFGKGVPT VVIGPPVRYI HSASAIVHRR DIEQTVQLLL ALIQRLNSET VRQIRQGM