Gene Cagg_3435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3435 
Symbol 
ID7269660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4172539 
End bp4174923 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content54% 
IMG OID643568245 
Productglycoside hydrolase family 9 
Protein accessionYP_002464713 
Protein GI219850280 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.019722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0493729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTCCA CCATTCTCTT CTTCGCAGCC GTTATCTGGA CACTTAGGTT GATTCCGTCC 
CCGCTAATAG TGCTTCCCAC CACGCAGTTT AATTACGGCG AAGCTCTGCA AAAGTCGATC
TTTTTCTACG AAATCCAACG TTCCGGCCGT CTACCGCCCG ATAATCGGGT TCGGTGGCGT
GGTGATTCGG GTCTGAACGA TGGCGCCGAT GTTGGTATTG ACCTAACCGG TGGCTGGTAT
GATGCAGGTG ATCATGTGAA GTTTGGGTTT CCGATGGCGG CCTCGGCTAC GCTACTGGCA
TGGGGCGTAG TCGAATATCG GCAGGCCTAC GAACAAGCCG GCCTGCTCGA CGATATCTTA
GCCAATTTGC GCTGGGCGAC TGACTATTTC ATCAAAGCTC ACACAGGGCC ATTTGAATTC
TATGGCCAGG TGGGTGATGG TCACCTTGAT CATGCATGGT GGGGGCCGGC TGAAGTGATG
CCGATGCCAC GACCGGCGTA CAAGATCACC GCCGACTGCC CCGGTTCTGA TCTCGCCGCC
GAGACGGCAG CAGCTTTAGC TGCCGCATCC ATCGCTTTTC GCCCGACCGA TCCTGATTAT
GCCGAACAGA TGCTTAATCA TGCGCGTCAG CTCTACACCT TTGCCGATAC GTATCGCGGG
AAGTATAGCG ATTGTATTCA AAATGCAGCA GCATTTTACA ACTCGTGGAG CGGTTATCAG
GACGAACTGG TCTGGGGTGC GGCGTGGTTG TATCGCGCGA CAGGGGAATC GACATACTTA
AGCAAAGCGC AACAGTATGC AATGCAGCTC AGTGGTCAAT ATAAATGGAC GCACAATTGG
GATGATAAAT CGTATGGTAG CTACATCCTG TTAGCTCAAC TGACCGGTCA ACCAACTTAC
CGCGCCAATG TTGAGCGGTG GCTCAATTGG TGGACGGTTG GCGGTACTGA GCATGGCGCC
GATGGTACGC GAATCACCTA TAGTCCGGGT GGGCAAGCGT GGCTAAGTCA ATGGGGATCG
CTGCGGTACA CGGCGAACAC GGCATTTCTC GCGTTCATCT ATGCCGATTG GCTGGCCGCC
AATCACGGCG ATGAGCAGAA GATCGTGCGC TATCGCGATT TTGCCGTCCG CCAGATCAAC
TACATTCTTG GTGAGAATCC ACGTGGGTGT AGTTATATGG TTGGGTTCGG CAATTGTCCT
CCCCAAAACC CGCATCATCG CACAGCACAT GGATCATGGC TCGACTCAAT TGATCAACCA
CCGTATCAGC GTCACATCCT CTACGGCGCT CTCGTTGGTG GACCGGCTCA GCCCGACGAT
CAGTATCATG ATGTCCGCAG CGACTATATC ATGAATGAAG TCGCTACCGA CTATAATGCC
GGCTTGACCG GTGCATTGGC GCGTATGTAT GCGTTGTTCG GTGGCGAACC GCTTACCAAC
TTTCCTCCTC CGGATCTTCC GCCCGATGAT GATGAAGTCT ACGTGCAAGC CACCGTCAAT
GCGAGTGGCC CGAATTTCAC CGAGATCAAA GCTTTTATTA TTAATAAATC GGGTTGGCCG
GCCCGCGTAA CCGACCGGTT AACGATGCGC TACTTCTTTA CTCTTGATGG TGATACCCGT
CCGGAGGATA TTACCGTCAG TGTACCGCGT AATCAGTGTC GCAGTGTCTC ATCGCCGATC
CAGTATACTG ATACGGTGTA TGCGGTTGTC ATCGATTGTG TTGGCGTTAG CATCTATCCC
GGCGGCGCCG ATCATTATCG AAAAGAAGTC CAGTTTCGGC TGACAAGCAG CAAACAGTGG
GATCCAAGTA ATGATTGGTC GTATCGCGAT TTGCGCGCAA CGACGTCGGG CAATCTGATC
AAAGTGACGA CGATCAGTTT GTACGAGGAT GGGACGCGCA TTTGGGGAAC AGAACCAGGC
GGTGCAATTC TACCGCCCCC CGTTACCGAA CGATACGTCT ATATCCCACT GATTGTTGGT
AGTGGCGGGC AGAGCATATC GACGCCGACT CCAGCCCCAA CTATCCTACC GACTTCAACG
CCACCTCCAG TCGATTCGGC AGGATGTCGG GTGAGGTATC ATGTGCAACA GGCGTGGAAC
GATGGGGCAA CGATCACAGT CGTCATCACG AATACCGGAT TACTGGCGAT TGATGGGTGG
ACACTGGCAT GGCAATTTCC CGATGGGCAA CAGATGGTAA CCGATTTCTG GAATGCGGTG
ATTACGCAAG TCGGACGCGA TGTCAGTGCT GCGCACGTCG ATTGGAACCG CGCACTTGCT
CCCGGTGCCC AGCAACAGTT TGGGTTTAAC CTCCAACATA GTGGCGCCAA TCCGCGACCG
TCACAGTTTA CGCTAAACGG TATGATTTGC AATGTAGATA GTTAA
 
Protein sequence
MRSTILFFAA VIWTLRLIPS PLIVLPTTQF NYGEALQKSI FFYEIQRSGR LPPDNRVRWR 
GDSGLNDGAD VGIDLTGGWY DAGDHVKFGF PMAASATLLA WGVVEYRQAY EQAGLLDDIL
ANLRWATDYF IKAHTGPFEF YGQVGDGHLD HAWWGPAEVM PMPRPAYKIT ADCPGSDLAA
ETAAALAAAS IAFRPTDPDY AEQMLNHARQ LYTFADTYRG KYSDCIQNAA AFYNSWSGYQ
DELVWGAAWL YRATGESTYL SKAQQYAMQL SGQYKWTHNW DDKSYGSYIL LAQLTGQPTY
RANVERWLNW WTVGGTEHGA DGTRITYSPG GQAWLSQWGS LRYTANTAFL AFIYADWLAA
NHGDEQKIVR YRDFAVRQIN YILGENPRGC SYMVGFGNCP PQNPHHRTAH GSWLDSIDQP
PYQRHILYGA LVGGPAQPDD QYHDVRSDYI MNEVATDYNA GLTGALARMY ALFGGEPLTN
FPPPDLPPDD DEVYVQATVN ASGPNFTEIK AFIINKSGWP ARVTDRLTMR YFFTLDGDTR
PEDITVSVPR NQCRSVSSPI QYTDTVYAVV IDCVGVSIYP GGADHYRKEV QFRLTSSKQW
DPSNDWSYRD LRATTSGNLI KVTTISLYED GTRIWGTEPG GAILPPPVTE RYVYIPLIVG
SGGQSISTPT PAPTILPTST PPPVDSAGCR VRYHVQQAWN DGATITVVIT NTGLLAIDGW
TLAWQFPDGQ QMVTDFWNAV ITQVGRDVSA AHVDWNRALA PGAQQQFGFN LQHSGANPRP
SQFTLNGMIC NVDS