Gene Cagg_1691 details


Gene Information

Locus tag: Cagg_1691
Symbol:
ID: 7268993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2064112
End bp: 2066556
Gene Length: 2445 bp
Protein Length: 814 aa
Translation table: 11
GC content: 58%
IMG OID: 643566533
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_002463028
Protein GI: 219848595
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
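
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information record above.
start_bp = 2064112
end_bp = 2066556
gene_length_bp = 2445
protein_length_aa = 814


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 2445
print(expected_protein_length(gene_length_bp))  # 814
```

Under these assumptions both computed values agree with the listed Gene Length and Protein Length.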


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAATC ACATTGAACA TCTGGTACGT CAACTAACGC TTGACGAAAA AATCGCCTTA 
CTTGCCGGCG CCGATGCATG GCATACCGTC GCCATTCCGC GCCTCGGCAT CCCCGCCATC
AAGGTCACCG ACGGCCCCAA CGGTGCGCGC GGTGTCAGCC GCAACGGCAT CCACACCTCT
GCCTGTTTTC CGATTGGCGT CGCGATGGGG GCCACGTGGA ACCCGGCACT GGTTCGTCAA
ATCGGCGAAG CCTTAGCCGA AGAGACGAAG GATAAAGGGG CGCATATTCT GTTGGCACCG
ACGGTTAACA TCCACCGCTC ACCGCTGGCC GGGCGCAACT TCGAGTGCTT TTCCGAAGAC
CCCTACCTGA CCGGCGTGAT GGCAGCGGCC TACATTACCG GCTTGCAAAG CCGTGGCGTC
GGTGCATGTA TCAAGCACTT CGTGTGCAAC GACTCGGAAT TTGAGCGCTT TAGCATCAGT
TCCGATGTTG GTGAGCGACC ATTGCGCGAA ATCTATCTCC GCCCGTTCGA GATGGCGATC
AAGCAGGCAA AACCGTGGTC GCTTATGTCG GCCTACAACC GGATCAATGG AGTATGGGCC
AGTGAAAATC GCCGGTTATT GGTCGAAATC CTCAAAGGCG AATGGCAGTT CGATGGACTG
GTGATGTCAG ACTGGTATGG TACCTATAGC GCTCGCGCTA CCCACAATGG TCTTGACCTT
GAAATGCCGG GGCCAGCACG TTGGTTGAAC CGTGAGCACG TCTTGGCTGC GCTCGAGCGT
GGCGATCTGC GTGAATCTGA CCTCGACGAT AAGGTATACC GATTGTTACG CACTATCGAA
CGGGTCGGCG GCTTTGCGAA CCCCACACCG GCAATTGAGC AGGCCAACGA CCGGCCTGAG
CATCGGACCC TCATTCGGCG CGCCGGTGTC GAATCAATCG TGCTGCTCAA GAACGAAGGA
CGAATCCTCC CGTTGAATCC CTCCCAAGGG CAATCCATTG CCGTTATCGG CGCCAACGCG
CATTGGGCGG CGATTATGGG CGGCGGCAGC TCGGAAGTTG CACCGCACTA CGTGGTTACT
CCGTTACAGG GCATTCGAGC GCGCGCCGGC GAGCAGTGCG TGGTGGACTA CGCCATCGGC
ACCCCCATTT TCCGTCGCTT GCCGAACATT GATCCGGCAT GGGTGCGCAT CCCCGCTAGC
GACCGACCGG GAGTGCAGCT TGATTATTAC ACCGATCTCG ACTTCGGTGG TGAACCGGTA
CGCACCGACA CCCTGACGAC CCTCGAAGCG AGCTGGTTTG GCGACCGGAT CGAATATCTC
AACCTCACCA CCTTCGCCGC CAAACTCAGT TGCGAACTCT TACCGCCCCA CACCGGTCAT
TACCACATTG GGATGTCATG CGTCGGGCAA GCCCGGGTTT GGCTCGATGG TGAATTAGTG
CTCGACCGGT GGGAAAAAGA ACTCATGGAC GGCAATGAGC AGCGCCAAAC GATAGCCCTC
GAAACCGGAC GCCGCCACCG GCTCGTGGTT GAGTTCCGCT GGCCAACGCC CGGCAACTGG
CGTGCCGTGC AGGTTGGTCT CTGGCCGGAA CGGGAAGATG ATCCCATCGC GGAAGCGGTA
GAGCTGGCTG CTCGCTCGCA TGTAGCAATT GTGTTTGCCG GCCTAACCAA AGAGTGGGAG
AGCGAAGGCT TCGACCGGAT CGATATGGAA CTCCCCGGTC GCCAAAATGA ACTGATTCGC
CGAGTGGCGG CGGTCAATCC GCGCACAATC GTAGTGCTCA ACGCCGGATC GCCCGTACAT
ATGCCGTGGA TTGACGAAGT AGCGGCTGTG ATCCAGGCGT GGTACGGCGG CCAAGAGGCG
GGTAATGCCA TTGCCGATGT GCTTTTCGGC GACGCCGATC CGGGTGGACG ATTACCGACG
ACCTTCCCCA AACGACTGGC CGACAATCCG GCGTACATCA ATTATCCCGG CGAAAACGGG
CACGTCCTCT ACGGCGAAGG CCTATTTGTC GGCTATCGCT ACTACGACCG TAAAGGCATC
GAACCGCTCT TTCCATTTGG GTTTGGGCTG AGTTACACCG AGTTTTCTTA CGATCGGCTT
CAGCTCTCTG CCCCGATGAT GCGACCAGAT GAAACCATTA CCGTCAGTGT CGATGTTACG
AACATCGGTG ATCGTCCGGG AATGGAGGTT GTCCAGCTCT ACATCCACGA CCGGGTGGCC
CGACTGATGC GACCCGATAA AGAGCTGAAG GGCTTTGCAA AGGTGACTCT GCAACCGGGT
GAAACCACGA CGGTGACGTT TACCATTGAC CGACAGGCAC TGAGTTATTA TGATCCGGCG
GTCCCGGGTT GGATTGCCGA ACCGGGCACC TTCACCGTGC TCGTTGGCCG ATCGGTGGCC
GATATTCGGC TGAAGGCCTC ATTTGAGCTG GTCGCCGAAG GATGA
 
Protein sequence
MPNHIEHLVR QLTLDEKIAL LAGADAWHTV AIPRLGIPAI KVTDGPNGAR GVSRNGIHTS 
ACFPIGVAMG ATWNPALVRQ IGEALAEETK DKGAHILLAP TVNIHRSPLA GRNFECFSED
PYLTGVMAAA YITGLQSRGV GACIKHFVCN DSEFERFSIS SDVGERPLRE IYLRPFEMAI
KQAKPWSLMS AYNRINGVWA SENRRLLVEI LKGEWQFDGL VMSDWYGTYS ARATHNGLDL
EMPGPARWLN REHVLAALER GDLRESDLDD KVYRLLRTIE RVGGFANPTP AIEQANDRPE
HRTLIRRAGV ESIVLLKNEG RILPLNPSQG QSIAVIGANA HWAAIMGGGS SEVAPHYVVT
PLQGIRARAG EQCVVDYAIG TPIFRRLPNI DPAWVRIPAS DRPGVQLDYY TDLDFGGEPV
RTDTLTTLEA SWFGDRIEYL NLTTFAAKLS CELLPPHTGH YHIGMSCVGQ ARVWLDGELV
LDRWEKELMD GNEQRQTIAL ETGRRHRLVV EFRWPTPGNW RAVQVGLWPE REDDPIAEAV
ELAARSHVAI VFAGLTKEWE SEGFDRIDME LPGRQNELIR RVAAVNPRTI VVLNAGSPVH
MPWIDEVAAV IQAWYGGQEA GNAIADVLFG DADPGGRLPT TFPKRLADNP AYINYPGENG
HVLYGEGLFV GYRYYDRKGI EPLFPFGFGL SYTEFSYDRL QLSAPMMRPD ETITVSVDVT
NIGDRPGMEV VQLYIHDRVA RLMRPDKELK GFAKVTLQPG ETTTVTFTID RQALSYYDPA
VPGWIAEPGT FTVLVGRSVA DIRLKASFEL VAEG
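
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC fraction and a translation. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits, not in the amino acids it assigns).

```python
from itertools import product


def clean_sequence(text: str) -> str:
    """Join space- and newline-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Standard amino acid assignments, codons ordered with bases T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a cleaned DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCCAAATC ACATTGAACA TCTGGTACGT")
print(translate(fragment))  # MPNHIEHLVR
```

The fragment translates to the first block of the listed protein sequence. Passing the complete gene sequence through the same functions would allow a comparison with the listed GC content and protein length.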