Gene Cagg_1689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1689 
Symbol 
ID7268991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2061309 
End bp2063168 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content57% 
IMG OID643566531 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_002463026 
Protein GI219848593 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGC AATATCTGCT TTGTCTTGTG ATGTTGGTCG GCGTATTTGG CGCTTGTGGG 
ACGCTGAATG CACCCCCCGC TACACCGACA CCGGTCATAC CGCTCTACCG TAATCCGGCA
GCACCTATCG CCGAGCGGGT CGAGGATCTG CTACAGCGGA TGACATTGGC CGAGAAGATC
GGCCAGATGA CGCTGATCGA AAAAAATAGC ATCACCGCCG ATCAGGTACG TGAATTGGCC
ATCGGTGGTG TGCTCAGCGG TGGCGGTGGC TATCCAGACG ACGAGAACTC GCCGATGGCG
TGGGTGGAGA TGGTTAATGC CTTGCAACAG GCGGCATTGA ATAGCCGGCT CGGCATTCCG
ATCATCTATG GGGCTGATGG TGTTCACGGA CACAACAACC TCTACGGTGC CGTCATCTTT
CCGCATAACA TCGGGTTGGG GGCAGCGAAT GACCCCGCAC TGGTCGAGCA GATCGGGCGG
GTGACGGCCC GCGAGATGGC GGCTACCGGT GTCTTTTGGA ACTACGCGCC GGGGGTGATG
GTAGTGCAAG ATGTGCGTTG GGGGCGTACC TACGAAAGCT ATGCCGAACG TCCTGAACAC
GTTGCATCGT TGGCAGTCGC TTTTTTGCGT GGCTTGCAAG CTCCCGATAT TGCAGCACCA
AACCGGATCA TCGGCACTCC CAAACACTAT GTCGGTGATG GCGGTACGAC ATGGGGCACG
TCAACCACGG CAAACTATCA ACTCGATCAG GGGGAGACGT TTGGTGATGA AACCACGATC
CGAACCGTGC ATCTCCCACC GTACCGCGCG ACCATCGCTG CCGGTGCGCA TGTGATTATG
GCGTCGTATT CGAGCTGGAA CGGACAGAAG ATGCACGCCA GTTCGTATTG GCTCACCAAT
GTGCTGAAAG AAGAACTCGG CTTTACCGGT TTTATTGTCT CAGATTGGGA AGCCATCGAT
CAGATTGATC CCGACTATGA ACGGGCGGTG GTGACGGCCA TAAATGCCGG GATCGATATG
AATATGGTGC CTTACGATGC GGTGCGCTTC ATCGAGACCC TGACTCGCGC CGTCAATACC
GGTATGGTGA GCGAAACGCG GATTGACGAT GCGGTGCGAC GAATCTTGAC GACCAAGTTT
GCGATGGGGT TATTTGATCA ACCTTTCGCC CACACCGAAC TACTGGGCGA CATCGGTAGT
CCGGCCCACC GCGCATTAGC CCGTACCGCC GTTGCCCAAT CGTTGGTCTT GCTCAAAAAT
GACGGTAACC TCCTCCCCTT ACCGAAAGAT GTTGCCCATC TCTACATCGG TGGGCAGGCT
GCTCACGATC TCGGTATCCA AGCCGGCGGC TGGACAATTG AGTGGCAAGG GAAGCCGGGT
GCGATTATCC CGGGAACGAC GATTCTCGAA GGGATTCAAG CGGCTGTTAC AGCACAAACG
GTCATTGAGT ACGATCCACA CGGACGGTTT CGCGGTGATC CGATGGCGAC CGATGCCGTC
TGCATTGCGG TCGTCGGCGA ATTGCCTTAC GCCGAAGGAC GCGGCGACAG CGCAACCTTA
CGCTTACCAC CGAACGAACA GCGCACACTG CGTCGGATGG AGGAAAGCTG TGCCCGTCTC
ATTGTCGTAC TCGTCAGTGG CCGTCCGCTG ATCATCACCG ACGATCTGCC TCGTTGGGAT
GCGCTTGTCG CCGCGTGGCT ACCCGGTAGC GAAGGGGCCG GTGTCGCCGA TGTTCTGTTT
GGCGATCAAC CATTTCGCGG GCGATTACCG GTGACGTGGC CGCGCAGCCT CGATCAATTA
CCGCTCGGAT CAGGAAGCGG CGAGCCACTC TTTCCCTATG GATTTGGACT AACCCCATAA
 
Protein sequence
MRLQYLLCLV MLVGVFGACG TLNAPPATPT PVIPLYRNPA APIAERVEDL LQRMTLAEKI 
GQMTLIEKNS ITADQVRELA IGGVLSGGGG YPDDENSPMA WVEMVNALQQ AALNSRLGIP
IIYGADGVHG HNNLYGAVIF PHNIGLGAAN DPALVEQIGR VTAREMAATG VFWNYAPGVM
VVQDVRWGRT YESYAERPEH VASLAVAFLR GLQAPDIAAP NRIIGTPKHY VGDGGTTWGT
STTANYQLDQ GETFGDETTI RTVHLPPYRA TIAAGAHVIM ASYSSWNGQK MHASSYWLTN
VLKEELGFTG FIVSDWEAID QIDPDYERAV VTAINAGIDM NMVPYDAVRF IETLTRAVNT
GMVSETRIDD AVRRILTTKF AMGLFDQPFA HTELLGDIGS PAHRALARTA VAQSLVLLKN
DGNLLPLPKD VAHLYIGGQA AHDLGIQAGG WTIEWQGKPG AIIPGTTILE GIQAAVTAQT
VIEYDPHGRF RGDPMATDAV CIAVVGELPY AEGRGDSATL RLPPNEQRTL RRMEESCARL
IVVLVSGRPL IITDDLPRWD ALVAAWLPGS EGAGVADVLF GDQPFRGRLP VTWPRSLDQL
PLGSGSGEPL FPYGFGLTP