Gene Cagg_2093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2093 
Symbol 
ID7267600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2563638 
End bp2566076 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content54% 
IMG OID643566927 
Productglycoside hydrolase family 57 
Protein accessionYP_002463416 
Protein GI219848983 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGC GGCTGATTTG TATTCACGGT CATTTCTACC AGCCGCCACG CGAAAATCCG 
TGGCTCGAGG CAGTCGAACA GCAAGACTCG GCGTACCCCT ATCACGACTG GAACGAGCGG
ATTACAGCCG AGTGCTACGA GCAAAATGCT GCCTCGCGCA TTCTTGATAG CCAAAACCAA
ATTATTCGCA TTGTCAACAA CTATAGTCGG ATCAGTTTTA ACTTTGGCCC CACCCTGCTA
ACGTGGTTAG CCGCTCACGC ACCGCAGGTG TATCAAGCGA TCCTGCAAGC CGATCAGGAA
AGTCAACATT ACTTCGGGGC CGGCTCGGCG ATGGCACAGT GTTATAACCA CATCATTATG
CCACTCGCAT CGCGCCGTGA TAAGGTGACG CAAGTTATCT GGGGTATTCG CGACTTTGTC
CATCGCTTTG GACGGGAACC CGAAGGTATG TGGCTTCCCG AAACGGCAGT TGATCTTGAG
ACGCTCGACA TTATGGCCGA ACACGGCATT AAGTTCACGA TACTTGCGCC CACGCAAGCC
AGTCACGTGC GTAAAATCGG CGAAATGATT TGGCATGATG TGAGCGGTGG GCGAATCGAT
CCGACGCAGC CGTATCTGGT GAAATTGGCG AGTGGCCGAG CCATCACGGT CTTTTTCTAC
GATGGGCCGG TCTCGCGCGC GGTAGCGTTC GAGCGACTCC TTAGCAGTGG GGTTGGGTTT
GCCAATCGGC TGGCCAGTAT TTTTAACGAC CAACGCTCGT GGCCGCAACT CGCTCATATT
GCGACCGATG GTGAGACCTA CGGCCATCAC CATCGCCACG GCGATATGGC ATTGGCGTAT
GCCTTACACT ACATCGAAGA AACCGGTCTG GCGAAACTTA CCAACTACGC AGCGTATCTA
CAACGCTACC GGCCAACGCA TGAAGTTCAG ATCATCGAAC GAACTTCGTG GAGTTGTGCG
CACGGTGTCG GGCGCTGGTC AACCGATTGT GGTTGCAATA CTGGCAGCAA TCCTGGTTGG
AATCAGGCAT GGCGTGCGCC GCTACGCGCT GCCCTAGACT GGTTACGCGA TACGATTGCG
CCACGGTTCG AGGGTTATGC CCGCCGCCTG TTGCACGATC CATGGGCAGC GCGTGATGAT
TACATCAGCG TCATTCTTGA CCGTTCACCG GAAAACGTCG CCGCTTTTAT CGGTCGGCAT
AGCCGTGGCC GGCTGGATGA TCACCAGCGG ATTGCAATCC TGAAGTTGAT GGAACTGCAG
CGCCACCTGA TGCTGATGTA TACCTCGTGT GGCTGGTTTT TCGATGATCT GAGCGGGATC
GAGACGATAC AGGTCATGAT GTACGCCGGC CGGGCTATCC AACTGGCTCA CGAACTGTTT
GGTGAAGAGA TCGAGGGTGA ATTCCTCAAT CGACTGGCAC AAGCGCGCAG CAACCTTCCG
TCACGGGGTA ACGGACGCGA TCTCTACGAA CGGCACGTGC GTCCGGCGAT GGTCGATCTG
CGCAAAGTCG GTGCGCATTA TGCAATGACT GCCCTCTTCA ACGGAGTAGG TGAACACGAA
CAGATCTATG CCTATACCGT TGAGCGTGAA GATTACCATC TCTTGCTGGC GGGAAAATCG
CGATTGGCAC TTGGCCGCAT TCGCATTATC TCCAACATCA CCGGGGAATC CACACGCCTC
AGCTTCGGCG TCTTACATCT CGGCGATCAC AACATATCCG GTGGTGTCCG TGAATACCAG
AACGAGCAGA TATACCAGCA ACTAATTGAA GAGTTGAGCG AACTCTTTCT GCGCGCCGAT
ATACCCGGCG TCATTCGTAT GGTCGACCGC AATTTTGGTC AAGAGCAATA TTCACTCAAA
CTTCTCTTCG GCGACGAGCA GCGCCAGATC CTCAATCGCA TTCTTACATC GAGTCTGGCT
GAAGCCGAAG CTGCCTATCG CCAAATCTAC GAAAATCACG CGCCGCTGAT GCGCTTTCTC
GCCAGCATGG GAATGCCGGT CCCGCGAGAA TTTCAGATTG CAGCCGAATT TGCCATCAAT
ACCGAATTGC GTCGCCTCTT TGAAACGGAA CCTCTCGATT TTGACCGCAT TAACAGCCTC
TTGCGCGAAG CACAGCGGTC GGGGGTGACG CTCGATGCAG AAGGGCTTAG TTATGCACTG
GCCCGTACCA TTCGCAATAT TAGCGAGAAC TTCTATCAGA ATCCGGAAGA TCGCGCATTG
CTGACTCAGC TCGATGCTGC GGTCGGCCTG GCGCGCAACC TCCCGTTCGA GGTTGATGTC
TGGCATACGC AAAACGTGTA TTACAAGTTG TTGCAAACCG TGTATCCGCA GATGGAAGCC
GATACCCGTG CCGGATTTGC CGATGCTTAT GCATGGATAA GGCTCTTCCG GTCATTGGGG
ACTAAATTGC GCTTCCGTCT GCCAGCAGGA GAGCCATGA
 
Protein sequence
MAERLICIHG HFYQPPRENP WLEAVEQQDS AYPYHDWNER ITAECYEQNA ASRILDSQNQ 
IIRIVNNYSR ISFNFGPTLL TWLAAHAPQV YQAILQADQE SQHYFGAGSA MAQCYNHIIM
PLASRRDKVT QVIWGIRDFV HRFGREPEGM WLPETAVDLE TLDIMAEHGI KFTILAPTQA
SHVRKIGEMI WHDVSGGRID PTQPYLVKLA SGRAITVFFY DGPVSRAVAF ERLLSSGVGF
ANRLASIFND QRSWPQLAHI ATDGETYGHH HRHGDMALAY ALHYIEETGL AKLTNYAAYL
QRYRPTHEVQ IIERTSWSCA HGVGRWSTDC GCNTGSNPGW NQAWRAPLRA ALDWLRDTIA
PRFEGYARRL LHDPWAARDD YISVILDRSP ENVAAFIGRH SRGRLDDHQR IAILKLMELQ
RHLMLMYTSC GWFFDDLSGI ETIQVMMYAG RAIQLAHELF GEEIEGEFLN RLAQARSNLP
SRGNGRDLYE RHVRPAMVDL RKVGAHYAMT ALFNGVGEHE QIYAYTVERE DYHLLLAGKS
RLALGRIRII SNITGESTRL SFGVLHLGDH NISGGVREYQ NEQIYQQLIE ELSELFLRAD
IPGVIRMVDR NFGQEQYSLK LLFGDEQRQI LNRILTSSLA EAEAAYRQIY ENHAPLMRFL
ASMGMPVPRE FQIAAEFAIN TELRRLFETE PLDFDRINSL LREAQRSGVT LDAEGLSYAL
ARTIRNISEN FYQNPEDRAL LTQLDAAVGL ARNLPFEVDV WHTQNVYYKL LQTVYPQMEA
DTRAGFADAY AWIRLFRSLG TKLRFRLPAG EP