Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_2006 |
Symbol | |
ID | 8324106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | - |
Start bp | 2122293 |
End bp | 2125877 |
Gene Length | 3585 bp |
Protein Length | 1194 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644953131 |
Product | cellulose-binding family II |
Protein accession | YP_003110581 |
Protein GI | 256372757 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0781083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTCTTG CTCTTGGCCT CGTTGCCGGT GGCATCGCGA GCGTGTTGGG AGCACAGACA GCGCTGACGG CAGCCGCGAC CGCGTTCCCG GCGACACCTC CTGGCGCGGT GAGCGGTCCG TTCCAGGTCC GTGGCAACGT CATCGTCGAC GCAGCCGGAG CGCCCGTGTA TCTCCACGGC GTGGACTGGC CGAGCCTCGT GTGGAATCCG GATGGCCAGT GGGCCAACGC GAGCCAGTCG GGGGTCAATC CGAATGAGTT CGTCGCGATG GCCAAGCAGT GGGGTGCGAA CGCCGTCCGT ATCCCGGTTA ACGAAGCCTT CTGGCTGAAG GGTTCGCCGG AGTACGCCCC TGGCTACATC GCAACCGTCG AGTCGGTCGT GTCGCTCGTC GAGCACAACG GCATGATCCC GATCATCGAT CTTCACCGGG TGATCGGGGC CGACTCGGTG ACCGCGACAC CCTCGGCCAA CCCCGCGTGC GCCCCGGACG TGCCCAGTGA GACCTTCTGG CAGCAGGCCG CCTCGATCTT CAAGGGTGAT CCGAACGTCA TGTTCGAGCT CTACAACGAG CCCCACGACA TCCCATGGTC CGTGTGGCGC AACGGCGGGG CCATCACCTG TGCCGATACC GGCCAGTCCT ACACGGCCGT TGGCGAGCAG CAGCTCCTCG ATATCGTGCG CGCGACCGGC GCCGAGAACG TGGTGATCGT CGACGGCAAT CACTGGGCAG GCAACCTCGC GCCGATCGCC GAGTGGGGCC TCGCTGGTGC GAATGTCGCC TATGGCTTCC ATCTCTACGT GCACGAGAGC ACGCCCACCA CCCCGTCGCA ATGGAGCGCA TCGCTCGGAA CCGTGCCGTC GCTCGCCCCG GTCGTCGCGA CGGAGTTCGG CGTGCTGGGG TGTGCGACCC CGTATCCGAC CTCCACCGAG CAGTCGATCG TCGACTACCT CGAGCAGCAC GGCATCGGTT GGACCGCGTG GGGCTGGTTC GCCGGACCCA ACGGCGGCAG CTGCAGTTTT CCGTCGCTGA TCGCCAACGA GCAGGGAACC CCCTTCGACG GAGGGGTCGT GGTCCAGCAG CAGAGCCTGG GCCTTGCCAG TGGTGCGGTG CAGGCGAACG AGCCCGCCGC GCCCTCGGTG ACGACGGCGT CGGGCTCGAC GGTCTCGGTG CCCGCGCTCG GAGCGCTCGG TCAGGCCTAT GCGGTCGTCG ACGAGACGGC GGGCTCCGGG TCGCTCGCTC TCACGCCCGT GGGGGCCGCG TCCCCGGAGA CGATCGCACT GAGTACGGGG CAGAACCTCG TGCCGCTCGT CCAACCGGGC TCGGCCGTCG AGACGAGCAC CCCCGTGGCG CTCACCGTGC CTCAGGGTGC AACCCTCACG GGTGTGATCG ACGAGCCTTC GATCAGCGGC GTCGCGACGC CCATCGCCAT GGCGGGCACG TCGGGTGCTG CGGCCTGCAC GCCCCAGGTG CGATGGGGGA ACGCCACGAT CACGCAGACG AGTGCCGGTA TCAGCGTCAC CGCAACGAGC GGCTATCCGG CTGTCGAGCT CGTGGGGCCG AGCTGTGCGC TGAGCGGTGC CGACGGCTAC ACCGTGTCGG TGAGCTCACC CACCGGGGAT GCGCAGGTGA CGCCGTTCGC ACTGTCCTCC TCGTACCACG CCTCGTTCCT GCCTGACGCA ACCGTGGGCC AGCAGGCCAC GACCCTCGCG CTCGGTCCGG GGGTCGGAAC CTCGCCCGTC CTCGGTCTCC AGCTCGATGC GGCTACCAGT CCCGAGACCA TCGTGATCTC GAACCTGACC GGTTGGGCCC CGACGACCAC CGTGGCAGCC TCGACGATGC AGGCAAGCAC GGTCTCCTCG TCCACCTCGA CGACTTCGTC GACCTCGACG GCCTTCGCGC CCTACGTGGA CATGACCCTG CCGCCGACCG GCACCCTCGC CCAGCTCGGC TCCGAGTCCG GCGCCAAGGC GCTCACGCTG GCGTTCATTG TGAGCTCGAA GGGCACCTGC TATCCGAGCT GGGGCAACTA CTTCCCGGTG GGCCAGAACA ACGGCCTCTT CCGCAACGAG ATCGCTGCCT ACCAAGCCGA GGGTGGCACG CCCATCGTCT CCTTCGGCGG TGAGATCAAC CAGGAGCTCG CCCAGGTGTG CTCCTCGCCC CAGGCGCTCG CCCAAGCCTA CGAGACCGTC ATCAACACCT ACCACGTCTA CAACCTCGAC TTCGATATCG AGGGTTCGGA CCTGAACGAC CAGGCCGCTG TGAACCTGCG CAACCAGGCC CTCGCCCTGG TCCAGCAACA AGAAGCAGCG CAAGGCCACC CAGTGAGCGT CTCCTACACG CTGCCGGTTA TGCCGTGGGG TCTGCTCGCG AACTCGCTGT ATCTGCTGAA CTCGGCGAAG ACCTACGGTG TCGACGTGTC CAACGTCAAC GTCATGGCGA TGGACTACGG CATCCCCCAG GCCCAAGGTG CGATGGGCAC GATGGCGATC GAGGCCGCCC AGGCCACCGA GCAACAGCTC GCCTCGATCT GGTCGAACCT GTCCACCGCC CAGCTCTGGC AGATGGTCGG CGTGACCCCG ATGATAGGGC AGAACGACCT CTCGGGTGAG ATCTTCACCA CCCAAGACGC CCAGCAGCTC GGGGCCTTCG CCGAGCAGGT TGGCCTCGGC AGGCTCTCGA TGTGGGAGAT CCACCGCGAC GTCGAGTGCG CGAACAACGC CGACGAGGAC TCGAACTACT GCTCCGGCAC GACCGAGACC CCGTGGCAGT TCTCGCAGAT CTTCGAGCAG GCCGCCGGTG GGTCACTGCC CGCTCCGTCG GATCCCACGC CACCGCCGTC GGATCCCACG CCACCGCCGT CTAACCCGAC GCCCGCTCCC ACCAGCCCGA CGACGCCTGG CCCGATCTCG TTCTCCTCGG GCTCCTTGGC GGGCACCGCG ACGGTGACCT CCACGTGGTG GGGTGGTGGT CAGGTGGACG TGACCATCAA GAACACCGGC ACGGCTCCGG TCTCGGGATG GACCCTCGGG TTCACGGTGC CCTCGGGTGA GAGCATCGGG AGCCTCTGGA ACGGCACGGT CTCCGGTTCG ACCGGCACCG TCACCGTGAC GCCGGCGAGC TGGAACGGCA CCATCGAGCC GGGCGCGAGC ATCCAGGTGG GCTTCACGCT CAACGGTGGG CCCGAGAACG GCGTGTTCCC CTCGAGCTAT GAACTCAGTG GGTCGTCGAC CGCGAGTGAC CCTGCGCCGA CGACGCCGAC ACCGACGCCG ACCTCGTCCG GCCAGCTCAC CGCCACCGCG ACGGTGACCT CGACCTGGTG GGGTGGTGGT CAGGTGGACG TGACCATCAA GAACACCGGC ACGGCGCCGG TGTCGGGATG GACCCTCGGG TTCACGGTGC CCTCGGGTGA GAGCATCGGG AGCCTCTGGA ACGGCACGGT CTCCGGTTCG ACCGGCACCG TCACCGTGAC GCCGGCGAGC TGGAACGGCA CCATCGAGCC GGGCGCGAGC ATCCAGGTGG GCTTCACCAT CCAGTCATCG GCGAAGTCTG CGACGCTGCC GACGACGGTC AGCGTCAGCG CCTAG
|
Protein sequence | MALALGLVAG GIASVLGAQT ALTAAATAFP ATPPGAVSGP FQVRGNVIVD AAGAPVYLHG VDWPSLVWNP DGQWANASQS GVNPNEFVAM AKQWGANAVR IPVNEAFWLK GSPEYAPGYI ATVESVVSLV EHNGMIPIID LHRVIGADSV TATPSANPAC APDVPSETFW QQAASIFKGD PNVMFELYNE PHDIPWSVWR NGGAITCADT GQSYTAVGEQ QLLDIVRATG AENVVIVDGN HWAGNLAPIA EWGLAGANVA YGFHLYVHES TPTTPSQWSA SLGTVPSLAP VVATEFGVLG CATPYPTSTE QSIVDYLEQH GIGWTAWGWF AGPNGGSCSF PSLIANEQGT PFDGGVVVQQ QSLGLASGAV QANEPAAPSV TTASGSTVSV PALGALGQAY AVVDETAGSG SLALTPVGAA SPETIALSTG QNLVPLVQPG SAVETSTPVA LTVPQGATLT GVIDEPSISG VATPIAMAGT SGAAACTPQV RWGNATITQT SAGISVTATS GYPAVELVGP SCALSGADGY TVSVSSPTGD AQVTPFALSS SYHASFLPDA TVGQQATTLA LGPGVGTSPV LGLQLDAATS PETIVISNLT GWAPTTTVAA STMQASTVSS STSTTSSTST AFAPYVDMTL PPTGTLAQLG SESGAKALTL AFIVSSKGTC YPSWGNYFPV GQNNGLFRNE IAAYQAEGGT PIVSFGGEIN QELAQVCSSP QALAQAYETV INTYHVYNLD FDIEGSDLND QAAVNLRNQA LALVQQQEAA QGHPVSVSYT LPVMPWGLLA NSLYLLNSAK TYGVDVSNVN VMAMDYGIPQ AQGAMGTMAI EAAQATEQQL ASIWSNLSTA QLWQMVGVTP MIGQNDLSGE IFTTQDAQQL GAFAEQVGLG RLSMWEIHRD VECANNADED SNYCSGTTET PWQFSQIFEQ AAGGSLPAPS DPTPPPSDPT PPPSNPTPAP TSPTTPGPIS FSSGSLAGTA TVTSTWWGGG QVDVTIKNTG TAPVSGWTLG FTVPSGESIG SLWNGTVSGS TGTVTVTPAS WNGTIEPGAS IQVGFTLNGG PENGVFPSSY ELSGSSTASD PAPTTPTPTP TSSGQLTATA TVTSTWWGGG QVDVTIKNTG TAPVSGWTLG FTVPSGESIG SLWNGTVSGS TGTVTVTPAS WNGTIEPGAS IQVGFTIQSS AKSATLPTTV SVSA
|
| |