Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1896 |
Symbol | |
ID | 9145789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2109167 |
End bp | 2111107 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | 1, 4-beta cellobiohydrolase |
Protein accession | YP_003636992 |
Protein GI | 296129742 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.776414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCACAC ACGGCAATCG AACGGCCGGG CGGCGCTTGC GCGCCGTCGC GACCGCTGCG ACGGCGACGG CGCTCGTCGC GGTGCCGCTG ACGCTCGCGA GCACGACCGC GACCGCGGCC GAAGCCCACG TCGACAACCC CTACGCCGGC GCCCGGCAGT ACGTGAACCC GAACTGGGCA GCGACGGTTG AGAGCGCTGC CACGCGCGCC GAGAGCTCGA CCCTCGCCGC GCAGATCCGC ACGGTGGCCA AGCAGCCCAC CGCCGTCTGG ATGGACCGCA GCAGCGCCAT CACCGGCAAC GCCGACGGCC CCGGCCTGAA GTTCCACCTC GACGAGGCCG TCAAGCAGAA GGCGGCGGGC AGCACGCCGC TCGTCTTCAA CCTCGTCATC TACAACCTGC CGGGCCGCGA CTGCTTCGCT CTCGCGTCGA ACGGTGAGCT CCCCGCCACC GACGCCGGCA TGGAGCGCTA CAAGACCGAG TACATCGACC CCATCGTCGA CCTGCTCTCG GACCCGAAGT ACGCGGACAT CCGGGTCGCG GCGACGATCG AGCCGGACTC GCTCCCGAAC CTCATCACGA ACATCTCGGA GAGCACCTGC CAGAAGTCCG CGCCGTACTA CCGCGAGGGC GTCAAGTACG CGCTGGACGA GCTGAAGACG CTCGACAACG TGTACACGTA CCTCGACGCG GCCCACTCGG GCTGGCTCGG CTGGGAGTCG AACTCGGGCC CGACCGCCAA GCTGTTCGCC GAGGTCGCGA AGAGCACCAA GAAGGGCTTC GCGTCGGTCG ACGGCTTCGT CACGAACACG GCGAACACCA CGCCGCTGGC CGAGCCGTTC CTCACGGACC CGACCCTCAA CGTCGGTGGC GTGCCGGTCC GCTCGGCCAA GTTCTACGAG TGGAACCCGG ACTTCGGTGA GCACGCGTGG ACGGCGCAGC TGCACCGTCT GCTCGTCGCC GAGGGCTTCC CGGCCTCGAC CGGCATGCTC ATCGACACGT CCCGCAACGG CTGGGGCGGC CCGGACCGTC CGACCAAGGC GTCGACGAGC ACCAACGTCG ACACCTACGT CAACGAGTCG CGCATCGACC GCCGCACCCA CCGCGGCGCG TGGTGCAACC CGCTGGGCGC CGGCATCGGC GAGCTCCCGC AGGCCACGCC GGCCGGTGCG CCGTCCGCGT CGCACCTCGA CGCGTACGTC TGGATCAAGC CCCCGGGCGA GTCCGACGGT GCCTCGAAGG AGATCCCGAA CGACGAGGGC AAGAGCTTCG ACCGCATGTG CGACCCGACC TACGTGGCGT CCAAGCTGTC GAACAACCTC ACGGGTGCCA CGCCCGACGC GCCGGTCTCC GGCAAGTGGT TCGAGGCGCA GTTCATGACG CTGGTCAAGA ACGCGTACCC GGTGATCACC CCGGACAACG GCTCGACGCC CACGCCCACG CCGACCCCGT CGGTCACGCC GTCGCCGACC CCGTCGGTGA CCCCGTCCCC CACGCCGTCG GTCACGCCGT CGCCGACCCC GTCGGTCACG CCGTCCCCCA CGCCGTCGGT GACGCCGTCG CCGACCCCGT CCCCGACGGT CAGCCCGACG CCGTCGCCCA CCCCGTCGCC GACCCAGAAC CCGGGTGGCG TGTGCACGGT GAGCTACACG GCCAACGCGT GGAACACCGG CTTCACGGCC TCGGTCCGCG TGACCAACAA GGGCGCGGCC CTGTCCAGCT GGAACCTGAC GTTCGACCTG CCGGCCGGCC AGTCCGTCCA GCAGGGCTGG AGCGCCAAGT GGGCCCAGTC GGGCCAGACC GTGACGGTGA GCAACGAGGC GTGGAACGGC AACCTGGGTG CCAACGCCAC GGTGGACATC GGCTTCAACG GCAGCCACAA CGGCAACGGC AACAGCGCCA AGCCGACGCA GTTCAAGCTG AACGGCGCAG CCTGCTCCTG A
|
Protein sequence | MSTHGNRTAG RRLRAVATAA TATALVAVPL TLASTTATAA EAHVDNPYAG ARQYVNPNWA ATVESAATRA ESSTLAAQIR TVAKQPTAVW MDRSSAITGN ADGPGLKFHL DEAVKQKAAG STPLVFNLVI YNLPGRDCFA LASNGELPAT DAGMERYKTE YIDPIVDLLS DPKYADIRVA ATIEPDSLPN LITNISESTC QKSAPYYREG VKYALDELKT LDNVYTYLDA AHSGWLGWES NSGPTAKLFA EVAKSTKKGF ASVDGFVTNT ANTTPLAEPF LTDPTLNVGG VPVRSAKFYE WNPDFGEHAW TAQLHRLLVA EGFPASTGML IDTSRNGWGG PDRPTKASTS TNVDTYVNES RIDRRTHRGA WCNPLGAGIG ELPQATPAGA PSASHLDAYV WIKPPGESDG ASKEIPNDEG KSFDRMCDPT YVASKLSNNL TGATPDAPVS GKWFEAQFMT LVKNAYPVIT PDNGSTPTPT PTPSVTPSPT PSVTPSPTPS VTPSPTPSVT PSPTPSVTPS PTPSPTVSPT PSPTPSPTQN PGGVCTVSYT ANAWNTGFTA SVRVTNKGAA LSSWNLTFDL PAGQSVQQGW SAKWAQSGQT VTVSNEAWNG NLGANATVDI GFNGSHNGNG NSAKPTQFKL NGAACS
|
| |