Gene Cfla_1896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_1896 
Symbol 
ID9145789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2109167 
End bp2111107 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content70% 
IMG OID 
Product1, 4-beta cellobiohydrolase 
Protein accessionYP_003636992 
Protein GI296129742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.776414 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCACAC ACGGCAATCG AACGGCCGGG CGGCGCTTGC GCGCCGTCGC GACCGCTGCG 
ACGGCGACGG CGCTCGTCGC GGTGCCGCTG ACGCTCGCGA GCACGACCGC GACCGCGGCC
GAAGCCCACG TCGACAACCC CTACGCCGGC GCCCGGCAGT ACGTGAACCC GAACTGGGCA
GCGACGGTTG AGAGCGCTGC CACGCGCGCC GAGAGCTCGA CCCTCGCCGC GCAGATCCGC
ACGGTGGCCA AGCAGCCCAC CGCCGTCTGG ATGGACCGCA GCAGCGCCAT CACCGGCAAC
GCCGACGGCC CCGGCCTGAA GTTCCACCTC GACGAGGCCG TCAAGCAGAA GGCGGCGGGC
AGCACGCCGC TCGTCTTCAA CCTCGTCATC TACAACCTGC CGGGCCGCGA CTGCTTCGCT
CTCGCGTCGA ACGGTGAGCT CCCCGCCACC GACGCCGGCA TGGAGCGCTA CAAGACCGAG
TACATCGACC CCATCGTCGA CCTGCTCTCG GACCCGAAGT ACGCGGACAT CCGGGTCGCG
GCGACGATCG AGCCGGACTC GCTCCCGAAC CTCATCACGA ACATCTCGGA GAGCACCTGC
CAGAAGTCCG CGCCGTACTA CCGCGAGGGC GTCAAGTACG CGCTGGACGA GCTGAAGACG
CTCGACAACG TGTACACGTA CCTCGACGCG GCCCACTCGG GCTGGCTCGG CTGGGAGTCG
AACTCGGGCC CGACCGCCAA GCTGTTCGCC GAGGTCGCGA AGAGCACCAA GAAGGGCTTC
GCGTCGGTCG ACGGCTTCGT CACGAACACG GCGAACACCA CGCCGCTGGC CGAGCCGTTC
CTCACGGACC CGACCCTCAA CGTCGGTGGC GTGCCGGTCC GCTCGGCCAA GTTCTACGAG
TGGAACCCGG ACTTCGGTGA GCACGCGTGG ACGGCGCAGC TGCACCGTCT GCTCGTCGCC
GAGGGCTTCC CGGCCTCGAC CGGCATGCTC ATCGACACGT CCCGCAACGG CTGGGGCGGC
CCGGACCGTC CGACCAAGGC GTCGACGAGC ACCAACGTCG ACACCTACGT CAACGAGTCG
CGCATCGACC GCCGCACCCA CCGCGGCGCG TGGTGCAACC CGCTGGGCGC CGGCATCGGC
GAGCTCCCGC AGGCCACGCC GGCCGGTGCG CCGTCCGCGT CGCACCTCGA CGCGTACGTC
TGGATCAAGC CCCCGGGCGA GTCCGACGGT GCCTCGAAGG AGATCCCGAA CGACGAGGGC
AAGAGCTTCG ACCGCATGTG CGACCCGACC TACGTGGCGT CCAAGCTGTC GAACAACCTC
ACGGGTGCCA CGCCCGACGC GCCGGTCTCC GGCAAGTGGT TCGAGGCGCA GTTCATGACG
CTGGTCAAGA ACGCGTACCC GGTGATCACC CCGGACAACG GCTCGACGCC CACGCCCACG
CCGACCCCGT CGGTCACGCC GTCGCCGACC CCGTCGGTGA CCCCGTCCCC CACGCCGTCG
GTCACGCCGT CGCCGACCCC GTCGGTCACG CCGTCCCCCA CGCCGTCGGT GACGCCGTCG
CCGACCCCGT CCCCGACGGT CAGCCCGACG CCGTCGCCCA CCCCGTCGCC GACCCAGAAC
CCGGGTGGCG TGTGCACGGT GAGCTACACG GCCAACGCGT GGAACACCGG CTTCACGGCC
TCGGTCCGCG TGACCAACAA GGGCGCGGCC CTGTCCAGCT GGAACCTGAC GTTCGACCTG
CCGGCCGGCC AGTCCGTCCA GCAGGGCTGG AGCGCCAAGT GGGCCCAGTC GGGCCAGACC
GTGACGGTGA GCAACGAGGC GTGGAACGGC AACCTGGGTG CCAACGCCAC GGTGGACATC
GGCTTCAACG GCAGCCACAA CGGCAACGGC AACAGCGCCA AGCCGACGCA GTTCAAGCTG
AACGGCGCAG CCTGCTCCTG A
 
Protein sequence
MSTHGNRTAG RRLRAVATAA TATALVAVPL TLASTTATAA EAHVDNPYAG ARQYVNPNWA 
ATVESAATRA ESSTLAAQIR TVAKQPTAVW MDRSSAITGN ADGPGLKFHL DEAVKQKAAG
STPLVFNLVI YNLPGRDCFA LASNGELPAT DAGMERYKTE YIDPIVDLLS DPKYADIRVA
ATIEPDSLPN LITNISESTC QKSAPYYREG VKYALDELKT LDNVYTYLDA AHSGWLGWES
NSGPTAKLFA EVAKSTKKGF ASVDGFVTNT ANTTPLAEPF LTDPTLNVGG VPVRSAKFYE
WNPDFGEHAW TAQLHRLLVA EGFPASTGML IDTSRNGWGG PDRPTKASTS TNVDTYVNES
RIDRRTHRGA WCNPLGAGIG ELPQATPAGA PSASHLDAYV WIKPPGESDG ASKEIPNDEG
KSFDRMCDPT YVASKLSNNL TGATPDAPVS GKWFEAQFMT LVKNAYPVIT PDNGSTPTPT
PTPSVTPSPT PSVTPSPTPS VTPSPTPSVT PSPTPSVTPS PTPSPTVSPT PSPTPSPTQN
PGGVCTVSYT ANAWNTGFTA SVRVTNKGAA LSSWNLTFDL PAGQSVQQGW SAKWAQSGQT
VTVSNEAWNG NLGANATVDI GFNGSHNGNG NSAKPTQFKL NGAACS