Gene Cfla_2913 details


Gene Information

Locus tag: Cfla_2913
Symbol:
ID: 9146825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3224579
End bp: 3225988
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: 1, 4-beta cellobiohydrolase
Protein accession: YP_003637995
Protein GI: 296130745
COG category:
COG ID:
TIGRFAM ID:
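
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3224579, 3225988)
print(length)                           # 1410
print(expected_protein_length(length))  # 469
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.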


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.489381
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.473063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCCCC GCCCGAAGCC GGCGCTCGCC GCCGGTCGCA AGGCCGTCGC GGTGCTCGCC 
GCCGGCGCGG TGCTCGCCGC CGGCACGACC GTCCTGGCCT CCACCGCGGC GAACGCCGCC
GCCGGCTGCC GCGTCGACTA CGCGGTCACC AACGAGTGGC CCGGCGGTTT CGGCGCCAGC
ATCAACGTCA CGAACCTCGG CGACCCGCTG TCGTCGTGGG ACCTGCGCTG GACGTTCCCC
AGCGGCCAGT CGATCCAGCA GCTCTGGAAC GGCGCGGCCT CGTCGAGCGG CTCGCAGGTC
ACGGTCGTGA ACTCCCCGTG GAACGGCTCG GTCGGCACCA ACGGCGTCAT CTCGCTCGGG
TTCAACGGCT CGTGGAACGG CTCGAACGCC AAGCCCACGT CGTTCACGCT CAACGGCACG
GCGTGCACCG GTTCGGTGTC CGGCGGCCAG CCGACCCAGC AGCCGACCCA GCAGCCGACG
CAGCAGCCCA CCCAGCAGCC GACGCAGCAG CCCACCCAGC AGCCGACGCA GCAGCCCACC
CAGCAGCCCA CCCAGCAGCC GCAGCCCAGC GGCGACTTCT ACGTCGACCC CGAGACGGCG
GCCTACGCCG CCTGGCAGGC CGCGTCGGGC AACGACAAGG TGCTGCTGGC GAAGATCGCG
CAGACGCCGC AGGCGCTGTG GATCGGCGAC TGGTCCAGCG CGTCCGTCAT CCAGCAGCAG
GTCCGCGACT ACACCGGCAA GGCGCGTTCC GCCGGCAAGA TCGCGCAGAT CGTCGTGTAC
GCCATCCCGG GCCGTGACTG CGGCAACTAC TCGGGCGGCG GCGTCGCCAC CTCGGAGTAC
GCGCGCTGGG TCGACACCGT CGCGCAGGGC GTCCAGGGCA ACCCGTGGGT GATCCTCGAG
CCCGACGCGC TCGCGCAGCT CGGCGACTGC CAGGGCCAGG GCGACCGGGT CGGCTTCCTG
CAGTACGCGG CCAAGGCGTT CGCCGCCAAG GGTGCGCGCG TGTACATCGA CGCCGGCAAC
TCGGCGTGGC TCTCCGCGTC GGAGGCGGCC AACCGCCTCA ACCGCGTCGG CTGGGACGGT
GCCGTCGGCT TCTCGCTCAA CGTCTCCAAC TACCGCACGA CGGCCGAGGC CAAGGCGTAC
GGCCAGGAGA TCTCCCGCCT CACCGGTGGC AAGAAGTTCG TCATCGACAC GTCGCGGAAC
GGCAACGGCG CGTCGGGCTC CGAGTGGTGC AACCCGAGCG GGCGCGCCCT GGGCGACCGC
CCGACCCGCG TCAACGACGG CAGCGGGCTC GACGCGCTGC TGTGGATCAA GCGTCCCGGC
GAGTCGGACG GCACCTGCAA CGGCGGCCCG GCGGCCGGCG CCTGGTGGCA GTCGATGGCC
CTGGAGCTCG CGCGCAACGC CAAGTGGTGA
 
Protein sequence
MSPRPKPALA AGRKAVAVLA AGAVLAAGTT VLASTAANAA AGCRVDYAVT NEWPGGFGAS 
INVTNLGDPL SSWDLRWTFP SGQSIQQLWN GAASSSGSQV TVVNSPWNGS VGTNGVISLG
FNGSWNGSNA KPTSFTLNGT ACTGSVSGGQ PTQQPTQQPT QQPTQQPTQQ PTQQPTQQPT
QQPTQQPQPS GDFYVDPETA AYAAWQAASG NDKVLLAKIA QTPQALWIGD WSSASVIQQQ
VRDYTGKARS AGKIAQIVVY AIPGRDCGNY SGGGVATSEY ARWVDTVAQG VQGNPWVILE
PDALAQLGDC QGQGDRVGFL QYAAKAFAAK GARVYIDAGN SAWLSASEAA NRLNRVGWDG
AVGFSLNVSN YRTTAEAKAY GQEISRLTGG KKFVIDTSRN GNGASGSEWC NPSGRALGDR
PTRVNDGSGL DALLWIKRPG ESDGTCNGGP AAGAWWQSMA LELARNAKW
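
The sequences above are laid out in blocks of ten characters, so they need the whitespace removed before use. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names are hypothetical. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares with table 1; the alternative start codons of table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# NCBI amino-acid string for the standard code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate("ATGTCCCCCC GCCCGAAGCC GGCGCTCGCC"))  # MSPRPKPALA
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and with the GC content field of the record.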