Gene Cfla_2912 details

Gene Information

Locus tag: Cfla_2912
Symbol:
ID: 9146824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3222752
End bp: 3224227
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: 1, 4-beta cellobiohydrolase
Protein accession: YP_003637994
Protein GI: 296130744
COG category:
COG ID:
TIGRFAM ID:
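
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3222752
END_BP = 3224227
GENE_LENGTH_BP = 1476
PROTEIN_LENGTH_AA = 491


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 1476
print(coding_codons(GENE_LENGTH_BP))  # 491
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.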


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.426935
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCCCC GCTCGAAGAG ACCCCTCACC ACCAGACGCA AGGTCGTCGC GGCCGTCGCG 
GCCGGAGCCG TCCTCGCCGG CGGCGTCACC GCCCTGACCT CGAGCATCGC GCAGGCCGCC
GCCGGCTGCC GCGTCGACTA CGCCGTGACG AGCCAGTGGC CCGGTGGCTT CGGTGCAGCC
GTCACCGTCA CGAACCTCGG CGACCCGCTC TCGTCCTGGG AGCTGAGCTG GACGTTCCCC
GACGGCCAGG GCGTGCAGCA GCTCTGGAAC GGCGTGCACT CGACCTCCGG TTCGAACGTC
ACCGTGAAGA ACATGTCGTG GAACGGTTCG GTCGGCACCA ACGCCAGCGT CCAGGTCGGC
TTCAACGGCT CCTGGAACGG CGCGAACAAC GCGCCGACGT CCTTCACGCT CAACGGCACC
TCGTGCAACG GTGCGGTCGG TGGCCCGACG ACGGAGCCGA CGCCCGAGCC GACCCCGGAG
CCCACGCCCG AGCCGACGCC GGAGCCGACG CCCGAGCCGA CGCCGGAGCC CACGCCCGAG
CCGACGCCGG AGCCCACGCC CGAGCCGACC CCGGAGCCCA CGCCCGAGCC CACGCCCGAG
CCCACGCCCG AGCCCACGAT GCCGCCGGTC CAGGCCGGTC AGTTCCACGT CGACACCACG
AACCAGTCGT ACCGCGCCTG GCAGGCGGCC AGCGGCTCCG ACAAGGACCT GCTGGCGAAG
ATCGCCCTGA CGCCGCAGGC GTACTGGGTC GGCAACTGGA ACGAGGCCTC GCACGCGCAG
CAGGAGGTCC GTGACATCAC GTCGGCCGCT GCGGCCGCCG GCAGGACCGC CGTGCTCGTC
GTCTACGCCA TCCCGGGCCG CGACTGCGGC CAGCACTCCA GCGGCGGCGT GTCGACCTCC
GAGTACGCGC AGTGGATCGA CACGGTCGCC CAGGGCATCG TCGGCAACCC GTGGGTGGTC
CTCGAGCCCG ACGCGCTGCC GATGCTCGGC GACTGCGACG GCCAGGGCGA CCGGGTCGGC
TTCCTCAAGT ACGCCGCGAA GTCCCTGACC GCCAAGGGTG CGCGCGTCTA CATCGACGCC
GGCCACTCGG CGTGGCTGTC GCCGTCGGAG GCCGCGAACC GCCTCAACCA GATCGGGTTC
GAGGACGCCG TGGGCTTCTC GATCAACGTC TCCAACTACC GCACGACGGC GGAGTCGAAG
ACCTGGGGTC AGCAGGTCTC GCAGCTGACC GGTGGCAAGA AGTTCGTCAT CGACACGTCG
CGCAACGGCA ACGGCCCGTC CGGGTCGGAG TGGTGCAACC CGAGCGGCCG CGCCCTCGGC
GAGCGCCCGA CGCTCGTGAA CGACGGCAGC GGGCTCGACG CGCTGCTGTG GATCAAGCTG
CCCGGTGAGT CGGACGGCGC CTGCAACGGC GGCCCGGGCG CCGGTCAGTG GTGGCAGTCG
ATGGCACTGG AGCTGGCGCG CAACGCGAAG TGGTGA
 
Protein sequence
MHPRSKRPLT TRRKVVAAVA AGAVLAGGVT ALTSSIAQAA AGCRVDYAVT SQWPGGFGAA 
VTVTNLGDPL SSWELSWTFP DGQGVQQLWN GVHSTSGSNV TVKNMSWNGS VGTNASVQVG
FNGSWNGANN APTSFTLNGT SCNGAVGGPT TEPTPEPTPE PTPEPTPEPT PEPTPEPTPE
PTPEPTPEPT PEPTPEPTPE PTPEPTMPPV QAGQFHVDTT NQSYRAWQAA SGSDKDLLAK
IALTPQAYWV GNWNEASHAQ QEVRDITSAA AAAGRTAVLV VYAIPGRDCG QHSSGGVSTS
EYAQWIDTVA QGIVGNPWVV LEPDALPMLG DCDGQGDRVG FLKYAAKSLT AKGARVYIDA
GHSAWLSPSE AANRLNQIGF EDAVGFSINV SNYRTTAESK TWGQQVSQLT GGKKFVIDTS
RNGNGPSGSE WCNPSGRALG ERPTLVNDGS GLDALLWIKL PGESDGACNG GPGAGQWWQS
MALELARNAK W
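
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, assuming that layout, the snippet below joins the blocks into one continuous string and computes the GC fraction. The helpers `clean_sequence` and `gc_content` are hypothetical names introduced here, and the demonstration uses only the first row of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = (
    "ATGCACCCCC GCTCGAAGAG ACCCCTCACC "
    "ACCAGACGCA AGGTCGTCGC GGCCGTCGCG"
)

seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_content(seq):.0%}")    # GC fraction of this row only
```

Pasting the full gene sequence into `clean_sequence` gives a string whose length can be compared with the Gene Length field, and whose GC fraction can be compared with the GC content field.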