Gene Information |
Locus tag | Cfla_2912 |
Symbol | |
ID | 9146824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3222752 |
End bp | 3224227 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | 1,4-beta cellobiohydrolase |
Protein accession | YP_003637994 |
Protein GI | 296130744 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
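The coordinate, gene length, and protein length fields in the table above agree with one another, and a short check makes the arithmetic explicit. The following is an illustrative Python sketch, not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. All variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 3222752
end_bp = 3224227
reported_gene_length = 1476      # bp
reported_protein_length = 491    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions the span covers 492 codons, of which 491 encode amino acids.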
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.426935 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCCCC GCTCGAAGAG ACCCCTCACC ACCAGACGCA AGGTCGTCGC GGCCGTCGCG GCCGGAGCCG TCCTCGCCGG CGGCGTCACC GCCCTGACCT CGAGCATCGC GCAGGCCGCC GCCGGCTGCC GCGTCGACTA CGCCGTGACG AGCCAGTGGC CCGGTGGCTT CGGTGCAGCC GTCACCGTCA CGAACCTCGG CGACCCGCTC TCGTCCTGGG AGCTGAGCTG GACGTTCCCC GACGGCCAGG GCGTGCAGCA GCTCTGGAAC GGCGTGCACT CGACCTCCGG TTCGAACGTC ACCGTGAAGA ACATGTCGTG GAACGGTTCG GTCGGCACCA ACGCCAGCGT CCAGGTCGGC TTCAACGGCT CCTGGAACGG CGCGAACAAC GCGCCGACGT CCTTCACGCT CAACGGCACC TCGTGCAACG GTGCGGTCGG TGGCCCGACG ACGGAGCCGA CGCCCGAGCC GACCCCGGAG CCCACGCCCG AGCCGACGCC GGAGCCGACG CCCGAGCCGA CGCCGGAGCC CACGCCCGAG CCGACGCCGG AGCCCACGCC CGAGCCGACC CCGGAGCCCA CGCCCGAGCC CACGCCCGAG CCCACGCCCG AGCCCACGAT GCCGCCGGTC CAGGCCGGTC AGTTCCACGT CGACACCACG AACCAGTCGT ACCGCGCCTG GCAGGCGGCC AGCGGCTCCG ACAAGGACCT GCTGGCGAAG ATCGCCCTGA CGCCGCAGGC GTACTGGGTC GGCAACTGGA ACGAGGCCTC GCACGCGCAG CAGGAGGTCC GTGACATCAC GTCGGCCGCT GCGGCCGCCG GCAGGACCGC CGTGCTCGTC GTCTACGCCA TCCCGGGCCG CGACTGCGGC CAGCACTCCA GCGGCGGCGT GTCGACCTCC GAGTACGCGC AGTGGATCGA CACGGTCGCC CAGGGCATCG TCGGCAACCC GTGGGTGGTC CTCGAGCCCG ACGCGCTGCC GATGCTCGGC GACTGCGACG GCCAGGGCGA CCGGGTCGGC TTCCTCAAGT ACGCCGCGAA GTCCCTGACC GCCAAGGGTG CGCGCGTCTA CATCGACGCC GGCCACTCGG CGTGGCTGTC GCCGTCGGAG GCCGCGAACC GCCTCAACCA GATCGGGTTC GAGGACGCCG TGGGCTTCTC GATCAACGTC TCCAACTACC GCACGACGGC GGAGTCGAAG ACCTGGGGTC AGCAGGTCTC GCAGCTGACC GGTGGCAAGA AGTTCGTCAT CGACACGTCG CGCAACGGCA ACGGCCCGTC CGGGTCGGAG TGGTGCAACC CGAGCGGCCG CGCCCTCGGC GAGCGCCCGA CGCTCGTGAA CGACGGCAGC GGGCTCGACG CGCTGCTGTG GATCAAGCTG CCCGGTGAGT CGGACGGCGC CTGCAACGGC GGCCCGGGCG CCGGTCAGTG GTGGCAGTCG ATGGCACTGG AGCTGGCGCG CAACGCGAAG TGGTGA
|
Protein sequence | MHPRSKRPLT TRRKVVAAVA AGAVLAGGVT ALTSSIAQAA AGCRVDYAVT SQWPGGFGAA VTVTNLGDPL SSWELSWTFP DGQGVQQLWN GVHSTSGSNV TVKNMSWNGS VGTNASVQVG FNGSWNGANN APTSFTLNGT SCNGAVGGPT TEPTPEPTPE PTPEPTPEPT PEPTPEPTPE PTPEPTPEPT PEPTPEPTPE PTPEPTMPPV QAGQFHVDTT NQSYRAWQAA SGSDKDLLAK IALTPQAYWV GNWNEASHAQ QEVRDITSAA AAAGRTAVLV VYAIPGRDCG QHSSGGVSTS EYAQWIDTVA QGIVGNPWVV LEPDALPMLG DCDGQGDRVG FLKYAAKSLT AKGARVYIDA GHSAWLSPSE AANRLNQIGF EDAVGFSINV SNYRTTAESK TWGQQVSQLT GGKKFVIDTS RNGNGPSGSE WCNPSGRALG ERPTLVNDGS GLDALLWIKL PGESDGACNG GPGAGQWWQS MALELARNAK W
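The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal Python sketch, assuming the gene sequence is listed 5' to 3' in coding orientation (consistent with its ATG start and TGA stop, even though the gene lies on the minus strand) and that the spaces only separate blocks of ten characters. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so a standard codon table is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases are used to keep the example short.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11
# shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGCACCCCC GCTCGAAGAG ACCCCTCACC"
print(translate(prefix))
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the protein sequence, MHPRSKRPLT. Passing the full gene sequence to the same functions would be the way to compare against the reported protein length and GC content.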