Gene Information |
Locus tag | Cfla_0172 |
Symbol | |
ID | 9144038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 206996 |
End bp | 208084 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | chitin-binding domain 3 protein |
Protein accession | YP_003635290 |
Protein GI | 296128040 |
COG category | |
COG ID | |
TIGRFAM ID | |
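
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention explicitly. The function names are hypothetical helpers, not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(cds_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one trailing stop codon."""
    if cds_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_len_bp // 3 - 1


# Values copied from the record above.
length = cds_length(206996, 208084)
print(length, expected_protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.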
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.808366 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00471086 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTCATCC CCACCCGGAG CCGGTTCGGC CGACTCGCTC GGCTCGCCCT CGCCGTCCCC CTCGCCCTCG CGGCCACCGG CATCGTCGCC ACGTCCGCCT CCGCCCACGG CTCCGTCACC GACCCGCCGT CGCGCAACTA CGGCTGCTGG GAGCGCGAGG GCGGCACGCA CATGGACCCC GCCATGGCGC AGCGCGACCC CATGTGCTGG CAGGCCTTCC AGGCCAACCC CAACACCATG TGGAACTGGA ACGGCAACTT CCGTGAGGGC GTCGGCGGCC GCCACGAGCA GGTCATCCCC GACGACCAGC TCTGCTCGGC CGGCAAGACG CAGAACGGCC TGTACGCGTC GCTCGACACC CCCGGCCCGT GGATCATGAA GACGGTCCCG CACAACTTCA CGCTCACGCT GACGGACGGC GCCATGCACG GTGCCGACTA CATGCGCATC TACGTGTCGA AGGCGGGTTA CGACCCGACG ACCGACCCGC TGGGCTGGGA CGACATCGAG CTGATCAAGG AGACGGGCCG CTACGGCACG ACCGGTCTCT ACCAGGCGGA CGTCTCCATC CCGTCCAACC GCACGGGCCG CGCGGTGCTG TTCACGATCT GGCAGGCCTC GCACCTCGAC CAGCCGTACT ACATCTGCTC GGACATCAAC ATCAACGGGA CCGCGCCGAC GCAGCAGCCG ACGCAGCAGC CGACGCAGCA GCCCACCCAG CAGCCGACGC AGCAGCCCAC CCAGCAGCCC ACCCAGCAGC CGACGCAGCA GCCGACGCAG CAGCCCACGC AGCAGCCCAC GCAGAACCCG GGCACCGGTG CCTGCACCGC GACGGTCAAG GCCGCCAGCA CGTGGGGCAA CGGCTGGCAG GGTGAGGTCA CCGTGACGGC CGGCTCCAGC GCGATCAACG GCTGGAAGGT CACCGTCGGT GGCGCGTCGA TCACGCAGGC ATGGAGCGGC TCCTACAGCG GTGGGACGTT CTCCAACGCC GAGTGGAACG GCAAGCTCGC GGCAGGTGCC TCGACGACGG CCGGCTTCAT CGCCTCGGGT ACGCCCGGCA CGCTGACGGC CACCTGCACC GCGGCCTGA
|
Protein sequence | MFIPTRSRFG RLARLALAVP LALAATGIVA TSASAHGSVT DPPSRNYGCW EREGGTHMDP AMAQRDPMCW QAFQANPNTM WNWNGNFREG VGGRHEQVIP DDQLCSAGKT QNGLYASLDT PGPWIMKTVP HNFTLTLTDG AMHGADYMRI YVSKAGYDPT TDPLGWDDIE LIKETGRYGT TGLYQADVSI PSNRTGRAVL FTIWQASHLD QPYYICSDIN INGTAPTQQP TQQPTQQPTQ QPTQQPTQQP TQQPTQQPTQ QPTQQPTQNP GTGACTATVK AASTWGNGWQ GEVTVTAGSS AINGWKVTVG GASITQAWSG SYSGGTFSNA EWNGKLAAGA STTAGFIASG TPGTLTATCT AA
|
| |
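
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence. The helper names are hypothetical, and how the database rounds the reported percentage is an assumption not confirmed by the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Demo on the first two display blocks of the gene sequence only;
# paste the full sequence to compare against the reported GC content.
demo = "ATGTTCATCC CCACCCGGAG"
print(len(clean_sequence(demo)), round(gc_fraction(demo) * 100))
```

Passing the full Gene sequence string to `gc_fraction` gives a value that can be compared with the GC content field, and `len(clean_sequence(...))` gives one that can be compared with the Gene Length field.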