Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2848 |
Symbol | |
ID | 9146756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3163841 |
End bp | 3165403 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_003637931 |
Protein GI | 296130681 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.724859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0053695 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGATCAG CCCGTCCCAC CGGAGCAGCG GCACCTGCGC GCCGCCGCTG GGCGACCGCG CTCGTCGCGG TCGCGACGGC CGTGTCGGCC GGTGCCGCCG CGCTCGTCGC CGCACCTGCG GCGTCAGCGG CCACGGTCGA CACGACCGCG TCCTACCTGC TGATCAACCG CAACAGCGGC AAGGCCCTCG ACGTCTACAA CCTCGCCATG AACGACGGTG CGCGCATCAC GCAGTGGTCG CGCAACGACG GCACGCAGCA GCAGTGGCAG TTCGTCGACT CGGGCAACGG CTACTACCGC ATCAAGTCGG TCCTGTCGGG CAAGGTGCTC GACGTCTACA ACTGGTCGAC GTCGAACGGT GGCTCGATCG TCCAGTACAC CGACCGCAAC CAGAACAACC AGCAGTGGCG CCTGAACGAC GGCCCCAACG GCACGGTCAT CCTGGTCAGC AGGCACTCCA ACCTCGCGCT CGAGGTCCAG GGCGCGTCGA CGGCCGACGG TGCCAACATC GTCCAGTACA GCAACTGGGG CGGCACCAAC CAGCAGTGGC AGCTCGTGCG CGTCGGTGGT GGGGGCGGCC AGCCCACGCA GCAGCCGACC CAGCAGCCGA CGCAGCAGCC GACGGCGCAG CCGACCCAGC AGCCCACCTC GTCGCCGACG TGCAACCTGC CGTCGTCGTA CCGCTGGCGC GACTCCGGCG TGCTGGCGCA GCCGCGCCAG GGCTGGGTGT CGCTGAAGGA CTTCACGGTC GCGCCGTACA ACGGCCAGCA GCTCGTGTAC GCCACGACCA ACACCGGCAC CGCGTGGCAG TCGACGATGT TCTCGCCGTT CTCGAGCTAC TCGCAGATGG GCTCCGCGCA GCAGCAGACC ATGCCGTTCA CCGCCGTCGC GCCGTCGCTG TTCTACTTCG CACCCAAGAA CATCTGGGTG ATGGCGTACC AGTGGGGCGG CCCGGCGTTC TCCTACCGCA CGTCGAGCAA CCCGTCGAAC GTCAACGGCT GGAGCGCGCA CCAGACGCTG TTCACCGGCT CGATCTCGAA CTCCGGCACC GGTCCGATCG ACCAGGCGCT GATCGGTGAC GACCGGAACA TGTACCTGTT CTTCGCCGGC GACAACGGGC GGATCTACCG CGCGTCGATG CCGATCGGCA ACTTCCCCGG CTCGTTCGGG TCGAACTACC AGACGATCAT GACCGACACG CAGGCCAACC TCTTCGAGGC CGTGCAGGTC TACAAGCTGG CCGGTCAGCA GCGCTACCTC ATGATCGTCG AGGCCATCGG GTCGCAGGGC CGCTACTTCC GGTCCTTCAC CGCCACGAGC CTGGACGGCC AGTGGACCCC GCAGGCCGCG TCCGAGTCGA ACCCGTTCGC CGGCAAGGCC AACTCCGGTG CGACGTGGAC GAACGACATC AGCCACGGCG AGCTGCTGCG CACCAGCGCG GACCAGACCA TGACGGTCGA CCCCTGCAAC CTGCAGCTGC TCTACCAGGG CCGCTCCCCG CAGTCCGGCG GCGACTACGG CGCCCTGCCG TACCGCCCCG GCCTGCTGAC CCTGCAGCGG TAA
|
Protein sequence | MRSARPTGAA APARRRWATA LVAVATAVSA GAAALVAAPA ASAATVDTTA SYLLINRNSG KALDVYNLAM NDGARITQWS RNDGTQQQWQ FVDSGNGYYR IKSVLSGKVL DVYNWSTSNG GSIVQYTDRN QNNQQWRLND GPNGTVILVS RHSNLALEVQ GASTADGANI VQYSNWGGTN QQWQLVRVGG GGGQPTQQPT QQPTQQPTAQ PTQQPTSSPT CNLPSSYRWR DSGVLAQPRQ GWVSLKDFTV APYNGQQLVY ATTNTGTAWQ STMFSPFSSY SQMGSAQQQT MPFTAVAPSL FYFAPKNIWV MAYQWGGPAF SYRTSSNPSN VNGWSAHQTL FTGSISNSGT GPIDQALIGD DRNMYLFFAG DNGRIYRASM PIGNFPGSFG SNYQTIMTDT QANLFEAVQV YKLAGQQRYL MIVEAIGSQG RYFRSFTATS LDGQWTPQAA SESNPFAGKA NSGATWTNDI SHGELLRTSA DQTMTVDPCN LQLLYQGRSP QSGGDYGALP YRPGLLTLQR
|
| |