Gene Cfla_2848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2848 
Symbol 
ID9146756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3163841 
End bp3165403 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content69% 
IMG OID 
ProductAlpha-N-arabinofuranosidase 
Protein accessionYP_003637931 
Protein GI296130681 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.724859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0053695 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGATCAG CCCGTCCCAC CGGAGCAGCG GCACCTGCGC GCCGCCGCTG GGCGACCGCG 
CTCGTCGCGG TCGCGACGGC CGTGTCGGCC GGTGCCGCCG CGCTCGTCGC CGCACCTGCG
GCGTCAGCGG CCACGGTCGA CACGACCGCG TCCTACCTGC TGATCAACCG CAACAGCGGC
AAGGCCCTCG ACGTCTACAA CCTCGCCATG AACGACGGTG CGCGCATCAC GCAGTGGTCG
CGCAACGACG GCACGCAGCA GCAGTGGCAG TTCGTCGACT CGGGCAACGG CTACTACCGC
ATCAAGTCGG TCCTGTCGGG CAAGGTGCTC GACGTCTACA ACTGGTCGAC GTCGAACGGT
GGCTCGATCG TCCAGTACAC CGACCGCAAC CAGAACAACC AGCAGTGGCG CCTGAACGAC
GGCCCCAACG GCACGGTCAT CCTGGTCAGC AGGCACTCCA ACCTCGCGCT CGAGGTCCAG
GGCGCGTCGA CGGCCGACGG TGCCAACATC GTCCAGTACA GCAACTGGGG CGGCACCAAC
CAGCAGTGGC AGCTCGTGCG CGTCGGTGGT GGGGGCGGCC AGCCCACGCA GCAGCCGACC
CAGCAGCCGA CGCAGCAGCC GACGGCGCAG CCGACCCAGC AGCCCACCTC GTCGCCGACG
TGCAACCTGC CGTCGTCGTA CCGCTGGCGC GACTCCGGCG TGCTGGCGCA GCCGCGCCAG
GGCTGGGTGT CGCTGAAGGA CTTCACGGTC GCGCCGTACA ACGGCCAGCA GCTCGTGTAC
GCCACGACCA ACACCGGCAC CGCGTGGCAG TCGACGATGT TCTCGCCGTT CTCGAGCTAC
TCGCAGATGG GCTCCGCGCA GCAGCAGACC ATGCCGTTCA CCGCCGTCGC GCCGTCGCTG
TTCTACTTCG CACCCAAGAA CATCTGGGTG ATGGCGTACC AGTGGGGCGG CCCGGCGTTC
TCCTACCGCA CGTCGAGCAA CCCGTCGAAC GTCAACGGCT GGAGCGCGCA CCAGACGCTG
TTCACCGGCT CGATCTCGAA CTCCGGCACC GGTCCGATCG ACCAGGCGCT GATCGGTGAC
GACCGGAACA TGTACCTGTT CTTCGCCGGC GACAACGGGC GGATCTACCG CGCGTCGATG
CCGATCGGCA ACTTCCCCGG CTCGTTCGGG TCGAACTACC AGACGATCAT GACCGACACG
CAGGCCAACC TCTTCGAGGC CGTGCAGGTC TACAAGCTGG CCGGTCAGCA GCGCTACCTC
ATGATCGTCG AGGCCATCGG GTCGCAGGGC CGCTACTTCC GGTCCTTCAC CGCCACGAGC
CTGGACGGCC AGTGGACCCC GCAGGCCGCG TCCGAGTCGA ACCCGTTCGC CGGCAAGGCC
AACTCCGGTG CGACGTGGAC GAACGACATC AGCCACGGCG AGCTGCTGCG CACCAGCGCG
GACCAGACCA TGACGGTCGA CCCCTGCAAC CTGCAGCTGC TCTACCAGGG CCGCTCCCCG
CAGTCCGGCG GCGACTACGG CGCCCTGCCG TACCGCCCCG GCCTGCTGAC CCTGCAGCGG
TAA
 
Protein sequence
MRSARPTGAA APARRRWATA LVAVATAVSA GAAALVAAPA ASAATVDTTA SYLLINRNSG 
KALDVYNLAM NDGARITQWS RNDGTQQQWQ FVDSGNGYYR IKSVLSGKVL DVYNWSTSNG
GSIVQYTDRN QNNQQWRLND GPNGTVILVS RHSNLALEVQ GASTADGANI VQYSNWGGTN
QQWQLVRVGG GGGQPTQQPT QQPTQQPTAQ PTQQPTSSPT CNLPSSYRWR DSGVLAQPRQ
GWVSLKDFTV APYNGQQLVY ATTNTGTAWQ STMFSPFSSY SQMGSAQQQT MPFTAVAPSL
FYFAPKNIWV MAYQWGGPAF SYRTSSNPSN VNGWSAHQTL FTGSISNSGT GPIDQALIGD
DRNMYLFFAG DNGRIYRASM PIGNFPGSFG SNYQTIMTDT QANLFEAVQV YKLAGQQRYL
MIVEAIGSQG RYFRSFTATS LDGQWTPQAA SESNPFAGKA NSGATWTNDI SHGELLRTSA
DQTMTVDPCN LQLLYQGRSP QSGGDYGALP YRPGLLTLQR