Gene Cfla_3048 details


Gene Information

Locus tag: Cfla_3048
Symbol:
ID: 9146960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3391318
End bp: 3392865
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_003638130
Protein GI: 296130880
COG category:
COG ID:
TIGRFAM ID:
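
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes that the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3391318
end_bp = 3392865
gene_length_bp = 1548
protein_length_aa = 515

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons = span // 3

# Assumption: the last codon is a stop codon and is not translated.
coding_codons = codons - 1

print(span, span % 3, coding_codons)  # prints: 1548 0 515
```

Under those assumptions the span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.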


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0608814
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG TGACGAACCC GATCCTGCCC GGGTGCTACC CGGACCCGTC GATCTGCCGG 
GTCGGTGAGG ACTACTACCT CGTCACCTCG ACGTTCGAGT ACCTCCCGGG CCTGCCCGTG
CTGCACAGCC GCGACCTGGT GACCTGGGAG ACGATCGGGC ACGTCGTGCA CCGCCCCGGG
CAGCTGGACT ACACGGGGAT CGCGTCGTCC GGCGGGCTGT ACGCGCCGAC GATCCGCCAC
CACGACGGGC TGTTCTGGGT CATCTGCACG CTCGTGGGGC AGCCGCAGGG CAACCCGGGC
GGCAACTTCC TCATGACCGC GACGGACCCC GCCGGTCCCT GGTCGGACCC CATGTGGCTC
GACGCGGACG GCATCGACCC GTCGATCTTC TTCGACGACG ACGGCCGCAT CTGGGTGCAC
GGCACGCGAC TGGCCCGCGA ACCCGAGTGG TTCCACCAGA CGGAGGTGTG GATCCGCGAG
CTCGACCCGG AGACGAAGAA GCTCACCGGG CCGGAGCACG TCGTGTGGTC CGGAGCGGTC
AAGGGCGCCG TGTGGGCGGA GGCGCCGCAC CTGTACAAGG TCGACGGCAC GTACTACCTG
ATGGCGGCCG AGGCGGGCAC GGAGTTCCAC CACGCCGAGA GCATCGCGCA GGCCGAGGTC
GTCACCGGCC CGTACACCGG CACCCGGGGC AACCCGATCC TCACGCACCG GCACCTGGGC
CGGCAGCACC CGGTCGTCGG GGCCGGGCAC GCGGACCTCG TCGAGGCCGC CGACGGCTCG
TGGTGGGCCG TGCTGCTCGC CATGCGCACC TACGGCGGCT ACCACTACCC GCTGGGCCGT
GAGACGTTCC TCGTGCCCGT CGTGTGGGAG GACGGGTGGC CGGTGTTCGC CCCGGGCGTC
GGCAAGGTCC CCGACGAGGT CGAGGTGCCG TTCGCGACGG GCCCGGCGCG CGGGAGCGCG
CAGGGACTGG CGTCGGGGGT CGTCGCGCCG GACGACCTGC GGTGGACCGC GGTGCGGGCC
CTGCCCGCCG AGGTCGCGAC GCCCGCCGGC CAGGGCTGGG ACCTGCCGCT GCGCGCCGCG
ACGCTCGACT CGCTGGAGGT GCCCGCCTTC CTGGGCGTCC GCCTGCAGCA CACCCACGCG
GACTTCCTGG CGCGCGTGAC CGTGGACCTC GCCGTGGGCG AGGAGGTCGG GGTCGTCGTC
CGGCAGTCCG AGAAGGACCA CGCGCGCCTC GCGGTGACGC GTGGCGGGAC CGGCCTCGTG
GCCCGCGCGG TGCACCGGCA GCGGGACGAG GACCGCGAGG TCGGCTCGGC ACCCGTTGCC
GACGGCCCGC TGACGCTCGC CGTGCGGGCC CGCGGTGTCG ACCTCGAGCT GCTCGTCGCG
TCGGGCGACG CCGAGCCCGT CGTCGTCGGC ACGGTCCCCG CGACCGACCT CGACACCGTG
GCCACCGGCG GCTTCCTCGG CCTCTGGCTC GGCGTCTACG CCACGAGCGA CGGCGCGGAG
ACGAGCACGG TCGCCCACGT CGAGGTCACC TACACCCCGG CGTCCTGA
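
A GC content figure such as the one reported in this record can be recomputed from the gene sequence. The following is a minimal sketch, not the method the database itself uses: the helper name `gc_percent` is hypothetical, and it assumes the sequence is pasted as plain text in space-separated blocks of ten bases, as shown above. Only the first sequence line is used here to keep the example short; the whole gene sequence would be passed in the same way.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence from this record (60 bases).
first_line = (
    "ATGAGCACCG TGACGAACCC GATCCTGCCC GGGTGCTACC CGGACCCGTC GATCTGCCGG"
)

print(round(gc_percent(first_line), 1))
```

Whitespace is stripped before counting, so the ten-base blocks and line breaks in the listing do not affect the result.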
 
Protein sequence
MSTVTNPILP GCYPDPSICR VGEDYYLVTS TFEYLPGLPV LHSRDLVTWE TIGHVVHRPG 
QLDYTGIASS GGLYAPTIRH HDGLFWVICT LVGQPQGNPG GNFLMTATDP AGPWSDPMWL
DADGIDPSIF FDDDGRIWVH GTRLAREPEW FHQTEVWIRE LDPETKKLTG PEHVVWSGAV
KGAVWAEAPH LYKVDGTYYL MAAEAGTEFH HAESIAQAEV VTGPYTGTRG NPILTHRHLG
RQHPVVGAGH ADLVEAADGS WWAVLLAMRT YGGYHYPLGR ETFLVPVVWE DGWPVFAPGV
GKVPDEVEVP FATGPARGSA QGLASGVVAP DDLRWTAVRA LPAEVATPAG QGWDLPLRAA
TLDSLEVPAF LGVRLQHTHA DFLARVTVDL AVGEEVGVVV RQSEKDHARL AVTRGGTGLV
ARAVHRQRDE DREVGSAPVA DGPLTLAVRA RGVDLELLVA SGDAEPVVVG TVPATDLDTV
ATGGFLGLWL GVYATSDGAE TSTVAHVEVT YTPAS