Gene Cfla_3031 details

Gene Information

Locus tag: Cfla_3031
Symbol:
ID: 9146943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 3365859
End bp: 3369176
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: glycoside hydrolase family 9
Protein accession: YP_003638113
Protein GI: 296130863
COG category:
COG ID:
TIGRFAM ID:
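
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information record.
START_BP = 3365859
END_BP = 3369176


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(length_bp):
    """Amino-acid-coding codons, assuming exactly one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                 # 3318, matches "Gene Length"
print(coding_codons(length_bp))  # 1105, matches "Protein Length"
```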


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGTTTCCA CACCCGACCC GCGCGGCAGG CGCGTCCGGT GGCTGGCCGG CACGACGGCC 
GCCGCGCTGC TCGTCGCGCC GGCCCTCGCC TCCACCGCCT CCGGCGCGCC CGTCAGCACC
GTGCACGACT TCTCCGACGG CCCGCAGGGC TGGTTCTCCT ACGACAACAC CGGCTCGGTG
TCGTCGTCGG CGGACACCGG TGAGCTGTGC GCGGTCGTCG ACGGCGGCGA CCAGCCGTGG
GACATCGCGC TCCAGCACGA CGACGTGACG TACGAGCGCG ACGCGACGTA CACCGTGTCG
TTCGACGCGC ACGCCAGCGC ACCGGTGACG GTCCCGATGC AGGGCGGCGT GGGGTACCCC
GCGGCGTTCG GCCACTCGGT CGTGCTCGAC GGCACGTCCA CGCCGACGCA CGTCGAGTTC
ACCTTCACCC CCGCGGACTG GCCGACGAGC CCGGACCCGG CCGTCTCCCC CGTCGACGAC
GACTGGACCA GCGCGACCGG CCAGGTGTCC TTCCAGCTCG GGGGGCAGTC CGCGTCGTAC
ACGTTCTGCG TCGACGACTT CTCGCTGACG TCGGGCACGC GGATCGTCCA CGACTTCACG
GCCGGGGACA TGGGCGAGTT CGACATGTAC GACTCGGCCG GCGGCGGCAC CGCCCGGCCC
GGCACCGACG GCGTGAGCGC CTGCATCGAC CTGCAGGGGG GCTACGCCAA CCCGTACGAG
GCAGGGCTCG AGTACAAGTA CGTCGACGTC GTGGAGGGCC GCAACTACGT CCTGGAGCTC
ACCGCCTACG CGAGCGAGGA GGCGAACGTC AACGTGCTCG TCGGGCAGTA CGGCGACCCG
TGGCACCGCG TGCTGTCGAC CGAGGCCGCA CTGACCACCA CGCCCCAGAC GTTCCGGTAC
CCGTTCACCG CGGACGCGAC CTTCAGCTCC GACCCCGCGA CCGCGTGGGG GCGCATCCAG
GTCGAGCTGG GCCGCAAGGT CGCTCCGTAC ACGTTCTGCG TCACGAGCCT GTCGCTCGTC
GAGACCACGC AGGCTCCCCC GCCGTACGCG CCGGAGACCG GCCCGCGGGT GCGCGTCAAC
CAGGTCGGCT ACCTGCCCGA CGGTCCGCAG CGCGCCACGC TCGTCACGGA CGAGACGGAC
GCGGTGACGT GGGAGCTGCT GTCCGGCGCC ACGGTCGTGG AGACGGGCGA GACCACGCCG
CACGGGGTGG ACCCGAGCGC CGGCCTCAAC GTGCACGTCA TCGACCTCGG CGGCGTCCCC
GCCGGCTCCT ACACCCTGCG GGCCGACGGC GAGACGAGCC ACCCGTTCGT CGTCGACGCC
GGCATCTACC AGGACCTGCG GCAGGACGCG CTCGACTACT TCTACCCCGT GCGCTCCGGC
ATCGCGATCG ACGGCGCGAT CATCGGCGAC GCCCGGTTCA CGCGCGCCGC CGGGCACGTC
GGGCGCCCCG GCGAAGCGAC GCCCAACCAG GGTGACGTCG CCGTCCCGTG CATCACCCCG
GCGGAGGCGC AGAACCTCTA CGGCGACTGG ACGTGCGACT ACACGCTCGA CGTGACCGGT
GGCTGGTACG ACGCCGGCGA CCACGGCAAG TACGTCGTCA ACGGCGGCAT CGCGGTCGCG
CAGCTGCTCG GCACCTACGA GCGCACCCTG TACGCCCCCA CCGGCGACCC GGACGCGCTC
GGCGACGGCA GCATGGACAT CCCGCTCGAC GAGCAGAGCA ACGGCGTGCC GGACCCGCTG
GACGAGGCCC GCTGGGAGCT CGAGTGGATG CTGCGCATGC AGGTCCCCGC CGGGCAGCCG
CTGGCCGGCA TGGTCCACCA CAAGGTGCAC GACGTGGACT GGACGGGCCT GCCGCTGATG
CCGGCCGACG ACCCGCAGGA GCGGCGTCTG CACCGTCCGT CGACCGCGGC GACCCTCAAC
CTCGCGGCCG TCGCGGCACA GGGCGCCCGC CTGTGGGAGC CGTACGACCC GGAGTTCGCC
GCCGAGCTGC TCGCCGCGGC CCGCGTGGCC TGGGACGCGG CGCAGGCCAA CCCCGTCCTG
CTCGCGCCGG CGCCCAACGC CGACCCGAGC CCCGGCGGTG GCCCGTACGA CGACACGGAC
GTCAGCGACG AGGCCTACTG GGCCGCGGCC GAGCTGTTCC TCACCACCGG TGAGAACGCG
TTCCGTGACG CGGTCCTGAC GAGCGAGCAG CACACGGCCG ACGTCTTCTC CGACGGGTTC
TTCTGGGGCG AGGTCGCCGC GCTGGCGCGC ATGGACCTCG CGGTCGTCGA GTCCGAGATC
CCCGGTCGCA CGGCGATCCG CCGGTCGGTC GTGGAGGGCG CGGAGCTGTT CCTCGCGAAG
CAGCAGGCCC AGCCGTTCGG CCAGGCGTAC GCCGGGGACG CCGACGGCGA CTACGACTGG
GGGTCGAACT CCTCGATCCT CAACAACCAG GTGATCCTCG GGACCGCGTT CGACCTGACG
AGCGAGCAGC GGTTCGCCGA CGCCGTCCTG GAGTCGATGG ACTACCTGCT GGGCCGCAAC
GCGCTCAACC TGTCGTACGT CACGGGGTAC GGCACGGCGT TCTCGCAGAA CCAGCACAGC
CGGTGGTTCG CCCACTCGCT GACCGAGTCG CTGCCGAACC CCCCGAAGGG CTCGGTCGCC
GGCGGCCCCA ACTCGCTGAC CGGCACCTGG GACCCGGTGA TCGCAGGCCT GTACGGCCCG
GACCGCATGT GCGCGCCGCA GCTGTGCTAC GTCGACGACA TCCAGTCGTG GTCGACCAAC
GAGATCACCG TCAACTGGAA CTCGGCACTC TCGTGGGTGG CGTCGTTCGT CGCCGACCAG
CAGGCCGGTG ACCGGTCGGA CGCCGGCACG GTGGCGTGGG TCGTGACGGA CCCGTCCGAC
ACGTCGGTCG CCGCCGGCGC GGACGCCACG TTCACGGTCG GGACCACGGG CTCGCCCACC
CCGACGGTGC AGTGGCAGCA GCTCGTCGAC GGCGCCTGGG TCGACGTGGC CGACGCCACC
GGGGCGACCC TCCGGCTCAC GGCACGCACG GCGGACTCCG GCGCGCAGTA CCGCGCGTAC
GTCGCCAACG CGTTCGGCGG CGCCTACTCG GAGCCGGCGA CGCTCACGGT GACGGCCGCG
GGCACCGGCA CCCCGACCCC GGGGGCCGAC ACCTCCGGGA CGCCCGGCAC CCCCGGCGGC
GGACCGCGCG TCGCCGGCGC CGGACCACTG GCCGCGACCG GCGCGCACGC AGGCGCGCTG
CTCGGCACCG GACTCCTTCT CCTCGTCGCG GGAGCCGGTG CGGTGGCCAT GGCACGCAGG
GCCCGACCGC GCGTCTGA
 
Protein sequence
MVSTPDPRGR RVRWLAGTTA AALLVAPALA STASGAPVST VHDFSDGPQG WFSYDNTGSV 
SSSADTGELC AVVDGGDQPW DIALQHDDVT YERDATYTVS FDAHASAPVT VPMQGGVGYP
AAFGHSVVLD GTSTPTHVEF TFTPADWPTS PDPAVSPVDD DWTSATGQVS FQLGGQSASY
TFCVDDFSLT SGTRIVHDFT AGDMGEFDMY DSAGGGTARP GTDGVSACID LQGGYANPYE
AGLEYKYVDV VEGRNYVLEL TAYASEEANV NVLVGQYGDP WHRVLSTEAA LTTTPQTFRY
PFTADATFSS DPATAWGRIQ VELGRKVAPY TFCVTSLSLV ETTQAPPPYA PETGPRVRVN
QVGYLPDGPQ RATLVTDETD AVTWELLSGA TVVETGETTP HGVDPSAGLN VHVIDLGGVP
AGSYTLRADG ETSHPFVVDA GIYQDLRQDA LDYFYPVRSG IAIDGAIIGD ARFTRAAGHV
GRPGEATPNQ GDVAVPCITP AEAQNLYGDW TCDYTLDVTG GWYDAGDHGK YVVNGGIAVA
QLLGTYERTL YAPTGDPDAL GDGSMDIPLD EQSNGVPDPL DEARWELEWM LRMQVPAGQP
LAGMVHHKVH DVDWTGLPLM PADDPQERRL HRPSTAATLN LAAVAAQGAR LWEPYDPEFA
AELLAAARVA WDAAQANPVL LAPAPNADPS PGGGPYDDTD VSDEAYWAAA ELFLTTGENA
FRDAVLTSEQ HTADVFSDGF FWGEVAALAR MDLAVVESEI PGRTAIRRSV VEGAELFLAK
QQAQPFGQAY AGDADGDYDW GSNSSILNNQ VILGTAFDLT SEQRFADAVL ESMDYLLGRN
ALNLSYVTGY GTAFSQNQHS RWFAHSLTES LPNPPKGSVA GGPNSLTGTW DPVIAGLYGP
DRMCAPQLCY VDDIQSWSTN EITVNWNSAL SWVASFVADQ QAGDRSDAGT VAWVVTDPSD
TSVAAGADAT FTVGTTGSPT PTVQWQQLVD GAWVDVADAT GATLRLTART ADSGAQYRAY
VANAFGGAYS EPATLTVTAA GTGTPTPGAD TSGTPGTPGG GPRVAGAGPL AATGAHAGAL
LGTGLLLLVA GAGAVAMARR ARPRV
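
As a rough consistency check between the two sequences, the gene sequence can be translated with translation table 11 and its GC fraction computed. The sketch below is a minimal, assumption-laden example rather than the database's own method: it assumes the first codon is a valid initiator (here GTG) and is therefore read as Met, it stops at the first in-frame stop codon, and the helper names (clean, gc_fraction, translate_cds) are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino acid
# assignments for table 11 are the same as for the standard code;
# table 11 differs only in which codons may start translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS, reading the first codon as Met (assumed initiator)."""
    seq = clean(seq)
    residues = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append("M" if index == 0 else amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence shown in this record.
print(translate_cds("GTGGTTTCCA CACCCGACCC GCGCGGCAGG"))  # MVSTPDPRGR
```

Passing the full gene sequence to translate_cds and gc_fraction should allow a comparison with the protein sequence and the reported GC content, provided the assumptions stated above hold.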