Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3031 |
Symbol | |
ID | 9146943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3365859 |
End bp | 3369176 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003638113 |
Protein GI | 296130863 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTTCCA CACCCGACCC GCGCGGCAGG CGCGTCCGGT GGCTGGCCGG CACGACGGCC GCCGCGCTGC TCGTCGCGCC GGCCCTCGCC TCCACCGCCT CCGGCGCGCC CGTCAGCACC GTGCACGACT TCTCCGACGG CCCGCAGGGC TGGTTCTCCT ACGACAACAC CGGCTCGGTG TCGTCGTCGG CGGACACCGG TGAGCTGTGC GCGGTCGTCG ACGGCGGCGA CCAGCCGTGG GACATCGCGC TCCAGCACGA CGACGTGACG TACGAGCGCG ACGCGACGTA CACCGTGTCG TTCGACGCGC ACGCCAGCGC ACCGGTGACG GTCCCGATGC AGGGCGGCGT GGGGTACCCC GCGGCGTTCG GCCACTCGGT CGTGCTCGAC GGCACGTCCA CGCCGACGCA CGTCGAGTTC ACCTTCACCC CCGCGGACTG GCCGACGAGC CCGGACCCGG CCGTCTCCCC CGTCGACGAC GACTGGACCA GCGCGACCGG CCAGGTGTCC TTCCAGCTCG GGGGGCAGTC CGCGTCGTAC ACGTTCTGCG TCGACGACTT CTCGCTGACG TCGGGCACGC GGATCGTCCA CGACTTCACG GCCGGGGACA TGGGCGAGTT CGACATGTAC GACTCGGCCG GCGGCGGCAC CGCCCGGCCC GGCACCGACG GCGTGAGCGC CTGCATCGAC CTGCAGGGGG GCTACGCCAA CCCGTACGAG GCAGGGCTCG AGTACAAGTA CGTCGACGTC GTGGAGGGCC GCAACTACGT CCTGGAGCTC ACCGCCTACG CGAGCGAGGA GGCGAACGTC AACGTGCTCG TCGGGCAGTA CGGCGACCCG TGGCACCGCG TGCTGTCGAC CGAGGCCGCA CTGACCACCA CGCCCCAGAC GTTCCGGTAC CCGTTCACCG CGGACGCGAC CTTCAGCTCC GACCCCGCGA CCGCGTGGGG GCGCATCCAG GTCGAGCTGG GCCGCAAGGT CGCTCCGTAC ACGTTCTGCG TCACGAGCCT GTCGCTCGTC GAGACCACGC AGGCTCCCCC GCCGTACGCG CCGGAGACCG GCCCGCGGGT GCGCGTCAAC CAGGTCGGCT ACCTGCCCGA CGGTCCGCAG CGCGCCACGC TCGTCACGGA CGAGACGGAC GCGGTGACGT GGGAGCTGCT GTCCGGCGCC ACGGTCGTGG AGACGGGCGA GACCACGCCG CACGGGGTGG ACCCGAGCGC CGGCCTCAAC GTGCACGTCA TCGACCTCGG CGGCGTCCCC GCCGGCTCCT ACACCCTGCG GGCCGACGGC GAGACGAGCC ACCCGTTCGT CGTCGACGCC GGCATCTACC AGGACCTGCG GCAGGACGCG CTCGACTACT TCTACCCCGT GCGCTCCGGC ATCGCGATCG ACGGCGCGAT CATCGGCGAC GCCCGGTTCA CGCGCGCCGC CGGGCACGTC GGGCGCCCCG GCGAAGCGAC GCCCAACCAG GGTGACGTCG CCGTCCCGTG CATCACCCCG GCGGAGGCGC AGAACCTCTA CGGCGACTGG ACGTGCGACT ACACGCTCGA CGTGACCGGT GGCTGGTACG ACGCCGGCGA CCACGGCAAG TACGTCGTCA ACGGCGGCAT CGCGGTCGCG CAGCTGCTCG GCACCTACGA GCGCACCCTG TACGCCCCCA CCGGCGACCC GGACGCGCTC GGCGACGGCA GCATGGACAT CCCGCTCGAC GAGCAGAGCA ACGGCGTGCC GGACCCGCTG GACGAGGCCC GCTGGGAGCT CGAGTGGATG CTGCGCATGC AGGTCCCCGC CGGGCAGCCG CTGGCCGGCA TGGTCCACCA CAAGGTGCAC GACGTGGACT GGACGGGCCT GCCGCTGATG CCGGCCGACG ACCCGCAGGA GCGGCGTCTG CACCGTCCGT CGACCGCGGC GACCCTCAAC CTCGCGGCCG TCGCGGCACA GGGCGCCCGC CTGTGGGAGC CGTACGACCC GGAGTTCGCC GCCGAGCTGC TCGCCGCGGC CCGCGTGGCC TGGGACGCGG CGCAGGCCAA CCCCGTCCTG CTCGCGCCGG CGCCCAACGC CGACCCGAGC CCCGGCGGTG GCCCGTACGA CGACACGGAC GTCAGCGACG AGGCCTACTG GGCCGCGGCC GAGCTGTTCC TCACCACCGG TGAGAACGCG TTCCGTGACG CGGTCCTGAC GAGCGAGCAG CACACGGCCG ACGTCTTCTC CGACGGGTTC TTCTGGGGCG AGGTCGCCGC GCTGGCGCGC ATGGACCTCG CGGTCGTCGA GTCCGAGATC CCCGGTCGCA CGGCGATCCG CCGGTCGGTC GTGGAGGGCG CGGAGCTGTT CCTCGCGAAG CAGCAGGCCC AGCCGTTCGG CCAGGCGTAC GCCGGGGACG CCGACGGCGA CTACGACTGG GGGTCGAACT CCTCGATCCT CAACAACCAG GTGATCCTCG GGACCGCGTT CGACCTGACG AGCGAGCAGC GGTTCGCCGA CGCCGTCCTG GAGTCGATGG ACTACCTGCT GGGCCGCAAC GCGCTCAACC TGTCGTACGT CACGGGGTAC GGCACGGCGT TCTCGCAGAA CCAGCACAGC CGGTGGTTCG CCCACTCGCT GACCGAGTCG CTGCCGAACC CCCCGAAGGG CTCGGTCGCC GGCGGCCCCA ACTCGCTGAC CGGCACCTGG GACCCGGTGA TCGCAGGCCT GTACGGCCCG GACCGCATGT GCGCGCCGCA GCTGTGCTAC GTCGACGACA TCCAGTCGTG GTCGACCAAC GAGATCACCG TCAACTGGAA CTCGGCACTC TCGTGGGTGG CGTCGTTCGT CGCCGACCAG CAGGCCGGTG ACCGGTCGGA CGCCGGCACG GTGGCGTGGG TCGTGACGGA CCCGTCCGAC ACGTCGGTCG CCGCCGGCGC GGACGCCACG TTCACGGTCG GGACCACGGG CTCGCCCACC CCGACGGTGC AGTGGCAGCA GCTCGTCGAC GGCGCCTGGG TCGACGTGGC CGACGCCACC GGGGCGACCC TCCGGCTCAC GGCACGCACG GCGGACTCCG GCGCGCAGTA CCGCGCGTAC GTCGCCAACG CGTTCGGCGG CGCCTACTCG GAGCCGGCGA CGCTCACGGT GACGGCCGCG GGCACCGGCA CCCCGACCCC GGGGGCCGAC ACCTCCGGGA CGCCCGGCAC CCCCGGCGGC GGACCGCGCG TCGCCGGCGC CGGACCACTG GCCGCGACCG GCGCGCACGC AGGCGCGCTG CTCGGCACCG GACTCCTTCT CCTCGTCGCG GGAGCCGGTG CGGTGGCCAT GGCACGCAGG GCCCGACCGC GCGTCTGA
|
Protein sequence | MVSTPDPRGR RVRWLAGTTA AALLVAPALA STASGAPVST VHDFSDGPQG WFSYDNTGSV SSSADTGELC AVVDGGDQPW DIALQHDDVT YERDATYTVS FDAHASAPVT VPMQGGVGYP AAFGHSVVLD GTSTPTHVEF TFTPADWPTS PDPAVSPVDD DWTSATGQVS FQLGGQSASY TFCVDDFSLT SGTRIVHDFT AGDMGEFDMY DSAGGGTARP GTDGVSACID LQGGYANPYE AGLEYKYVDV VEGRNYVLEL TAYASEEANV NVLVGQYGDP WHRVLSTEAA LTTTPQTFRY PFTADATFSS DPATAWGRIQ VELGRKVAPY TFCVTSLSLV ETTQAPPPYA PETGPRVRVN QVGYLPDGPQ RATLVTDETD AVTWELLSGA TVVETGETTP HGVDPSAGLN VHVIDLGGVP AGSYTLRADG ETSHPFVVDA GIYQDLRQDA LDYFYPVRSG IAIDGAIIGD ARFTRAAGHV GRPGEATPNQ GDVAVPCITP AEAQNLYGDW TCDYTLDVTG GWYDAGDHGK YVVNGGIAVA QLLGTYERTL YAPTGDPDAL GDGSMDIPLD EQSNGVPDPL DEARWELEWM LRMQVPAGQP LAGMVHHKVH DVDWTGLPLM PADDPQERRL HRPSTAATLN LAAVAAQGAR LWEPYDPEFA AELLAAARVA WDAAQANPVL LAPAPNADPS PGGGPYDDTD VSDEAYWAAA ELFLTTGENA FRDAVLTSEQ HTADVFSDGF FWGEVAALAR MDLAVVESEI PGRTAIRRSV VEGAELFLAK QQAQPFGQAY AGDADGDYDW GSNSSILNNQ VILGTAFDLT SEQRFADAVL ESMDYLLGRN ALNLSYVTGY GTAFSQNQHS RWFAHSLTES LPNPPKGSVA GGPNSLTGTW DPVIAGLYGP DRMCAPQLCY VDDIQSWSTN EITVNWNSAL SWVASFVADQ QAGDRSDAGT VAWVVTDPSD TSVAAGADAT FTVGTTGSPT PTVQWQQLVD GAWVDVADAT GATLRLTART ADSGAQYRAY VANAFGGAYS EPATLTVTAA GTGTPTPGAD TSGTPGTPGG GPRVAGAGPL AATGAHAGAL LGTGLLLLVA GAGAVAMARR ARPRV
|
| |