Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3575 |
Symbol | |
ID | 9147491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 3964009 |
End bp | 3966924 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Monophenol monooxygenase |
Protein accession | YP_003638646 |
Protein GI | 296131396 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCACCC GGAAGAACGT CCGCAAGCTG ATCGTCGAGT ACAACGCGGC GGCCGACAAG ACGACCACCG CGCTGTACCG GTTCCGCGAG GCCGTCGTCG CCCTCAAGGC GGCACCCAGC CAGCTGGCCC CACCGCACAC GCAGGCCAAC CGCTACGACG ACTACGTCTA CATCCACCAG CAGTCCATGG CCGGGCACTC GAACAACGAC CCCGGCCCGC ACCCCGGGCA CCGCGGCCCC ACCTTCTTCC CGTGGCACCG GGAGTTCCTG CGGCGCTTCG AGCAGGACCT GCGGACCGTC GCGGGCGACC CGTCGATCTG CCTGCCCTAC TGGGACTGGA GCGTCGACAG GACGTCGGCC GACCCCGGCT GGCCGTTCTT CACCGACTTC CTCGGGGGCG ACGGCACGGG GGCCGGCATC GTCGTGCCCG ACGGGCCCTT CGCCAGCGCG AACGGCTGGG TCCTCGCCAT CGCCGACCCC TTCAGCAACA ACGCGCAGCA CCAGGCGAAC CTCGACAAGC TGCAGCGCAG CTTCAACGGC ACGCTGCCGA CCCGGGCCGC CGTGATGGAC GCCCTGGCGG TCGACGAGTA CGACGTCGGC CCGTGGAACA TCACCTCCGC GGGCGCGCAG AGCTTCCGCA ACCGCGTCGA GGGCTGGGTC GGCCCGCAGG CGGGAGCCAA CACCCACAAC CGGGTGCACG TGTTCGTCGG CGGCTCCATG CTGCCCGGGA CCTCCCCCAA CGACCCCGTC TTCTTCCTCA ACCACGCCAA GGAGGACGAG CTCTGGGCCG TGTGGATGCA GAAGTACCCC GGCGTGCCGC ACTACCTCCC GCTGGACAGC GAGCCGCTGC CCGCGGGGCA CTCGCACCTC GTCCGGCTGA GCGACCACAT GGAGTCCCTC GCCGAGTACT TCGGCGCCGG CACCATCGAC CGGCCCGTCG ACCTGCTCGA CCACAAGGCC ATCACCTGGT ACGACACCGA CCTGCCGGAC ATCGTCGTGG AGTCGGGCCC GGCGCTCGGG TTCACCGACG TCCCCGCAGG TCTCACCCAG ACGAAGTTCA TCCGGTTCCG GGTACGGACA CCGCGCACCG TGTCGTTCAG CGTCACCGCC GCCCCCACCG GCAACTTCAC CGTGCTGGGC GGCCCGGACT TCCCGGTCGT GCCGGACGAG GCGAACGACT CCGAGGTCCT CGAGATCGGC GTGCAGTTCC ACGCCGTGGG CGCGAACGTG CAGGTCGCCG CCGTCGACCT GCAGGCCACG GTCGTCGACG ACGAGGGCTA CTACGCGGCG AACCAGGGCG ACCCCTTCGT CGTCGGGACG TTCCACGTCG AGCTCGTCGC CAGCAACATC GTCACCACCG ACAGCTCGCT CGCCCTCGTC CTGGACCGCT CGGGCAGCAT GGCCGACGTC GCCGCCGGTG GGGCCACCAA GAGCACGCTG CTCAAGCGCG CCGTCGGCGT CGTGCACAGC CTCATGCAGC CGACCGACGA GATCGGCATC GCACGCTTCG GCACCACCGC GGACGTGGTC CTGCCGATGA CGGCCGCCTC GGCGGGCCTC GGCACGGTGC TGACCGGCAC CGCCCTCGAC CCGGCCGGCG CCACGGCGCT CGGCCGCGGG CTGCAGGAGG GCAGCGGCCT GATCAACGGC CCCGGGGCCA CCAAGCCCAA CAAGGCCGTC ATCGTCATGA CCGACGGCAA CGAGAACATC CCGCCGTTCG TCGACGACCT CCCCGCGGGC ACCGTCAGCC AGACGACGTT CGCCATCGGC CTGGGCCTGC CCGGCCAGGT CAGCGACCCC GTCCTCGACG CCGTCGCCGC CAACACCGGC GGGTACCTGC TGGTCACCGG TGACGTGTCG TCCGACACCG AGCGCTTCAC GCTCGCGAAG TTCTTCCTCC AGGTCCTCAA GGACGCCACG CTCAACCAGA CCGTGGTGGA CCCTGCCGGC GACCTGCTGT GGAACGGCGG CAAGCACGTG GTGCCGTTCC AGGTCGCCGA CACCGACGTC TCGGTCGACG TGGTCGTCCT CACCGCGCTC CCCTTCGCGC TCGACCTGCG GCTCGTCACG CCGAGCGGTG TCGAGATCAC GCGGGACACG CCCGCCACGG AGCCGAACGT GCAGTACGTC GTCGGCGACG ACGTGGCGTA CTACCGGCTG ATGCTGCCCG CGCTCGCCGC CGACCTCGCC GGCTCCCACC GCGGCCGGTG GCAGGCCGTG CTCACGCTGC GCCCGGTGGA CGAGGTGGTC GAGGAGATCC GCACGCAGGA GAACCGCACG ACCGTGAAGG AGCTGCTGAC CCGGCTGCGG ACCGCGGACA AGACCGTGCC CTACAACCTG TCGGTGCACA CGTTCTCCAA CCTGCGGCTC GACGTCGAGC TCGCGCAGAA GAGCCGCGAG CCCGGCAGCA CCGCCACGCT GGTCGCGTCC CTGCACGAGT ACGACGTGCC GCTGATCGGC GGGGCGAAGG TGTGGGCGAC GGTCTCCGGC CCGGGGTACA CCGGCGTCAC CGTGCCGTTC GACGACCTCG GTGACGGCAC GCACCGCCTC GACCGGCCGC TCAAGCGCGC CGGCGCCCAC CGGTTCGTCG TGCACGCCGA GGGCCGGACC TCCGGCGGCG ACCCGTTCAC GCGCGAGACG CTGGTGACCG CGGGCGTGTG GCGCGGCGGC GACAAGCCGT GGGAGCCGAA GGAGGCGTCC GACGAGGAGA AGAAGGAGGA GCGCGGTCGC GACGACAAGG CCGAGCGCGC CGGGGAGTCG ACCGTCGACG TCGGCAAGGT CGCCGAGCGC CTGCGGCGGG CCGCACAGGC CGAGCCGCTC AGCACGCCCG TGAAGCGGCG CGACCGGCGA CCCGGCACGC CGGGCAACCT GTTCGTCGTC GAGGGGTTCG TCAACCCCGA GGGCGGCGAG GACTGA
|
Protein sequence | MRTRKNVRKL IVEYNAAADK TTTALYRFRE AVVALKAAPS QLAPPHTQAN RYDDYVYIHQ QSMAGHSNND PGPHPGHRGP TFFPWHREFL RRFEQDLRTV AGDPSICLPY WDWSVDRTSA DPGWPFFTDF LGGDGTGAGI VVPDGPFASA NGWVLAIADP FSNNAQHQAN LDKLQRSFNG TLPTRAAVMD ALAVDEYDVG PWNITSAGAQ SFRNRVEGWV GPQAGANTHN RVHVFVGGSM LPGTSPNDPV FFLNHAKEDE LWAVWMQKYP GVPHYLPLDS EPLPAGHSHL VRLSDHMESL AEYFGAGTID RPVDLLDHKA ITWYDTDLPD IVVESGPALG FTDVPAGLTQ TKFIRFRVRT PRTVSFSVTA APTGNFTVLG GPDFPVVPDE ANDSEVLEIG VQFHAVGANV QVAAVDLQAT VVDDEGYYAA NQGDPFVVGT FHVELVASNI VTTDSSLALV LDRSGSMADV AAGGATKSTL LKRAVGVVHS LMQPTDEIGI ARFGTTADVV LPMTAASAGL GTVLTGTALD PAGATALGRG LQEGSGLING PGATKPNKAV IVMTDGNENI PPFVDDLPAG TVSQTTFAIG LGLPGQVSDP VLDAVAANTG GYLLVTGDVS SDTERFTLAK FFLQVLKDAT LNQTVVDPAG DLLWNGGKHV VPFQVADTDV SVDVVVLTAL PFALDLRLVT PSGVEITRDT PATEPNVQYV VGDDVAYYRL MLPALAADLA GSHRGRWQAV LTLRPVDEVV EEIRTQENRT TVKELLTRLR TADKTVPYNL SVHTFSNLRL DVELAQKSRE PGSTATLVAS LHEYDVPLIG GAKVWATVSG PGYTGVTVPF DDLGDGTHRL DRPLKRAGAH RFVVHAEGRT SGGDPFTRET LVTAGVWRGG DKPWEPKEAS DEEKKEERGR DDKAERAGES TVDVGKVAER LRRAAQAEPL STPVKRRDRR PGTPGNLFVV EGFVNPEGGE D
|
| |