Gene Information |
Locus tag | Cfla_1515 |
Symbol | |
ID | 9145401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1681839 |
End bp | 1685141 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003636612 |
Protein GI | 296129362 |
COG category | |
COG ID | |
TIGRFAM ID | |
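The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1681839
END_BP = 1685141
REPORTED_GENE_LENGTH = 3303     # bp
REPORTED_PROTEIN_LENGTH = 1100  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

# Both comparisons should hold if the assumptions above match the record.
print(computed_gene == REPORTED_GENE_LENGTH)
print(computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.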
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTTCTC GAAGGACGCG CCGCGGACTG CGCGCCGCGA CAGCCATCAC CAGTGCGGGC GTTCTCGCCG CGATCGGGCT GCTGCCCGCA GTCGCGGCGG TCGACGACGC CTTCGACGAC GGGCCGGGCG CGTGGATCGC GTACGGCACC GACGGGGACA TCGACACCAG CTCCGGCGCG CTCTGCGTCG ACGTGCCGGC CGGCTCCGCG CAGTACGGCG TGGGCGTCGT GCTGAACGGC GTGGCCGTGG AGGAGGGCGC GACCTACACG TTCGCGTACA CCGCGAGCGC CTCGACGGAC GTGACCATCC GGGCCCTCGT GGGCCAGAAC GGCGCGCCCT ACGGCACCGT GCTCGACACG ACGCCGGCCC TGACCGCCGA GCCGACGGCC GTCGAGGAGA CGTTCACCGC CTCCGCGTCC TACCCCGCCA CGGCGACCGA CGAGTCCCCC GAGGGGCAGA TCGCGTTCCA GCTCGGCGGC TTCAGCGACG ACGCGTGGAC CTTCTGCCTC GACGACGTGT CGCTGAGCAG CGAGAGCGAG CTGCTGCCGC ACACGTCGTT CGCCGAGGGG CTGGGCCCGT GGGGCCTGTA CGGCACGGGC GACCCGGTCT TCGCGGACGG CGAGATGTGC GTCGAGGTCC CCGGGGGGTA CAGCAACCCG TGGGACGCCG GCCTGTCGTT CACCGGCCTG CCGATCGAGG AGGGCGAGAA CTACGTCCTC ACGCTCACCG GCCGCACCGA GCCGGCGACG CCCGTGCGCG TCATCGTCGG CGAGGGCGGT GGCACCTACC GCATGGTGCT CGAGCAGTCG TCCGCGCCGC TGGGCACCGA GGCGACCACG CTCACCTACC CGTTCACCGC GAACACGTCC TTCCCGGCGG ACGGCGACGC GCCCGGCCAG GTGGCGATCC ACCTCGGCAA GACCGCGGGC TACACCTTCT GCCTCTCGGA GGTCTCGTTG AGCACGTCCG CCACGCCGCC GCCACCCTAC GAGCCGGACA CCGGCCCGCG CGTGCGCGTC AACCAGGTGG GCTACCTGCC GTTCGGCCCG AAGCGCGCGA CGCTCGTCAC CGACGCGACC GAGGCGGTCG CGTGGCAGCT CCTCGGCGCC GACGGCGACG AGGTGGCCGA CGGTGAGTCG CAGCCGGTGG GCGCCGACGC GTCCTCGGGC GAGAACGTGC ACGTCATCGA CTTCTCCGAC GTCACGGCGA CCGGCGAGGG GCTGACGCTC GTGGCCGACG GCGAGACGAG CCACCCCTTC GCGATCGACG CGGACCTCTA CGCGCAGCTG CGGTACGACG CGCTGAACTA CTTCTACCTC GCGCGCTCGG GCATCGACAT CGAGGCGTCC ATCGTGGGGG AGGAGTACGC ACGAGAGGCC GGTCACGTGG GCGTCGCTCC GAACCAGGGC GACACCGAGG TGCCCTGCAT CGGTCCGCGT GACTACTACG ACGGCTGGAC CTGCGACTAC ACGCTCGACG TCACCGGCGG CTGGTACGAC GCCGGTGACC ACGGCAAGTA CGTCGTCAAC GGCGGCATCT CCGTCGCCCA GCTCCTGGCG ACCTACGAGC GCACGCTGCA CGTCGAGGGC GCGTCCACCG AGGCACTGGC CGACGGGACG CTGAACCTCC CCGAGCACGG CAACGGCGTG CCCGACGTGC TCGACGAGGC GCGCTGGGAG CTGGAGTGGA TGCTGAAGAT GGTGGCGCCG TCCGGTGAGT ACGCCGGCAT GGTGCACCAC AAGATCCACG ACGAGGGCTG GACCGGTCTG CCGCTCATGC CGGCGGACGA CCCGCAGGCG 
CGCTCGCTGC ACCGTCCCTC GACGGCGGCG ACGCTCAACC TGGCGGCCGT GGCCGCGCAG GGCGCCCGAC TGTTCAGCGA GTACGACCAG GAGTTCGCCG ACACGCTGCT GGCCGTGGCG CGCAGCACGT ACGAGGCGGC GCTCGCGCAC CCCGACGTGT ACGCGCCGGG TGCCGCGGGC AGCGACGGCG GTGGCCCGTA CGACGACTCC GACGTGTCGG ACGAGTTCTA CTGGGCGGCG GCCGAGCTGT ACGTCACGAC CGGTGAGGCG CGCTACGCCA CGGACCTCAC GGCCAACCCG CACCACACGG GCGACGTCTT CCCGGCCGGC GGCTTCTACT GGGGCGGCGT GGCCGCACTG GGCCGGCTGA CGCTCGCGAC CGTGCCCAAC GACCTGCCCG ACCGTGACGA GGTGCAGGCG TCGGTCGTCG CGGCCGCCGG CCGGTACGTC GCCGCGCAGC AGGAGCACCC CTGGGGCTCG GTCTACTCGC CCGACGGCGG CGTGTACGCC TGGGGCTCGA ACTCCTCGGT CGCGAACATC CTCGTGGTCG TGGGCACGGC GTACGACCTC ACGGGCGACG TCACGTACCA GCGGGCGGCG CTCGAGGGGC TGGACTACCT GTTCGGCCGC AACGCGCTCA ACCAGTCGTA CGTGACCGAC TGGGGCACGA CGACGTCGCA GAACCAGCAC TCGCGCTGGT TCGCCCACCA GCTCGACCCG GCGCTGCCGA ACCCGCCGAA GGGCTCGCTC GCCGGCGGTC CGAACTCGCA GGTCAGCACG TGGGACCCGA CGATGCAGTC CACGCTCTCC GCGGACTGCC CGCCGGCCAC CTGCTACCTC GACGAGATCA GCTCGTGGGC GTCGAACGAG ATCACCGTGA ACTGGAACTC GGCGCTGTCG TGGGTCGCGT CGTTCGCGGC CGACCAGACC GGTGCCCAGC AGGAGACCCC GGTTGCGCCC GTCGTCACCA GCCAGCCCTC CGACCAGTCG GTCGGTGTCG GGGCGACGGC GAGCTTCACC GCGGCCGCGT CGGGCGTCCC GGCGCCGACG GTGCAGTGGC AGGTGCGTCA GGGCAAGAAG TGGCGTGACG TGCCGGGTGC GACCTCGACC ACGCTCACCG TCGTGGCCTC GACGCCGAAG GCGGGCGGCG AGTACCGCGC GGTGTTCACG AACGCGTCGG GGACGGCGAC CAGCGACGTC GCGCGGCTCA CCGTCAAGCA CCAGGCGCCG GTCGTGATCA AGCACCCGGT CGACACGGTC GGGAAGGTCG GACAGCAGGT GACGTTCACC GCGGCCGCCT CCGGCCACCC GCAGCCGACG GTCCGCTGGC AGCAGCGTCG TGGTGAGGGG ACGTGGACCG ACGTGAAGGG GGCGAAGAAG GAGGAGCTCC GCGTCCTCGT CACGCCGAAG TCGCAGGGTC TGCAGTACCG GGCCGTGTTC ACCAACCCCG GTGGCACGGT GACGACCGAC GCGGCACGCC TGACGGTCCG CGGCGGCAGC TGA
|
Protein sequence | MVSRRTRRGL RAATAITSAG VLAAIGLLPA VAAVDDAFDD GPGAWIAYGT DGDIDTSSGA LCVDVPAGSA QYGVGVVLNG VAVEEGATYT FAYTASASTD VTIRALVGQN GAPYGTVLDT TPALTAEPTA VEETFTASAS YPATATDESP EGQIAFQLGG FSDDAWTFCL DDVSLSSESE LLPHTSFAEG LGPWGLYGTG DPVFADGEMC VEVPGGYSNP WDAGLSFTGL PIEEGENYVL TLTGRTEPAT PVRVIVGEGG GTYRMVLEQS SAPLGTEATT LTYPFTANTS FPADGDAPGQ VAIHLGKTAG YTFCLSEVSL STSATPPPPY EPDTGPRVRV NQVGYLPFGP KRATLVTDAT EAVAWQLLGA DGDEVADGES QPVGADASSG ENVHVIDFSD VTATGEGLTL VADGETSHPF AIDADLYAQL RYDALNYFYL ARSGIDIEAS IVGEEYAREA GHVGVAPNQG DTEVPCIGPR DYYDGWTCDY TLDVTGGWYD AGDHGKYVVN GGISVAQLLA TYERTLHVEG ASTEALADGT LNLPEHGNGV PDVLDEARWE LEWMLKMVAP SGEYAGMVHH KIHDEGWTGL PLMPADDPQA RSLHRPSTAA TLNLAAVAAQ GARLFSEYDQ EFADTLLAVA RSTYEAALAH PDVYAPGAAG SDGGGPYDDS DVSDEFYWAA AELYVTTGEA RYATDLTANP HHTGDVFPAG GFYWGGVAAL GRLTLATVPN DLPDRDEVQA SVVAAAGRYV AAQQEHPWGS VYSPDGGVYA WGSNSSVANI LVVVGTAYDL TGDVTYQRAA LEGLDYLFGR NALNQSYVTD WGTTTSQNQH SRWFAHQLDP ALPNPPKGSL AGGPNSQVST WDPTMQSTLS ADCPPATCYL DEISSWASNE ITVNWNSALS WVASFAADQT GAQQETPVAP VVTSQPSDQS VGVGATASFT AAASGVPAPT VQWQVRQGKK WRDVPGATST TLTVVASTPK AGGEYRAVFT NASGTATSDV ARLTVKHQAP VVIKHPVDTV GKVGQQVTFT AAASGHPQPT VRWQQRRGEG TWTDVKGAKK EELRVLVTPK SQGLQYRAVF TNPGGTVTTD AARLTVRGGS
|