Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1332 |
Symbol | |
ID | 9145212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1477450 |
End bp | 1480482 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Endo-1,3(4)-beta-glucanase |
Protein accession | YP_003636429 |
Protein GI | 296129179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.137473 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00217358 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCAACAC ATCACAGGAC CCCCCAGGGG TCCCCATCCC ACCATGCCCG CAGCAGGGGC CCGGCGCCGC ACGCGCAGCG GCGGACAGCG GCCGGAACCG CACTGGCGCT GGCCGTCGCA GCCGCACTCG TCGTCGTGCC GTCGGGCGCG TCGACGGCGG CCGAGGGCGT CGTCGACGTC GGTGCCGGCG GGTACGCCGC CGCGCCCGTC GGTCCGACGC CCGAGGGGTG CGACTCGATC GAGGCCGATC CCCGCTCGGC CCTGACCGAG GACGCGCCGC AGGGCCCGCT GCCGACCAAC GACTGGTGGT CGTCGCTGCT GTACAAGCGC CTCGACTGCC GCATGAGCGA ACCCCTGCAC GCGCACCCCG CGTCCTACCA GCCCACCCCC GCGGGCCTGG GGATCTCGAC GCCTCGCGAG GCGACGCTGT CCGGGACGAA GGGCGGCATC GGTGAGTTCC ACTTCACGTA CGTCCAGGAC GTCCTGGTCG GTGTCGCGGG CCTCGACGCC CCGACGGTCC AGGTCGCCGG CTGGACCGAC TGGACCGTGA CACCGTCGTG GTCCGACGGG ACGCGTTCGC TGCGCGCCAC GATCGGGCAC GGCCTGCCCA CGTCCTGGTA TCACGTCGAG GGCGGTGACG CCCTGCTGCG CTCGCAGCAC GACGTGCGCG TCTGGCAGCG CGACGGCAGC ACCGTCGGGT TCACGGCCAA CGGTCACGAC TACGCGGCGT TCGCGCCGTC GGGCGCGAGC TGGGACGTCT CCGGTTCCAC GCTGCGCTCC TCGCTCGCGG GCCAGGGCTA CCTCGCGGTG ACGGCGCTTG CCACGGCCGC CGACGCGACC GACGCCGACC GCGAGGCGGC GCTCCAGGCG GTCGCCGGCT CCGCGTTCGC CGAGGTCACC AGCACCGAGG CGAGCTACAG CTACGACGCT GCCGGCGCCG TGGTCTCGAC GACCTACGAG ATCGGGACGA GCGCGCTCGA GGGCTACGCC GAGGGTGCCG TCGTCGCGCT CTACCCGCAC CAGCAGCGCT ACCTCGCGGA CGTCGACGGC GACGAGCTCG ACGCGACCTA CCCGAGCCCG CGCGGCACGA TGACGGCGTA CGCGGGCACC ACGTCGTTCA CCACCGAGAC GCCGTTCACC GGCATCCTCC CCGAGGTCCC GGCCGTCGCG ACGGCCGACG GCGAGGGCCG TGCCACGCTC GACCGCCTGC TCGCCGAGGC CGCAGCCGAC CCGCTGCCGA TCCTGCGGGC CGACACCTAC TGGACCGGCA AGGCCCTGGG CCGTGCCACG CGGATCATCG AGATCGCCGA CCAGCTCGGC GAGACCGAGG TGCGCGACCG CACTCTGCGG CTGGTCCGCG ACACCCTGAC CGACTGGTTC ACCGCCGAGC CCGGCAAGTC CGAGCAGGTC TTCGCCTACG ACGAGCGCTG GGGCACCCTC ATCGGCTACC CCGCGTCCTA CGGCTCGGAC ACCGAGCTCA ACGACCACCA CTTCCACTAC GGCTACTTCA TCGCCGCAGC GGCCACGCTC GCACGCTTCG ACCCCGCGTG GGCGTCGGAC GAGCAGTACG GCGGCATGGT CGACCTGCTG ATCCGCGACG CCAACGGGTA CGACCGCGCC GAGACGCGGT TCCCGTACCT GCGGGACTTC GACATCTACG CCGGGCACGA CTGGGCCTCG GGACACGGTG CGTTCGCGGC CGGCAACAAC CAGGAGTCGA GCTCCGAGGG CCAGAACTTC GCGGGCGCGC TCGTCCAGTG GGGCGAGGCG ACGGGGAACA CGGCGGTGCG CGACGCCGGT GCGTACCTCT ACGCCACGCA GGCCGCGACG ATCCAGGAGT ACTGGTTCGA CCAGGCCAAG GCGATCCCGG ACGAGTTCGG CCACACGACC CTCGGCATGG TCTGGGGCGA CGGCGGCACG TACTCGACGT GGTTCTCCGC CGAGGCGGAG ATGATCCAGG GCATCAACAC GCTGCCCATC ACCGGCTCGC ACCTGTACCT CGGTATCCGG CCCGACGACG TGGTCGAGAA CTACGCCGAG CTCGTCAAGG CCAACGGCGG CAAGCCCACG GTCTGGCAGG ACATCCTGTG GAGCTATCTC GCGCTCGGTA ACGGCGAGGA GGCGCTGGAG CAGCTGGAGG CCGACCCCGG CTACGCCGTC GAGGAGGGCG AGTCGCGGGC GCACACCTAC CACTGGGTCG CCAACCTGGC GGCGCTGGGG AACCTCGACA CCACCGTGCG CGGGTCGAGC CCGCTGTCGG CGGCGTTCGT CAAGGACGGT GCGCGGACGT ACGTCGCCGC CAACGTCTCG TCGAAGGCCC GCACGGTCGT CTTCAGTGAC GGGACGAAGG TCGAGGTCCC GCCGGGCAAG ACCGTCGCGA CGGGTGCGCA CACGTGGTCC GGCGGCGGCG CGGTCGGCGG TCCCGGTGGT CCGCAGCCGA CCACCGAGCC GAAGCCGACG GTGACGCCGG CGCCCACGGC CACGCCGAAG CCGACGCCCA CGGCCACGCC GAAGCCGACG GTGAGCCCGA AGCCGACCCC GAAGCCCCAG CCGACGACGC CCGCGGGCGG GTTCCGTCTG GCCTTCGGTC CCGGCGGCAC GCTCGTGCCG TCGCCGGGCG CGCCGGGCGC CCTCGAGGTG CCGGCGGCGC GCGGGATCGA CACGGCCACC GAGGCGCCGG ACGCGGTCGT GGCGCAGGCC ACGGGCCTGA ACGGCACCGC CACGGGCGCC GCGACGGCGT TCGACCTCGC GCTGGACGCG GGCACCCGGG TCGGCAACGG CACGCGCGTG TCGGTCTCGT ACGACCTCAC GGGCGACGGG ACGTGGGACC GGGTCGAGGT GTACCGGTAC TTCGCGACCG ACCCGGTGCC GGGCCCCGAG CGCTACACGC AGACCGTCGG CCTGGACCGG GTGACCGGAG AGCTCGGTGA CCTGCGCAAC GGCACGGTGC GGGTCGCGGT GTGGAACGCG ATCGGCAGCA GCCCGACGTC GGTGAGCACC GGTGACTCCG TGGTGGAGCT GCCGTTCCGC TGA
|
Protein sequence | MSTHHRTPQG SPSHHARSRG PAPHAQRRTA AGTALALAVA AALVVVPSGA STAAEGVVDV GAGGYAAAPV GPTPEGCDSI EADPRSALTE DAPQGPLPTN DWWSSLLYKR LDCRMSEPLH AHPASYQPTP AGLGISTPRE ATLSGTKGGI GEFHFTYVQD VLVGVAGLDA PTVQVAGWTD WTVTPSWSDG TRSLRATIGH GLPTSWYHVE GGDALLRSQH DVRVWQRDGS TVGFTANGHD YAAFAPSGAS WDVSGSTLRS SLAGQGYLAV TALATAADAT DADREAALQA VAGSAFAEVT STEASYSYDA AGAVVSTTYE IGTSALEGYA EGAVVALYPH QQRYLADVDG DELDATYPSP RGTMTAYAGT TSFTTETPFT GILPEVPAVA TADGEGRATL DRLLAEAAAD PLPILRADTY WTGKALGRAT RIIEIADQLG ETEVRDRTLR LVRDTLTDWF TAEPGKSEQV FAYDERWGTL IGYPASYGSD TELNDHHFHY GYFIAAAATL ARFDPAWASD EQYGGMVDLL IRDANGYDRA ETRFPYLRDF DIYAGHDWAS GHGAFAAGNN QESSSEGQNF AGALVQWGEA TGNTAVRDAG AYLYATQAAT IQEYWFDQAK AIPDEFGHTT LGMVWGDGGT YSTWFSAEAE MIQGINTLPI TGSHLYLGIR PDDVVENYAE LVKANGGKPT VWQDILWSYL ALGNGEEALE QLEADPGYAV EEGESRAHTY HWVANLAALG NLDTTVRGSS PLSAAFVKDG ARTYVAANVS SKARTVVFSD GTKVEVPPGK TVATGAHTWS GGGAVGGPGG PQPTTEPKPT VTPAPTATPK PTPTATPKPT VSPKPTPKPQ PTTPAGGFRL AFGPGGTLVP SPGAPGALEV PAARGIDTAT EAPDAVVAQA TGLNGTATGA ATAFDLALDA GTRVGNGTRV SVSYDLTGDG TWDRVEVYRY FATDPVPGPE RYTQTVGLDR VTGELGDLRN GTVRVAVWNA IGSSPTSVST GDSVVELPFR
|
| |