Gene Information |
Locus tag | Cfla_0016 |
Symbol | |
ID | 9143880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 18226 |
End bp | 21198 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | glycoside hydrolase family 9 |
Protein accession | YP_003635135 |
Protein GI | 296127885 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
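The coordinate and length fields above are arithmetically related, and the relation can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record: Start bp 18226, End bp 21198.
length_bp = gene_length(18226, 21198)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2973 990
```

Both results match the Gene Length (2973 bp) and Protein Length (990 aa) fields listed in the record.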
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.114686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGCA CCTCCGCCCA CCGCGCGCGC CGCCGCTCGC TGTGGGCCCG CACCGCCGCC GTCGCGCTCG TCGTCAGCGG CACCGTGCTG CCCCTCCAGG GCGCGCAGGC GGCGCCGGCG TACAACTACG CCGAGGCGCT GCAGAAGTCG ATGTTCTTCT ACCAGGCGCA GCGCTCGGGC GACCTCCCGG CGAACTACCC GGTCTCGTGG CGCGGCGACT CCGGGCTCGA CGACGGCAAG GACGTCGGCA AGGACCTCAC GGGCGGCTGG TACGACGCCG GTGACCACGT GAAGTTCGGG CTGCCGATGG CGTTCACCGC CACGATGCTG GCGTGGGGCG CGCTCGAGAG CCCCGACGGG TACGCGGCGG CCGGCCAGAC CGACGAGCTG CGCGACAACC TGCGCGCCGT CAACGACTAC TTCGTCAAGG CACACACCGC GCCCAACGAG CTGTACGTGC AGGTCGGCAA GGGCGACGAC GACCACAAGT GGTGGGGCCC CGCCGAGGTC ATGACCATGG CCCGCCCCGC CTACAAGATC ACGGCCGCCT GCCCGGGCTC GGACGCCGCA GCCGAGACGG CCGCCGCCAT GGCGTCCGCC TCGCTCGTCT TCCAGGGCAG CGACCCCTCC TACGCGGCGA CGCTGCTGAC CCACGCCAAG CAGCTCTACT CGTTCGCCGA CACCTACCGC GGCAAGTACT CCGACTGCGT CACCGACGCG CAGTCGTTCT ACAAGTCGTG GTCCGGCTAC CAGGACGAGC TGGTCTGGGG CGCGTACTGG CTCTACAAGG CCACGGGCGA CGCCACCTAC CTGGCGAAGG CCGAGGCGGA GTACGACAAG CTGAGCAACG AGAACCAGAC GACGACGAAG TCGTACAAGT GGACCGTCGC GTGGGACGAC AAGTCGTACG CGGCGTACGC GCTGCTCGCC ATGGAGACGG GCAAGCAGAA GTACGTCGAC GACGCGAACC GCTGGCTCGA CTACTGGACC GTCGGCGTCA ACGGCGCCAA GGTGACCTAC TCCCCCGGCG GCATGGCCGT GCTCGACTCG TGGGGCGCGC TGCGGTACGC GGCGAACACC TCGTTCGTCG CCCTGGTGTA CTCCGACTGG CTGACCGACA GCACCCGGAA GGCGCGGTAC CACGACTTCG GCGTCCGCCA GATCAACTAC GCGCTGGGCG ACAACCCCCG CAAGTCGTCG TACGTCGTGG GCTTCGGGGC GAACCCGCCG AAGAACCCGC ACCACCGCAC GGCGCACGGC TCGTGGCTGG ACTCCCTCAA GGACCCGGCC GAGACGCGGC ACGTGCTGTA CGGCGCGCTC GTCGGCGGCC CCGGGTCGGC CAACGACGCG TACACCGACG ACCGCGGCGA CTACGTCGCC AACGAGGTCG CCACGGACTA CAACGCCGGG TTCACGAGCG CGCTGGCGTA CCTGACGGCC CAGTACGGCG GAACACCGCT GGCGTCGTTC CCGCGGCCCG AGACGCCGGA CCAGGACGAG TTCTTCGTCG AGGCCAAGCT CAACCAGCCT GCCGGTGGCA CGTTCACCGA GGTCAAGGCC GTCGTGCGCA ACCGGTCGGC GTTCCCGGCC CGGTCCCTGA CCAACGCCTC GATCCGTTAC TGGTTCACGC TCGACGCGGG CGTCGACGCG TCCGCGGTCA CGGTCTCGAC CAACTACAGC GAGTGCGGCA CGACGCCCTC GAAGGTCGTC CACGCCGGCG GCAGCCTGTA CTACGCCGAG CTGAGCTGCG CCGGGCAGAA CATCCACCCG GGCGGGCAGT CGCAGCACCG GCGGGAGATC 
CAGTTCCGCG TCGCCGGGAC CACCGGGTGG GACGCCACGA ACGACCCGTC GTTCGCCGGC CTGCCGGCGT CCGGCGACCC CGTCAGGACG AAGGGCATCA CGCTGCACGA AGGCAGCACC CTGCTGTGGG GGACGGTGCC GGGCGGGACC ACGCCGACGC CCACCCCCAC GCCCACCCCG ACGCCGACGC CCACCCCGAC GCCGACGCCC ACGCCGACGC CCACGCCGAC GCCCTCGCCG TCCCCGACGA TGCACCCGAT CATCGCGAGC CCGCAGGACG TGCGGCTGGT CGCCGCGGGT GACGAGCTGC GCCTGACGTG GACCGCGCCG ACGATCGGCG CGGCCGTCAC GGGCTACCAG ACGCAGCTCG ACGACGAGCC GCTCCAGAGC ACGCGCAACA CCTGGTACAA CCGGCTGGAC CTGGCACCGG GCGAGCACAC GATCCGCGTC CGTTCGCAGG CGGCCGAGGG CACGTCGGCG TGGGTCGAGC GGACCGTCAG CATCCCGGCC GACGGCGAGT GGCCGACGAC CGTGCCGGTG CCGGCGAACT TCGAGGCCAA GGGCGGCTCG CACGGCTTCT CGTTCACCTG GGAGACGCCC ACCGGCGGAC CGGAGGTCGT GGCGTTCGAG ACGATCATGA CGTGGTCCGG GAGCACGGGC AGCCCGACGA TCTCGCGGGA CACCGCGCAC GCGATCATGG GGTACGACTT CCCGCGCGGT GACTACACGG TGTCCGTCCG GTCGGTCGCC GCGGACAACT CCGTGTCGCC GTGGGTCACG GGGACGGCCA CGATCACCGA GGGCGGGATC CCCGGGATCA CCCCGCCCCC GACCCCGCCC CCGACCACGC CGCCGGCCGC GTCCTGCGCC GTCACGTACA CCGCGAACAG CTGGAACAGC GGCTTCACGG CCTCGATCCG ACTGACGAAC ACCGGTGCGA CGGCCGTGAC CTGGAAGCTC GCGTTCGACC TGGCGGCCGG GCAGAAGGTC CAGCAGGGCT GGAGCGCCAC GTGGGCCCAG TCGGGCACCA CGGTGACGGC CACCGGTGCC GCCTGGAACG CGACCCTGGC GCCCGGCGGG ACCATCGACC TGGGCTTCAA CGGGTCGCAC ACCGGCCAGA ACCCCGACCC GACGTCGTTC ACGCTGAACG GCGCGACCTG CACGGCCGGG TAG
|
Protein sequence | MPSTSAHRAR RRSLWARTAA VALVVSGTVL PLQGAQAAPA YNYAEALQKS MFFYQAQRSG DLPANYPVSW RGDSGLDDGK DVGKDLTGGW YDAGDHVKFG LPMAFTATML AWGALESPDG YAAAGQTDEL RDNLRAVNDY FVKAHTAPNE LYVQVGKGDD DHKWWGPAEV MTMARPAYKI TAACPGSDAA AETAAAMASA SLVFQGSDPS YAATLLTHAK QLYSFADTYR GKYSDCVTDA QSFYKSWSGY QDELVWGAYW LYKATGDATY LAKAEAEYDK LSNENQTTTK SYKWTVAWDD KSYAAYALLA METGKQKYVD DANRWLDYWT VGVNGAKVTY SPGGMAVLDS WGALRYAANT SFVALVYSDW LTDSTRKARY HDFGVRQINY ALGDNPRKSS YVVGFGANPP KNPHHRTAHG SWLDSLKDPA ETRHVLYGAL VGGPGSANDA YTDDRGDYVA NEVATDYNAG FTSALAYLTA QYGGTPLASF PRPETPDQDE FFVEAKLNQP AGGTFTEVKA VVRNRSAFPA RSLTNASIRY WFTLDAGVDA SAVTVSTNYS ECGTTPSKVV HAGGSLYYAE LSCAGQNIHP GGQSQHRREI QFRVAGTTGW DATNDPSFAG LPASGDPVRT KGITLHEGST LLWGTVPGGT TPTPTPTPTP TPTPTPTPTP TPTPTPTPSP SPTMHPIIAS PQDVRLVAAG DELRLTWTAP TIGAAVTGYQ TQLDDEPLQS TRNTWYNRLD LAPGEHTIRV RSQAAEGTSA WVERTVSIPA DGEWPTTVPV PANFEAKGGS HGFSFTWETP TGGPEVVAFE TIMTWSGSTG SPTISRDTAH AIMGYDFPRG DYTVSVRSVA ADNSVSPWVT GTATITEGGI PGITPPPTPP PTTPPAASCA VTYTANSWNS GFTASIRLTN TGATAVTWKL AFDLAAGQKV QQGWSATWAQ SGTTVTATGA AWNATLAPGG TIDLGFNGSH TGQNPDPTSF TLNGATCTAG
|
| |
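The gene sequence above is displayed in space-separated groups of ten bases and, as shown, begins with ATG and ends with TAG, so it reads in coding orientation even though the gene lies on the minus strand. The following minimal sketch shows one way to strip the grouping and run basic checks such as GC fraction. The helper names are hypothetical, and the start-codon check is deliberately stricter than translation table 11, which also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon at the end."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two display groups of the gene sequence in this record.
prefix = clean_sequence("ATGCCGAGCA CCTCCGCCCA")
print(len(prefix), gc_fraction(prefix))  # 20 0.7
```

Applying `clean_sequence` to the full Gene sequence field and then `gc_fraction` gives a value to compare with the GC content field; the exact figure depends on how the database rounds it.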