Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1897 |
Symbol | |
ID | 9145790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2111228 |
End bp | 2113021 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003636993 |
Protein GI | 296129743 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.754177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCT CACGTGTGCG CAGGCTGCGC ACCGCGCTCG GCAGCGTCCT CGCCGCCGGC GCCCTGACGC TCCCCCTCGC CCTGTCCGCG GCGCCTGCAC AGGCGGCCGA CACGCCCGAC TGGCTGCACG TCCAGGGCAA CAAGATCGTC GACGCCTCCG GCAAGGAGGT GTGGCTCACC GGCGTGAACT GGTTCGGGTT CAACGCCGAC GAGCGCGTCT TCCACGGCCT GTGGTCCGCG AACATGCGGA CGCTCACCAA GGGCATGGCC GACCGCGGCC TCAACGTCGT GCGCGTGCCG ATCTCCGCCG AGCTGATGCT CGAGTGGAAG GCGGGCACGT TCACCAAGCC GAACGTCAAC GAGTTCGCCA ACCCCGAGCT CGCCGGGCTC AACAGCCTGC AGATCTTCGA GAAGTTCCTC GAGATGAGTG ACGAGTACGG CCTCAAGGTC TTCCTCGACG TCCACTCCGC CGAGGCGGAC AACTCGGGGC ACGTCTACCC CGTGTGGTGG AAGGGCGACA TCACCACCGA GCACGTCTAC GAGGCGTGGG AGTGGGCGGC GGCCCGCTGG AAGACGAACG ACACGCTCAT CGGCGCCGAC CTGAAGAACG AGCCGCACGG CACCCAGGGC CAGACCGAGC GCGCCAAGTG GGACGGGTCG ACCGACAAGG ACAACTTCAA GCACTTCGCC GAGACGGCGG CGAAGAAGGT CCTGGCGATC AACCCGAACT GGCTGATCTT CGTCGAGGGC ATCGAGATCT ACCCGAAGGA CGGCGTGTCC TGGTCCTCGA CAGGTCTGAC CGACTACCAC AACATGTGGT GGGGCGGGAA CCTGCGCGGC GTGCGGGACT TCCCGATCGA CCTCGGTGCG AACCAGGACC AGCTCGTGTA CTCCCCGCAC GACTACGGGC CGCTCGTGCA CCTGCAGCCG TGGTTCAAGG GTGAGTGGAG CCGCGAGACG CTCGAGCGCG ACGTGTGGGA CCCGAACTGG CTGTACCTGC ACAAGGAGAA CACCGCACCG CTGCTCATCG GCGAGTGGGG CGGCTTCATG GACGGCGGGC CCAACGAGAA GTGGATGGTC GCGCTGCGCG ACCTCATCGT CGACCGGCGC CTGAGCCACA CGTTCTGGGT GCTCAACCCG AACTCGGGTG ACACCGGGGG CCTGCTCGGC TACGACTGGG CCACGTGGGA CGAGGAGAAG TACGCGCTGC TCAAGCCGGC GCTGTGGCAG GACGGCGGCA AGTTCGTCGG CCTCGACCAC GACGTCCCGC TGGGCGGCGT GGGCAGCACG ACGGGCAAGT CGCTGTCCCA GGTCAACGGC ACGTTCCCGT CGCCCACCGC GACGGCGACC CCGACGCCGA CCCCCTCGGT CACGCCGACG CCCACGCCGT CGGTGACGCC GACCCCGACC CCCTCGGTCA CGCCGACGCC GACGCCCAGT GCCACGCCGA CACCCACGGC GACGCCGACG AGCGCCGCAG GTGCGTGCAC GGTGACGTTC AGCGGCAACG CCTGGAACTC GGGCATGACC GGCGCCGTGC GGCTGACGAA CACCGGCAGC ACGCTGTCGG GCTGGACCTT GACGTTCACG GCGCCCGCGG GTGTGACGGT GACCCAGGGC TGGGGCGGCA CGTGGTCGCA GTCCGGCTCC ACGGTGACGG TCCGCAACGC GGACTGGAAC GGCACGCTCG CGACGGGCGG CACGGTCGAG ATCGGGTTCA ACGCCTCGCA CGGCGGCACG ACGGGCACGC CCTCGGGCTT CGCCGTCAAC GGGGCGTCCT GCGCCACGGC CTGA
|
Protein sequence | MTTSRVRRLR TALGSVLAAG ALTLPLALSA APAQAADTPD WLHVQGNKIV DASGKEVWLT GVNWFGFNAD ERVFHGLWSA NMRTLTKGMA DRGLNVVRVP ISAELMLEWK AGTFTKPNVN EFANPELAGL NSLQIFEKFL EMSDEYGLKV FLDVHSAEAD NSGHVYPVWW KGDITTEHVY EAWEWAAARW KTNDTLIGAD LKNEPHGTQG QTERAKWDGS TDKDNFKHFA ETAAKKVLAI NPNWLIFVEG IEIYPKDGVS WSSTGLTDYH NMWWGGNLRG VRDFPIDLGA NQDQLVYSPH DYGPLVHLQP WFKGEWSRET LERDVWDPNW LYLHKENTAP LLIGEWGGFM DGGPNEKWMV ALRDLIVDRR LSHTFWVLNP NSGDTGGLLG YDWATWDEEK YALLKPALWQ DGGKFVGLDH DVPLGGVGST TGKSLSQVNG TFPSPTATAT PTPTPSVTPT PTPSVTPTPT PSVTPTPTPS ATPTPTATPT SAAGACTVTF SGNAWNSGMT GAVRLTNTGS TLSGWTLTFT APAGVTVTQG WGGTWSQSGS TVTVRNADWN GTLATGGTVE IGFNASHGGT TGTPSGFAVN GASCATA
|
| |