Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2221 |
Symbol | |
ID | 9146121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 2479132 |
End bp | 2480712 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF404 |
Protein accession | YP_003637311 |
Protein GI | 296130061 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00605646 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGACC TCTTCGACGA CTACCCGGCG GGGGCCGCCT GGGACGAGAT GCTCGGTCCC GACGGCGAGG TGAGCGGCGC CTACCGGCAC GTGCACGCGG CCCTGGCGCA GCTCTCGGCC GGCGAGCTGC GCGCCCGCGC CGACACGCTC GCGCGCTCCT ACCTCAAGCA GGGCGTGACG TTCGACTTCG CGGGCGAGGA GCGACCCTTC CCGCTCGACG TCGTGCCGCG CGCACTGGCG GGCGAGGAGT GGGACCACGT CGCGCCCGGA GTCGCGCAGC GCGTGCGGGC GCTCGAGGCG TTCCTCGCCG ACGTGTACGG CCCGCAGAAG TCGATCGCCG ACGGTGTCGT GCCGCGGTCG GTCGTGGTGT CGTCGACGCA CTTCCACCGT GCCGCGCGTG GCGTCGAGCC GCCCAACGGC GTGCGGGTGC ACGTCTCGGG CATCGACCTG GTGCGCGACT CCCTCGGCGG CTGGCGCGTC CTGGAGGACA ACGTGCGCGT GCCGTCCGGC GTCAGCTACG TGCTGTCGAA CCGCCGGGCC ATGGCCCAGA CCTTCCCGGA GCTGTTCGCC GCGCTGCGCA TCCGCCCCGT CGTCGACTAC CCCCGGCGCC TGCTGGCGGC ACTGACCGCC GCGGCGCCCC AGGGCGTCGA CGACCCGACG GTCGTGGTGC TCACTCCGGG CGTCTTCAAC TCGGCCTACT TCGAGCACAG CCTGCTGGCA CGCACCATGG GTGTGGAGCT CGTCGAGGGC CGCGACCTCT ACGTCTCCGG TGGCCGGGTC TGGATGCGCA CCACCCAGGG CCGACGGCGC GTCGACGTCA TCTACCGGCG CGTCGACGAC GAGTTCCTCG ACCCGGTGAC GTTCCGCTCG GACTCGCTGC TCGGCTGCCC GGGCCTCATG ACGTGCGCGC GGCTCGGCAC CGTCACCATC GCCAACGCCA TCGGCAACGG CGTGGCCGAC GACAAGCTGC TCTACACCTA CGTGCCGGAC CTCATCCGCT ACCACCTGGG CGAGGAGCCG ATCCTCCCCA ACGTCGACAC GTGGCGCCTC GAGGACCCCG GCGCGCTCGC GGAGGTGCTC GACCGGCTCG ACGAGCTGGT CGTCAAGCCG GTCGACGGGT CGGGCGGCAA GGGTCTGGTC GTGGGACCGC GGGCGACGCG CGCCGAGCTC GACGAGCTGC GGTCACGGCT GCGTGAGGAC CCGCGCGGCT GGATCGCCCA GCCGGTCGTC CAGCTGTCCA CCGTGCCGAC CCTCGTCGAG GACGGTCTGC GGCCGCGGCA CGTCGACCTG CGCCCGTTCG CCGTCAACGA CGGCGAGTCC GTCTACGTGC TGCCCGGCGG TCTCACCCGC GTCGCGCTGC CCGAGGGCCA GCTGGTCGTC AACTCCTCGC AGGGCGGCGG GTCCAAGGAC ACGTGGGTCC TCGGGGGTCG CGTCCCGCGG CGCGCGCAGT CGCAGAGCCA GTCGCTGCCG CAGGCGGTGC CGAAGGACGC GTCGGTGCCG ATCGACTCCC ACCCCTCGGA CCGGCGCGCG CAGGTCATGC AGCAGCAGCA GCAGCGGACG GCGGGGGGCG CGACGTGCTG A
|
Protein sequence | MADLFDDYPA GAAWDEMLGP DGEVSGAYRH VHAALAQLSA GELRARADTL ARSYLKQGVT FDFAGEERPF PLDVVPRALA GEEWDHVAPG VAQRVRALEA FLADVYGPQK SIADGVVPRS VVVSSTHFHR AARGVEPPNG VRVHVSGIDL VRDSLGGWRV LEDNVRVPSG VSYVLSNRRA MAQTFPELFA ALRIRPVVDY PRRLLAALTA AAPQGVDDPT VVVLTPGVFN SAYFEHSLLA RTMGVELVEG RDLYVSGGRV WMRTTQGRRR VDVIYRRVDD EFLDPVTFRS DSLLGCPGLM TCARLGTVTI ANAIGNGVAD DKLLYTYVPD LIRYHLGEEP ILPNVDTWRL EDPGALAEVL DRLDELVVKP VDGSGGKGLV VGPRATRAEL DELRSRLRED PRGWIAQPVV QLSTVPTLVE DGLRPRHVDL RPFAVNDGES VYVLPGGLTR VALPEGQLVV NSSQGGGSKD TWVLGGRVPR RAQSQSQSLP QAVPKDASVP IDSHPSDRRA QVMQQQQQRT AGGATC
|
| |