Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0689 |
Symbol | |
ID | 9144560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 745521 |
End bp | 747146 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | chaperonin GroEL |
Protein accession | YP_003635800 |
Protein GI | 296128550 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00401944 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000000708621 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCAAGA TCATCGCCTT CGACGAGGAG GCCCGGCGGA GCATGGAGCG CGGGCTCAAC GTCCTCGCCG ACACCGTCAA GGTCACCCTC GGCCCCAAGG GCCGCAACGT CGTGCTCGAC AAGAAGTGGG GCGCGCCGAC GATCACCAAC GACGGCGTCT CCATCGCCAA GGAGATCGAG CTCGAGGAGC CGTTCGAGAA GATCGGCGCC GAGCTGGTCA AGGAGGTCGC GAAGAAGACG GACGACGTCG CCGGTGACGG CACGACGACC GCGACCGTCC TGGCCCAGGC GCTCGTGCGC GAGGGTCTGC GCAACGTGGC CGCCGGCGCC AACCCGATCG CCCTGAAGAA GGGCATCGAG AAGGCCGTCG AGGCCGTCAC GGCCCAGCTC CTGGCCCAGG CCAAGGAGAT CGAGACCAAG GACGAGATCG CCGCCACGGC CGCCATCTCC GCCGGCGACC CCGCGATCGG CGAGCTCATC GCCGAGGCCC TCGACAAGGT CGGCAAGGAG GGCGTCATCA CGGTCGAGGA GTCCAACGCC CTCGGCCTGG AGCTCGAGCT CACGGAGGGC ATGCGCTTCG ACAAGGGCTT CCTGTCGGCG TACTTCGTGA CCGACCCGGA GCGCCAGGAG GCGGTCCTCG AGGACGCGTA CGTCCTGCTC GTCGAGTCCA AGGTCTCGAA CGTCAAGGAC CTGCTGCCGC TGCTGGAGAA GGTCATCCAG GCCGGCAAGC CGCTGCTCAT CGTGGCCGAG GACGTCGAGT CCGAGGCGCT GGCGACGCTC GTCGTCAACC GCATCCGGGG CATCTTCAAG TCCATCGCCG TCAAGGCGCC GGGCTTCGGT GACCGCCGCA AGGCGATGCT GCAGGACATG GCCGTCCTCA CCGGTGGCCA GGTCGTCTCC GAGACCGTCG GCCTCAAGCT CGACTCGGTC GGCCTCGAGG TGCTCGGCAC CGCGCGCAAG GTCGTCGTGA CGAAGGACGA GACCACGATC GTCGAGGGTG GCGGCGAGGC CGACCAGATC GCCGGCCGCG TCAAGCAGAT CCGCGCCGAG ATCGACAACT CCGACTCGGA CTACGACCGC GAGAAGCTCC AGGAGCGCCT CGCCAAGCTC GCCGGCGGCG TCGCCGTCAT CAAGGCGGGC GCGGCCACCG AGGTCGAGCT CAAGGAGCGC AAGCACCGCA TCGAGGACGC CGTCCGCAAC GCGAAGGCGG CCGTCGAGGA GGGCATCGTC GCCGGTGGTG GCGTCGCGCT CATCCAGGCC GGTGCCAAGG CGTTCGAGAA GCTCGAGCTC GAGGGCGACG AGGCGACCGG TGCCAACATC GTGAAGTACG CGATCGAGGC CCCGCTCAAG CAGATCGCCG TCAACGCCGG CCTCGAGGGC GGCGTCGTCG CGGAGCGCGT GCGCAACCTC CCCGCCGGTC AGGGCCTCAA CGCCGCGACC GGTGTGTACG AGGACCTGCT GGCCGCGGGC GTCAACGACC CGGTCAAGGT CACGCGGTCC GCGCTGCAGA ACGCGGCGTC GATCGCGGCG CTGTTCCTCA CCACCGAGGC CGTCGTGGCC GACAAGCCGG AGAAGGCCGC TGCGCCCGCC GGTGGCGGTG GCGAGGACTT CGGCGGCGGC TTCTGA
|
Protein sequence | MAKIIAFDEE ARRSMERGLN VLADTVKVTL GPKGRNVVLD KKWGAPTITN DGVSIAKEIE LEEPFEKIGA ELVKEVAKKT DDVAGDGTTT ATVLAQALVR EGLRNVAAGA NPIALKKGIE KAVEAVTAQL LAQAKEIETK DEIAATAAIS AGDPAIGELI AEALDKVGKE GVITVEESNA LGLELELTEG MRFDKGFLSA YFVTDPERQE AVLEDAYVLL VESKVSNVKD LLPLLEKVIQ AGKPLLIVAE DVESEALATL VVNRIRGIFK SIAVKAPGFG DRRKAMLQDM AVLTGGQVVS ETVGLKLDSV GLEVLGTARK VVVTKDETTI VEGGGEADQI AGRVKQIRAE IDNSDSDYDR EKLQERLAKL AGGVAVIKAG AATEVELKER KHRIEDAVRN AKAAVEEGIV AGGGVALIQA GAKAFEKLEL EGDEATGANI VKYAIEAPLK QIAVNAGLEG GVVAERVRNL PAGQGLNAAT GVYEDLLAAG VNDPVKVTRS ALQNAASIAA LFLTTEAVVA DKPEKAAAPA GGGGEDFGGG F
|
| |