Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_3365 |
Symbol | |
ID | 9147281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3739500 |
End bp | 3740867 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003638443 |
Protein GI | 296131193 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.079846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00220666 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCTCGAT CCACTGGAAG GAACGCGGCC GCAGCCACGT CGCTGTTCCT CGCAAGCATC CTGGCCCTCA CGGCGTGCAG CGCTCCCGGC GCCGACAGCA CCGCCGACGA CGCGGCCGAC GACGGCGCCA CCACCGCCGC CGACGGCGAC TGGACCTGCG GCACGGACGA CGTCACGCTC GACGCGTACC TCGAGACCGG CTTCCCGCTG TCCGCCGCGC TGTTCGAGGA GTTCGAGAAG CAGTACCCGA ACGTCACGTT CGACGTCCGC GAGGACCAGT TCGCCGTCAT CACCCAGAAC GCGCCCCGCG TGCTGGCCGA CAACCCGCCG GACCTCATGC GCCTGCCGCA GATGTCCGAC CTCGTCGGCG ACGGCCTGCT CTACAACCTC GACGAGGCCG CCGCGCACTT CGGCTGGGAC GAGTGGCCGG CCTCGCAGCT CGCGCAGATG CGGGTCGACG ACGAGGGCCG CCGCGGCGAC GGACCCCTGT ACGCGATGGG CAAGAACTAC TCGATGACCG GCGTCTTCTA CAACACCGAG CTCGCCGAGC AGATCGGCAT CACCGAGCCC CCGGCCACGC TGGCCGAGCT CGACGACATG ATGCAGAAGG CCAAGGACGC CGGCATCACG CCGTCCGACC AGTTCAACGG TGGCGCCACC GGCGGCCTGG CGTTCCCGCT GCAGCTCCTC ATGGCGTCCT ACGGCTCCGT CGACCCGATC AACGACTGGA CCTTCCAGAA GCCGGGCGCG CGCATCGACA CCGAGGACAA CATCAAGGCC GCCGAGCACC TCAAGAAGTG GATCGACGCC GGCTACTTCG CCGACGACAT CAACTCGCTC GACTACTCGC AGATGATGGG CCGCTTCATC GACGGCAAGA GCCTGCTGAT CTTCAACGGC GACTGGGAGT CGGGCAACCT CGACACCCAG ATGGCGGGCA AGGCCGGCTT CTTCCTCATG CCCCCGCTCG AGGAGGGTGG CAAGGTCGGC GCCATGTCGG CCCCGCTGAC GTTCGGCATC TCGTCCAAGG CGGAGAACCC CGAGTGCGCG GCGTTCTTCT TCGACTGGAT CGCCACGAAC GACGAGGCGC GCACGATCGC CGTCGAGATC GGTGGCTCGC ACCCCATGGG CCCGGCCGAC GCCTTCATGC CCGAGGTCGA CGCCGACTCC GTGACGGGTC AGACGCTCGC CGCCGGTGCC ACCATCGCCG ACGACAACGG CGCGATGGAG TTCATCGCCA ACGCGACGGG TGCCATCTAC GCCAAGAGCT GGACGCCGAA CCTGCAGAAG CTCGCCGCCG GCGAGCAGAC GGCCGAGGGC CTCCTCACCG CCGTGCAGGC CGACTACGAG AACGACGTCA ACGGCTGA
|
Protein sequence | MSRSTGRNAA AATSLFLASI LALTACSAPG ADSTADDAAD DGATTAADGD WTCGTDDVTL DAYLETGFPL SAALFEEFEK QYPNVTFDVR EDQFAVITQN APRVLADNPP DLMRLPQMSD LVGDGLLYNL DEAAAHFGWD EWPASQLAQM RVDDEGRRGD GPLYAMGKNY SMTGVFYNTE LAEQIGITEP PATLAELDDM MQKAKDAGIT PSDQFNGGAT GGLAFPLQLL MASYGSVDPI NDWTFQKPGA RIDTEDNIKA AEHLKKWIDA GYFADDINSL DYSQMMGRFI DGKSLLIFNG DWESGNLDTQ MAGKAGFFLM PPLEEGGKVG AMSAPLTFGI SSKAENPECA AFFFDWIATN DEARTIAVEI GGSHPMGPAD AFMPEVDADS VTGQTLAAGA TIADDNGAME FIANATGAIY AKSWTPNLQK LAAGEQTAEG LLTAVQADYE NDVNG
|
| |