Gene Information |
Locus tag | Cfla_3052 |
Symbol | |
ID | 9146964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3397280 |
End bp | 3398563 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003638134 |
Protein GI | 296130884 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.854825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.208902 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCAA GGAAGGCCCT CGCGGCCTGC GCGACGGTGC TGATGAGCGC GCTCGCGCTG ACCGCGTGCG CGGGTTCTGA CGACGGCGGC AGCTCGGAGG GCGCGAGCGA GCTGGTGTTC TGGCACAACT CCACCACCGG TGACGGCAAG GCGTACTGGG AGGAGGTCGG CGCGGCGTTC GAGGAGGAGA CCGGCATCAA GGTCGCCATC CAGTCCATCC AGAACGAGGA CATGGACGGC AAGCTGCAGA CGGCCCTCAA CGGCGGCGAC GCCCCGGACG TCTTCATGTC CCGCGGCGGC GGCAAGCTGG CCGCGGTCGT CGAGGCGGGC CAGGCCATGG ACCTCACCGA CCTCATCGAC GACGACGTGC GCGCGGCGGC CGGTGGCTCG CTCGACGCGT TCTCGGTCGA CGGCAAGGTC TACGGCATGC CCACCGCCGT GCTCCCCGGC GGCATCTGGT ACTCGAAGGA CCTCTTCGAG CAGGCCGGCA TCACCGAGAC GCCCACGACC ATGGGTGACC TCGAGGACGC GGTCGGCAAG CTCAAGGACG CCGGCATCCA GCCGATCGCG CTCGGTGCCA AGGACGCGTG GCCCGCGGCC CACTGGTACT ACTTCTTCGC GCTGCGCGCG TGCGCCCAGG ACACCATCAC GGACGCCGCC GCCGAGATGA ACTTCGACGA CCCGTGCTGG GTCAAGGCCG GCGAGGCCTT CGAGGAGTTC GCCTCGATCG AGCCGTTCAA CAACGGCTTC CTCACCACGA CCGCCCAGCA GGGCGCCGGC TCCTCGGCCG GTCTCCTCGC CAACAAGCAG GCGGCGATGG AGCTCATGGG TGCCTGGAAC CCGGGCGTCA TCGCGGGCCT GACGCCCGAC GGCGAGCCGC TCGCGGACCT CGGCTGGTTC CCCTTCCCGG CGGTCGACGG CGGTGACGGC GACCCCACGG CCATGATGGG CGGCGTCGAC GGCTACAGCT GCTTCGTGGA CGCCCCGAAG GAGTGCGCCG ACTTCCTCAA CTTCTACATG AAGAAGGAGT GGCAGGAGGG CTACGCGGAG GCGTTCGTCA CCATCCCGGC CAGCAAGGAC GCGCAGGCGG CCGTCACCGA CCCGGCCCTC ACGCAGGTCC TCGAGGCGTA CAACGGTGCG GCCTACGTGT CGGTGTGGCT GGACACGCTG TTCGGCAACA ACGTCGGCAA CGCCCTGAAC ACGTCGGTCG TCGAGATGCT CGCGGGCAGC GGCGACGCCG AGAGCATCGT CGCCACGGTC AAGTCCGCGG CAGCCAAGGA GTAA
|
Protein sequence | MKARKALAAC ATVLMSALAL TACAGSDDGG SSEGASELVF WHNSTTGDGK AYWEEVGAAF EEETGIKVAI QSIQNEDMDG KLQTALNGGD APDVFMSRGG GKLAAVVEAG QAMDLTDLID DDVRAAAGGS LDAFSVDGKV YGMPTAVLPG GIWYSKDLFE QAGITETPTT MGDLEDAVGK LKDAGIQPIA LGAKDAWPAA HWYYFFALRA CAQDTITDAA AEMNFDDPCW VKAGEAFEEF ASIEPFNNGF LTTTAQQGAG SSAGLLANKQ AAMELMGAWN PGVIAGLTPD GEPLADLGWF PFPAVDGGDG DPTAMMGGVD GYSCFVDAPK ECADFLNFYM KKEWQEGYAE AFVTIPASKD AQAAVTDPAL TQVLEAYNGA AYVSVWLDTL FGNNVGNALN TSVVEMLAGS GDAESIVATV KSAAAKE
|