Gene Information |
Locus tag | Cfla_3054 |
Symbol | |
ID | 9146966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3398929 |
End bp | 3400218 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003638136 |
Protein GI | 296130886 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0183103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCAT CGAGGACGTG GGCGGCGTCG GCCGCCGTGC TCGCGGGCGC GCTGGCGCTC GCGGCGTGCA GCGGGGGCGA CACGGGCGAC GGCGGCGACG GCGGCACCGT GCAGATGACG TTCTGGCACA ACGCGACCAC CGACGACGGC AAGAAGTTCC AGGAGGACGT GGCCGCCGCG TTCGAGGACG CGAACCCGGG CGTCGAGATC ACGCTGCAGG TCGTCCAGAA CGAGGAGATG GACGGCAAGC TCCAGACGGC CCTGAACGGC GGCGACGCGC CGGACATCTT CATGGCGCGC GGCGGCGGCA AGCTCGCCGA CATGGTGGCG GCCGGTCAGA TCATGGACCT CACGGACCTC GTCGACGACG CGTCGCGCGA GGCGCTGGGC GGCTCGCTCG ACGCGTACAC GATCGACGGC AAGGTCTACG GCATGCCGAC CGCCGTGCTG CCGGGCGGCA TCTTCTACAG CAAGGACCTG TTCGAGGCGG CCGGCATCGA GTCGACCCCG ACGACGGTCG ACCAGCTGGT CGACGCGGTC GAGAAGCTCA AGGGCACCGG CGTCGCGCCG ATCGCGCTGG GTGGCAAGGA CGCCTGGCCG GCGGCTCACT GGTACTACTT CTTCGCGCTG CGCGCGTGCT CGCAGGAGAC GCTCGAGAAG GTCGCCGTCG AGCGCACGTT CGACGACCCG TGCTGGTTGG AGGCGGGGCA GGCGTTCGAG GAGTTCGCGG GCGTCGAGCC GTTCAACGAG GGCTTCCTCA CGACGACGGC GCAGCAGGGC GCCGGGTCGT CCGCGGGCCT GGTCGCCAAC CACCAGGCGG CCATGGAGCT CATGGGCGCC TGGAACCCGG GGGTCATCGG CGGGCTCACG CCGGACGGCG AGCCGCTGGC GGACCTCGGC TGGTTCCCGT TCCCCGAGGT CGAGGGCGGC CAGGGCGACC CGACGGCGAT GATGGGCGGT ATCGACGGTC TGAGCTGCCA CGTCGACGCG CCGCCGGAGT GCGCGGAGTT CCTCAACTTC CTCATCCTCA AGGAGAACCA GGAGGACTTC GCCGAGGCGT TCGTGTCCCT GCCGGCGAGC AAGGACGCGC AGGACGTCGT GACGGACCCG GCGCTGAAGG ACATCCTCGC GGCGTACGAC GACGCGGCGT ACGTGACGGT CTGGCTCGAC ACCCTGTTCG GGCAGAACGT CGGCAACGCG CTCAACACCG CGGTCGTCGA GATGCTCGCG GGTCAGGGCG ACGCCCAGAG CATCGTCGAC ACCGTGACCG CCGCCGCGGC CAAGGAGTGA
|
Protein sequence | MRASRTWAAS AAVLAGALAL AACSGGDTGD GGDGGTVQMT FWHNATTDDG KKFQEDVAAA FEDANPGVEI TLQVVQNEEM DGKLQTALNG GDAPDIFMAR GGGKLADMVA AGQIMDLTDL VDDASREALG GSLDAYTIDG KVYGMPTAVL PGGIFYSKDL FEAAGIESTP TTVDQLVDAV EKLKGTGVAP IALGGKDAWP AAHWYYFFAL RACSQETLEK VAVERTFDDP CWLEAGQAFE EFAGVEPFNE GFLTTTAQQG AGSSAGLVAN HQAAMELMGA WNPGVIGGLT PDGEPLADLG WFPFPEVEGG QGDPTAMMGG IDGLSCHVDA PPECAEFLNF LILKENQEDF AEAFVSLPAS KDAQDVVTDP ALKDILAAYD DAAYVTVWLD TLFGQNVGNA LNTAVVEMLA GQGDAQSIVD TVTAAAAKE
|
| |
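The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TGA) that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
start_bp = 3398929
end_bp = 3400218
gene_length_bp = 1290
protein_length_aa = 429


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions both checks pass: 3400218 - 3398929 + 1 = 1290 bp, and 1290 / 3 - 1 = 429 aa.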