Gene Information |
Locus tag | Cfla_0501 |
Symbol | |
ID | 9144368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 533260 |
End bp | 534561 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003635614 |
Protein GI | 296128364 |
COG category | |
COG ID | |
TIGRFAM ID | |
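The length fields above follow from the coordinates by simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information table above.
start_bp = 533260
end_bp = 534561

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the result agrees with the listed Gene Length (1302 bp) and Protein Length (433 aa).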
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000488029 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000184856 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAGCTTC GCCACCTCGC AGCAGCGTCC GTCCCCGTCC TCCTGCTCGC GGCCTGCTCG AGCGGCGGCG ACGCCGCGGA CGACGGGCCC GTCACGCTCA CGTACTGGGC CAGCAACCAG GGCACGAGCC TCGACCACGA CAAGGAGGTC CTCACGCCCG TCCTGGAGGA CTTCACGGAG CGCACGGGCG TCGAGGTCGA CCTGGAGGTC ATCGGCTGGA GCGACCTGCA GACGCGCATC CAGACGGCCG TCACGTCCGG CCAGGCGCCC GACGTGGTCA ACATCGGCAA CACGTGGGCG GTGTCCCTGC AGGCCACGGG CGCGTTCCTG CCGCTCGACG ACGCGGCGAT GGACGCGATC GGTGGCGCCG ACAAGTTCGT GGCGACCGCG CTGGAGACCG GCGGTGCGCC GGGGACCGAC CCGACGTCGG TGCCGCTGTA CGGGCTGGCG TACGGGCTCT ACTACAACAC GGCGATGTTC GCCGACGCAG GGCTGCAGCC GCCGACGACG TGGGAGGAGA TGGTCGCCGC CGCGCAGGCG CTCACCGACC CCGCGGCGGG CGTGTACGGC ATGGCGCTGG CGGCCGGCTC GTACACGGAG AACAACCACT TCGCGTTCAT CAACGCCACG CAGAACGGCG CCGAGCTGTT CGACGCCGAC GGCAACCCGA CGTTCACGGG CGACGGCGTC GTCGACGGCA TCGTGCGCTA CCTCGACCTC ATGCAGGACG CCGGCGCGGT GAACCCCGCG AACGCGCAGT ACGACAACGC GTCGTTCGCG GCGGCGGACT TCGCGAACGG CAAGGCCGCG ATGATCCTCA ACCAGAGCAA CGCGGGCGCG ACCATCGAGG CGAACGGCAT GGCGCCCGAC GCGTACGGCG TCGTCCCGTT CCCGGCACCG CAGGACGCCG TGAGCGACGT CGCGAGCCAC GTCGCGGGCA TCAACGTGTC GGTCTTCGGC AACACCGAGC ACCCCGACGA GGCGCTGCAG CTCGTCGAGC ACCTGACGAG CGCGGACGTG CAGACCACGC TGGGCAGGCC GTTCTCGTCG CTCCCGGTGC TGAAGGACGC GACGGCGGCG TTCACGGACG ACGCCGAGCT GGCCGCGATC TTCACGGAGA TCTACAACGA GCGCTCCGCA CCGCTGCCCC TGGTGCCCGC GGAGGACCAG TTCGAGACGA CGGTCGGCAA GGCGATGAAC GCGATGTTCG CGACCATCGC CACGGGCGGC ACGGTCACCG CGGACGACGT GCGTGAGGCG ATGCAGACCG CGCAGGACCA GGTGCAGGCG TCGGTCGGCT GA
|
Protein sequence | MKLRHLAAAS VPVLLLAACS SGGDAADDGP VTLTYWASNQ GTSLDHDKEV LTPVLEDFTE RTGVEVDLEV IGWSDLQTRI QTAVTSGQAP DVVNIGNTWA VSLQATGAFL PLDDAAMDAI GGADKFVATA LETGGAPGTD PTSVPLYGLA YGLYYNTAMF ADAGLQPPTT WEEMVAAAQA LTDPAAGVYG MALAAGSYTE NNHFAFINAT QNGAELFDAD GNPTFTGDGV VDGIVRYLDL MQDAGAVNPA NAQYDNASFA AADFANGKAA MILNQSNAGA TIEANGMAPD AYGVVPFPAP QDAVSDVASH VAGINVSVFG NTEHPDEALQ LVEHLTSADV QTTLGRPFSS LPVLKDATAA FTDDAELAAI FTEIYNERSA PLPLVPAEDQ FETTVGKAMN AMFATIATGG TVTADDVREA MQTAQDQVQA SVG
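The gene sequence begins with GTG rather than ATG, while the protein begins with M. This is consistent with translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 18 bases only; the codon dictionary is deliberately partial, the start-codon set lists only three common initiators, and the function name is hypothetical.

```python
# Partial codon table: only the codons present in the first 18 bases of the gene.
CODONS = {"GTG": "V", "AAG": "K", "CTT": "L", "CGC": "R", "CAC": "H", "CTC": "L"}

# Assumption: a subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short coding-strand fragment that starts at the start codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First 18 bases of the gene sequence, with the record's spacing kept.
prefix = "GTGAAGCTTC GCCACCTC"
peptide = translate_prefix(prefix)
print(peptide)
```

The printed peptide matches the first six residues of the protein sequence (MKLRHL). A full translation would need the complete table 11 codon set, for example from an established bioinformatics library.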