Gene Information |
Locus tag | Cfla_1496 |
Symbol | |
ID | 9145382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1658763 |
End bp | 1659968 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003636593 |
Protein GI | 296129343 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
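The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the last codon is a stop codon that contributes no residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 1658763
end_bp = 1659968

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon,
# so it is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1206 0 401
```

The result matches the listed Gene Length (1206 bp) and Protein Length (401 aa).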
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCG ACCGACGTGC CCTCCCGGCC GACCCACGCG TGCGCGAGCT GGTGAAGCAG GCCCGGGGCG TCTCCCGGCG CCGCTTCCTG GCGGGGGCGG CGGGCGCCGC CGGCGCCGCG GCCCTGCTCG GCGCGTGCGG TACCGCGGGC CCGAGCCGGG CCGGGACGGG CGACTCGTCG ACAGGTCCGC TGCGGTGGGC GAACTGGACC CAGTACCTCG ACCAGAACGA GGACGGCACC AGCTTCCCCA CCCTCGAGGC CTTCGAGGAG CGGACGGGTA TCGCGGTCGA GTACTCCGAG GACATCGAGG ACAACGACTC GTTCTACGGC AAGATCTCGC AGCAGCTCGA GAACGGTCAG GACATCGGCT ACGACGTCGT CACCCTCACG GACTGGATGA CGGCGCGCTG GATCCGCCAG GGGTACGTCG CCGAGCTCGA CCGCACCCGC ATCCCCAACG CCGCCAACAT CCTGCCCAAC CTCGACGGCG TCGACTTCGA CGACGGCCGC AGGTTCTCGC TCACGTGGCA GTCCGGCTTC GCGGGCATCG CGTGGGACAC GCAGGCCATC CCGGAGGGGC TGCGGTCGGT GTCGGACCTG TGGGACCCGA GGCTCAAGGG GCGGGTCGAG GTGCTGTCGG AGATGCGCGA CACCATCGGC CTCATCATGC TCGAGAACGG CATGGACCCC GCGTCGAACT GGACCACGGA CGACTGGTTC GACGCGCTCG ACGTCGTGCG TCGCCACCTC GACGACGGGC AGATCCGCCG GGTGCGCGGC AACTCCTACA CGCAGGACCT GGCGTCGGGC GACGCGGTCG CGTGCTTCGC GTGGTCAGGT GACATCACGT CGCTCAACTA CGACTACGAC GGCAGGTTCG CGTTCGCGAT CCCCGACGCG GGCGGCACTC TCTGGAGCGA CAACCTCATG GTGCCGAGGT CGTCGGGGCG CAAGGCGCAG GCCGAGGAGC TCTTCGACTA CTACTACGAC CCCGAGGTCG CGGCCGAGGT CGCGGCCTGG GTCAACTACA TCACGCCCGT CCAGGGCGCG CAGGAGGCGA TGGCGCGGAT CGATCCCGAC CTCGCCGAGG ACACGAACAT CTTCCCGACC GCGGAGGTGC TGTCCCAGGT GCACGTCTTC CGGACGCTGA CGCCGGCCGA GGAGGAGCGC TACAACGGCC AGTTCCTCTC CGTGATCGGC GCGTGA
|
Protein sequence | MSSDRRALPA DPRVRELVKQ ARGVSRRRFL AGAAGAAGAA ALLGACGTAG PSRAGTGDSS TGPLRWANWT QYLDQNEDGT SFPTLEAFEE RTGIAVEYSE DIEDNDSFYG KISQQLENGQ DIGYDVVTLT DWMTARWIRQ GYVAELDRTR IPNAANILPN LDGVDFDDGR RFSLTWQSGF AGIAWDTQAI PEGLRSVSDL WDPRLKGRVE VLSEMRDTIG LIMLENGMDP ASNWTTDDWF DALDVVRRHL DDGQIRRVRG NSYTQDLASG DAVACFAWSG DITSLNYDYD GRFAFAIPDA GGTLWSDNLM VPRSSGRKAQ AEELFDYYYD PEVAAEVAAW VNYITPVQGA QEAMARIDPD LAEDTNIFPT AEVLSQVHVF RTLTPAEEER YNGQFLSVIG A
|
| |
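The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration rather than the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. Only the first two 10-base blocks and the last six bases of the gene sequence are used as inputs; paste the full sequence to check the whole record.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = "ATGAGCAGCG ACCGACGTGC"  # first two blocks of the gene sequence
tail = "GCGTGA"                 # last six bases of the gene sequence

print(translate(head))  # MSSDRR
print(translate(tail))  # A
print(gc_fraction(head))
```

The translated fragments agree with the start (MSSDRR...) and end (...A) of the listed protein sequence.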