Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2399 |
Symbol | |
ID | 9146302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 2690367 |
End bp | 2691668 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003637488 |
Protein GI | 296130238 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00275508 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGCAAGA CCACACGCAA GGCGTGGGCG CTCGCAGCGG GTGTGACGAG CATTGCGCTC GTCGCGACCG CCTGCTCCTC GAGCGACGAC CCGGGCGAGG GTGACGAGAC CGCGGACGGC GGCAACATCA CCCTCACGGT CGCCACCTTC AACGAGTTCG GCTACACGGA CGAGATGTTC GACCGGTACG AGGCCGAGCA CCCCGGCGTC ACGATCGAGC AGAAGGTCGC CGCCACCTCG AACGAGGCGC GCGAGAACCT CAACACGCGT CTGGCGGCCG GCTCCGGCAC CGCCGACATC GAGGCGATCG AGGTCGACTG GCTGCCCGAG CTCCTGCAGT ACCCGGACTA CTTCGAGGAC CTGTCCTCCC CCGAGGTCGA GGGTCGCTGG CTCGACTGGA AGGTCCAGCA GGCCACGACG GCCGACGGCA AGCTCATCGG CTACGGCACG GACATCGGCC CGGAGGGCAT CGCCTACCGC GCCGACCTGT TCCAGGCCGC CGGCCTGCCG GCCGACCGCG AGGCCGTCGC CGAGCTCTTC GGTGGCGAGA GCGCGACGTG GGAGAAGTTC TTCGAGGTGG GCAAGACCTA CACCTCCGCG ACGGGCAAGC CCTTCTTCGA CTCCGCGGCC GCCATCTACC AGGGCATGGT CAACCAGGAG GAGGCGGCCT ACGAGGACCC CGACTCGGGT GACGTCATCG CGCTGGAGAA CCCGCGCGTC AAGGAGATGT ACGAGCAGGT CACCACGGCC GCCGTGGGCG ACAACCTGTC CGCCCACTTC GAGCAGTGGC AGCCGGACTG GCAGAACGCC TTCCAGAACG ACGGCTTCGC CGTCATGCTG GCGCCGGGCT GGATGCTGGG CGTCATCGCG GGCAACGCGG CCGGCGTCAC CGGCTGGGAC CTCGCCGACG TGTTCCCCGG CGGTGCCGGC AACTGGGGCG GCTCGTTCCT CACGGTCCCG TCGCAGGGTG CCAACGTCGA GGCCGCCAAG GAGCTGGCCG CGTGGCTGAC GGCCCCCGAG CAGCAGATCG AGGCGTTCCA GAACAAGGGC ACGTTCCCGA GCCAGGTCGA GGCGCTCGAG TCGGACGAGA TCAAGTCCGC CACCAACGAG TTCTTCAACA ACGCTCCGGT CGGCGAGATC CTCGCCAACC GCGCGCAGGG CGTCGTGGTG CCCTTCAAGG GCCCGCAGTA CTTCACCGTG CAGGACGCGA TCAACAACGC GATCACGCAG GTGGACGTCA ACGGTGCCGA CGCGGCCGCC GAGTGGGCGA CCTTCGAGGG CGTGGTCCAG GGTCTCGGCT GA
|
Protein sequence | MRKTTRKAWA LAAGVTSIAL VATACSSSDD PGEGDETADG GNITLTVATF NEFGYTDEMF DRYEAEHPGV TIEQKVAATS NEARENLNTR LAAGSGTADI EAIEVDWLPE LLQYPDYFED LSSPEVEGRW LDWKVQQATT ADGKLIGYGT DIGPEGIAYR ADLFQAAGLP ADREAVAELF GGESATWEKF FEVGKTYTSA TGKPFFDSAA AIYQGMVNQE EAAYEDPDSG DVIALENPRV KEMYEQVTTA AVGDNLSAHF EQWQPDWQNA FQNDGFAVML APGWMLGVIA GNAAGVTGWD LADVFPGGAG NWGGSFLTVP SQGANVEAAK ELAAWLTAPE QQIEAFQNKG TFPSQVEALE SDEIKSATNE FFNNAPVGEI LANRAQGVVV PFKGPQYFTV QDAINNAITQ VDVNGADAAA EWATFEGVVQ GLG
|
| |