Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_1126 |
Symbol | |
ID | 9145005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1257458 |
End bp | 1258759 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003636229 |
Protein GI | 296128979 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.162135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000581091 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAGGA CTGCGGCACT GCGCGCGGTC GCGCTCGGCG CAGCGGTGGC GATGATCGCG ACCGCCTGCA GCTCCGGGAG CGGCAGCGAC GACCCCACCG AGGGTGCTGG CGAGAACGTC ACCCTCACGT GGTGGCACAA CTCCAACACC GGTGCGGGGA AGGACTACTA CGACCAGCTG GCCGACGAGT TCGAGTCGGA CAACCCTGGC GTGACGATCG AGGTCAGCGC CCTCCAGCAC GAGGACATGC TCACCAAGCT CGACGCCGCG TTCCAGACCG GCGACCAGCC GGACATCTTC ATGGAGCGTG GTGGCGGCGA GCTCAAGGCG CACGTCGCCG CGGGCCTCGT CAAGGACATC ACCGACGACG CCGCGGACAC GATCTCGGCG CTCGGCGGCT CGGTCAGCGG CTGGACGGTC GAGGACCGTG TCTACTCGCT GCCCTTCTCC ATGGGTGTTG TCGGCTTCTG GTACAACAAG TCGATGTTCG CCCAGGCGGG CATCACCGAG GCGCCCAAGA CGATGGACGA CCTGTACGCC GCGGTCGAGG CGCTCAAGGG CGCCGGCATC GAGCCGATCT CGGTCGGCGC CGGTTCCGCC TGGCCCGCCG CGCACTACTG GTACTACTTC GCCCTGCGTC AGTGCTCGCA GGACACGATC GCCACGGCGT CCCAGGAGCT CGAGTTCACG GACCCCTGCT GGGTCAAGGC GGGCGAGAGC CTGGCCGACC TGGTGGCGCA GGAGCCGTTC AACACCGGCT TCCTCGGCAC CGAGGCCCAG GGCACGCCCG AGTCGGCCTC CGGCCTCCTC GCGAACCGCA AGGTCGCGAT GGAGCTCGCC GGCCACTGGG AGCCCGGCGT CATGCAGGGC CTGACGGAGG ACGAGCAGGG CCTCGGCGAG GACACCGGCT GGTTCCCGTT CCCGGAGGTC GCGGGTGGCG AGGGTGACCC GGCCGCCCAG CTCGGTGGCG GTGACGCGTG GGCGTGCTCG AACGACGCGC CGGACATCTG CGTCGACTTC ATCGAGTTCA TGCTGTCGAA CGACGTCCAG AAGGGCTTCG CGGAGCTCGA CATGGGCCTG CCGACGCTCC CGTCCGCCAC GGCGTTCGTC GCGGCCCCGG AGCTGGCGCA GCTGCTCTCG TACCGGAACG ACGCTCCGTA CGTCCAGCTG TACTTCGACA CGCAGTTCGG CGAGAACATC GGTGGCGCCA TGAACGAGGC CATCGTGTCG GTGTTCGCGG GGAGCGGGAC GCCTCAGGGC ATCGTCGACG CGACCCAGGC CGCGGCTGAC CTCGAGAAGT GA
|
Protein sequence | MKRTAALRAV ALGAAVAMIA TACSSGSGSD DPTEGAGENV TLTWWHNSNT GAGKDYYDQL ADEFESDNPG VTIEVSALQH EDMLTKLDAA FQTGDQPDIF MERGGGELKA HVAAGLVKDI TDDAADTISA LGGSVSGWTV EDRVYSLPFS MGVVGFWYNK SMFAQAGITE APKTMDDLYA AVEALKGAGI EPISVGAGSA WPAAHYWYYF ALRQCSQDTI ATASQELEFT DPCWVKAGES LADLVAQEPF NTGFLGTEAQ GTPESASGLL ANRKVAMELA GHWEPGVMQG LTEDEQGLGE DTGWFPFPEV AGGEGDPAAQ LGGGDAWACS NDAPDICVDF IEFMLSNDVQ KGFAELDMGL PTLPSATAFV AAPELAQLLS YRNDAPYVQL YFDTQFGENI GGAMNEAIVS VFAGSGTPQG IVDATQAAAD LEK
|
| |