Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0960 |
Symbol | |
ID | 9144835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1064036 |
End bp | 1065319 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003636066 |
Protein GI | 296128816 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.226879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000138332 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGACGAC CTGCCGTCCT CCTCGCCCTG GCCCTCGCAG CGCCGCTGGC CGCGTGCGCG TCGGCGGACG AGGGCCCGCC GACGCTCATG TGGTACATCA ACCCGGACTC CGGCGGGCAG GCGCGCATCG CGTCCGAGTG CAGCGAGGAG TCCGGCGGCG CGTACCGCCT CGAGGTGACG TTGCTGCCGC GTGACGCCCC GAGCCAGCGG GAGCAGCTCG TGCGCCGCCT CGCCGCCGGT GACACGTCGA TCGACCTCAT GAGCATCGAC CCGCCGTTCG TCCCGGAGTT CGCCAACGCG GACTTCCTGG CCGAGGTGCC GGACGACGTG GCCGACGCCG TCACGCAGGA CGTCGCCGAG GGGGCGGTGG CGGCCGCGAC GTTCGACGAC GAGCTCGTCG TCGTCCCGTT CTGGGCCAAC ACGCAGCTGC TGTGGTACCG CAAGTCGGTC GCCGAGGCCG CGGGGCTCGA CATGACGCAG CCCGTCACGT GGGACCAGGT CATCCAGGCC GCGGCCGACC AGGACGTCAC GGTCGCCGTG CAGGGCACGC GCGCCGAGTC GCTCACCGTG TGGGTCAACG CGCTCGTGGA GTCGGCCGGC GGGCACGTCC TGGAGAACCC CGAGGAGGAG GACCCGCGCG CGGTCGTCCC GAGCCTCGAC TCCGACGCGG GCCGCACGGC CGCCCAGATC ATCGCCGACC TCACCGCCGC GGGCGTCGGC GGGCCGCAGC TGTCCAACGC GACCGAGGAC ATCAACGCCT CGATGTTCGA GAGCGACGAC GCGGGCTTCA TGGTCAACTG GCCGTTCGTC TGGCAGCGCG GCAAGGCCGG CGTCGAGGGC GGCAGCCTCG ACCAGGCCAC GCTCGACGAC TACGGCTGGG CGATGTACCC GCAGGCCGTG GAGGGCGAGC AGGCCCGGCC GCCCATCGGG GGGGTGACGC TCGGCGTCAG CGCGTTCGGC GAGCACCAGG ACCTCGCGTT CGAGGCCGCC CAGTGCATCG TCCAGCCCGA GAAGCAGGCC GCCTACTACA TCTCCGACGG CAACCCGCCC GGCTCCCTCG CGGCTTTCGA CGACCCCGAG GTCCAGGAGG AGTTCCCCAT GGCCGACCTC ATCCGGGAGT CCCTGCAGCA CGCGGCCCCG CGACCGCAGA CACCGTTCTA CAACGAGGTG TCCTCGAGCA TCCAGCGCAC GTGGCACCCG CCACGCAGCG TGTCGCCGGA CTCCGGTCCC GGCAGGGCCG ACACCCTCGT CATGGACGTG CTCCAGGGAA GGGCGCTGCT GTGA
|
Protein sequence | MRRPAVLLAL ALAAPLAACA SADEGPPTLM WYINPDSGGQ ARIASECSEE SGGAYRLEVT LLPRDAPSQR EQLVRRLAAG DTSIDLMSID PPFVPEFANA DFLAEVPDDV ADAVTQDVAE GAVAAATFDD ELVVVPFWAN TQLLWYRKSV AEAAGLDMTQ PVTWDQVIQA AADQDVTVAV QGTRAESLTV WVNALVESAG GHVLENPEEE DPRAVVPSLD SDAGRTAAQI IADLTAAGVG GPQLSNATED INASMFESDD AGFMVNWPFV WQRGKAGVEG GSLDQATLDD YGWAMYPQAV EGEQARPPIG GVTLGVSAFG EHQDLAFEAA QCIVQPEKQA AYYISDGNPP GSLAAFDDPE VQEEFPMADL IRESLQHAAP RPQTPFYNEV SSSIQRTWHP PRSVSPDSGP GRADTLVMDV LQGRALL
|
| |