Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_2998 |
Symbol | |
ID | 9146910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3323634 |
End bp | 3325292 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003638080 |
Protein GI | 296130830 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.58673 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.432395 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGA CGAAGAAGCG GCCCCTCGCG CTCGCCGCGA CCGCCGCGAC CCTCGCGCTG GCCCTGGCGG CCTGCTCGGG CGGGTCGGAC GACGAGACGG ACGACGCCCC CGAGGCGAGC GAGCTCGGTC AGGTCGGTGC GATGGAGGAC TACGGCGTCG GCACGACGTT CGTGGCCACC GAGCCGGTGA GCTTCGGCCT GATGTACCGC GACCACCCGA ACTACCCGCT CAAGGAGGAC TGGGACATCC TCACGAAGCT CGAGGAGAAC CAGAAGGTCA CCTTCGAGAT GCAGACCGCC CCGCTGTCCG ACTGGCAGCA GGCGCAGTCG ATCGCGATCG GCGCGGGCAA CGCCCCGGAC ATCATCTCCG TGACCTACCC CGGGCAGGAG GTGGCCTTCG TCGCCGGCGG TGCGATCCTG CCCGTGAGCG ACTACGTCGA GCACATGCCG AACTACCTCG ACAAGGTCGA GAAGTGGGGC CTGGAGGCCG ACATCGACCG GATGCGCCAG CAGGACGGCA AGTACTACGT GCTGCCCGGC CTGCGCGAGT CGGTCCGTCC CTCGTACACG TACGCGGTGC GCAAGGACGT CTGGGAGCAG CTCGGCCTGA GCCTGGAGCC GGAGACCTTC GAGGACTTCG CCGCCGACCT GGCGAAGGTC AAGGCCGCGT ACCCCGACCT GTACCCCCTG TCCGACCGCT GGTCGGCCAA CGGTCCGCTC GAGGCCACTC TCAACGTCGC CGCGTCGAAC TTCGGCACGG CCGCCGGCTG GGGCTACGGC GAGGGCACCT GGTGGGACGA GGACGCGGGC GAGTTCGTCT ACACCGGCGC CATGGACGAG TACCGCGAGC TGCTCGAGTA CTACCACGGC CTCATCGCCG ACGGGCTCAT GGACCCCGAG AGCCTCACGC AGGAGGACGA CCAGGCCATC CAGAAGATGG CGTCGGGCCA GACCTTCGCC CAGCTGACGA ACGACCAGGA GATCCTCAAG GTCCGGACCG CCATGACCGA GGTCGGCACG CAGGGCGAGG TCGCCATGAT CCGCGTCCCC GCCGGCCCCG CCGGTGACGT CCTGGCCGGT TCGCGCCTCG TCAGCGGTCT CATGCTGTCC TCGTCGGCCG CCGAGGAGGA CGACTTCCTC GCGATGCTGC AGTTCATCGA CTGGCTGTAC TACTCCGACG AGGGCCTGGA GTTCGCCAAG TGGGGTGTCG AGGGTGAGAC CTTCACGCGC GAGGGCGACA AGCGCGTGCT CATGCCGGAC ATCGACCAGA ACGGCCTGAA CCCGGGCGCG CCGAAGGCGC TCAACGTCGA CTACGGCTAC CACAACGGCG TGTGGATGCT CGAGCACGGC TCGTCGGACG AGCTGGACCG GTCGATGCTG CGTGACGAGG TCGTCGAGTT CGTCGAGTCC ATGAGCGACA AGGAGCTCGC CCCGGTCTCG CCGCCCGCAC CGCTGGACGA GCTCGAGCGT GAGCAGGTCT CGCTCTGGCA GACCGCGCTG CGCGACCACG TGCTGCAGAA CACCGCCGCG TTCATCCTCG GCCAGCGCGA CCTGTCCGAG TGGGACGCGT ACGTCGCCGA GCTCGAGGGC AAGAACATGC AGCAGTACCT CGACGTGGTG AACGCCGCGC AGGAGCGGTT CGCCGAGCAG AACGGCTGA
|
Protein sequence | MRTTKKRPLA LAATAATLAL ALAACSGGSD DETDDAPEAS ELGQVGAMED YGVGTTFVAT EPVSFGLMYR DHPNYPLKED WDILTKLEEN QKVTFEMQTA PLSDWQQAQS IAIGAGNAPD IISVTYPGQE VAFVAGGAIL PVSDYVEHMP NYLDKVEKWG LEADIDRMRQ QDGKYYVLPG LRESVRPSYT YAVRKDVWEQ LGLSLEPETF EDFAADLAKV KAAYPDLYPL SDRWSANGPL EATLNVAASN FGTAAGWGYG EGTWWDEDAG EFVYTGAMDE YRELLEYYHG LIADGLMDPE SLTQEDDQAI QKMASGQTFA QLTNDQEILK VRTAMTEVGT QGEVAMIRVP AGPAGDVLAG SRLVSGLMLS SSAAEEDDFL AMLQFIDWLY YSDEGLEFAK WGVEGETFTR EGDKRVLMPD IDQNGLNPGA PKALNVDYGY HNGVWMLEHG SSDELDRSML RDEVVEFVES MSDKELAPVS PPAPLDELER EQVSLWQTAL RDHVLQNTAA FILGQRDLSE WDAYVAELEG KNMQQYLDVV NAAQERFAEQ NG
|
| |