Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0981 |
Symbol | |
ID | 9144856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | + |
Start bp | 1085570 |
End bp | 1087426 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003636086 |
Protein GI | 296128836 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0454363 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGGATCA CACGGAAGGC TGCCCTCGCG TCGGTCGCGA CGGTCAGCGT CCTCGCCCTC GCGGCCTGCA CCGGTAGCGG CGACGACACG TCGGGTGACG ACGACAACGC GGGGATCAAC ACCGACACCG CGATCAACAT CGCCTGGAAC CAGCCGTTCT CGTCCTTCAA CGGCGAGAGC ATCACGGGCA ACGCGACGGC CAACAACATC ATCACGTACA TGGCCAACTC GCGCTTCAAC GACTACAACG CCGACCTCGA GGTCGTGCCG GACGAGTCGT GGGGCACCTA CGAGAAGGTC TCCGACGACC CGCTGACCGT CGCGTACACG TACGCCGACA CGGCGAAGTG GTCGGACGGC GTCTCGGCCG GCCCGGCCGA CCTCCTCCTG GAGTGGGCCG CCCAGAGCGG CAAGTTCAAC AACGTCGAGC CCGAGTACGA CGACGAGGGC AACGTCACCA ACCAGGACGC GCTCGACGCG GGTGTCTACT TCGACGCCGC CAGCCCGTCC GTGGCGCTGA TCACGGAGAC CCCGGAGATC GACGGCGACA CGATCACCCT CGTGTACTCC AAGCCGTTCG CCGACTGGGA GGTCGCGATC GACAACAACC TCCCCGCGCA CATCGTGGCG CAGCGGGCCC TCGGCATCGA GGACCCCGAG GAGGCGACCG AGGCCCTGAT CGCCGCGATC ACCGACGAGA ACCTCGAGGA CCTGTCGAAG ATCGCGAAGG TCTGGAGCGA CGACTGGAAC TTCGCGTCCC TGCCCGACGA CCCGCAGCTG CTCGTCACCT CGGGCCCGTA CACGATCACG GAGTTCGTCG AGCAGCAGTA CCTGACGCTC ACGGCGAACC CCGACTACGA GGGCGACAAG AAGGCCGTCT TCGAGAAGGT CACCGTCCGC TACAACGGCG ACCCGATGGG TCAGGTCCAG GCGCTGCAGA ACGGTGAGGT CGACCTCATC AGCCCGCAGT CGACGGCGGA CGTCCTCAAG GCGCTCGAGG CGATCGACGG CCTGACCGTC GAGACGAACG TCGAGGGCAC GTACGAGCAC GTCGACCTGC AGCAGGGCAA CGGCGGTCCG TTCGACGCCG CGACCTACGG CGGCGACACC GAGAAGGCCC ACAAGATCCG CCAGGCCTTC CTCAAGACGA TCCCCCGCGA GAAGATCGTC ACCGACCTCA TCCAGCCGCT GAACCCCGAC GCCGAGGTCC GCAACTCCTT CACGCAGGTG CCCGGCTCGC CGATGTACGA CGGCATCGTC GAGGCGAACG GCCAGCAGGA CGCCTACGGC GAGGTCGACA TCGAGGGTGC CAAGGCGCTG CTCGCCGAGG CCGGTGTGCC CAGCGTCCAG GTGCGTCTGC TCTTCGACCC GGACAACACG CGCCGTGTGA ACCAGTACGA GCTCATCAAG GGCTCGGCCG CCGAGGCCGG CTTCGACGTC GTCCCCTACA CGGTCCAGAC GGACTGGGGT ACGGACCTGT CGAACGCGCG GTCGTTCTAC GACGCGGCGC TCTTCGGGTG GCAGTCGACC TCGACCGCCG TCACCGAGTC CGACGCGAAC TACCGCACCG GCGCGACGAA CAACTACTAC GGGTACTCCA ACCCCGAGGT GGACGCGCTG TACGACGCGC TGCAGACCGA GACCGACGCC GCCGAGCAGG AGCGGATCCT CGGTGAGGTC GAGAAGCACC TGGTCGACGA CGCGTTCGGC GTGACGATCT TCCAGCACCC GGGCGTCACC GCCTGGAACC CGGAGAAGAT CGGCAACGTC CAGAAGCTGG GGATCGCGCC GACGATCTTC TACGGCTTCT GGGAGTGGAC CGCAGGCGAC GCGGCCACCG AGGGCGCCTC CGAGTGA
|
Protein sequence | MRITRKAALA SVATVSVLAL AACTGSGDDT SGDDDNAGIN TDTAINIAWN QPFSSFNGES ITGNATANNI ITYMANSRFN DYNADLEVVP DESWGTYEKV SDDPLTVAYT YADTAKWSDG VSAGPADLLL EWAAQSGKFN NVEPEYDDEG NVTNQDALDA GVYFDAASPS VALITETPEI DGDTITLVYS KPFADWEVAI DNNLPAHIVA QRALGIEDPE EATEALIAAI TDENLEDLSK IAKVWSDDWN FASLPDDPQL LVTSGPYTIT EFVEQQYLTL TANPDYEGDK KAVFEKVTVR YNGDPMGQVQ ALQNGEVDLI SPQSTADVLK ALEAIDGLTV ETNVEGTYEH VDLQQGNGGP FDAATYGGDT EKAHKIRQAF LKTIPREKIV TDLIQPLNPD AEVRNSFTQV PGSPMYDGIV EANGQQDAYG EVDIEGAKAL LAEAGVPSVQ VRLLFDPDNT RRVNQYELIK GSAAEAGFDV VPYTVQTDWG TDLSNARSFY DAALFGWQST STAVTESDAN YRTGATNNYY GYSNPEVDAL YDALQTETDA AEQERILGEV EKHLVDDAFG VTIFQHPGVT AWNPEKIGNV QKLGIAPTIF YGFWEWTAGD AATEGASE
|
| |