Gene Information |
Locus tag | Cfla_2806 |
Symbol | |
ID | 9146714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 3118364 |
End bp | 3120034 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003637890 |
Protein GI | 296130640 |
COG category | |
COG ID | |
TIGRFAM ID | |
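The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length), and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3118364, 3120034)
print(length_bp)                  # 1671
print(protein_length(length_bp))  # 556
```

Both values agree with the Gene Length and Protein Length rows of the record.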
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.636428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0208064 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGTG CGGGAAATCT GGCAGGACGG CGCATGCGCA AGGCGCGTGC CGCCGGTGCA CTGACGCTGG TGGGCGCGCT CGCGCTGGCC GCGTGCAGCG GCGGCGGTGG GGACACCAAC GCCGACGGTG AGGCCGCGGA GCTCGAGTGC TCCTCGGAGG CGGTCGCCGA CCAGCCGTGG AAGGCGGCCG AGCCGCGCGA GTTCTCCCTG CTGTGGACGG ACTGGGCCGA CTACCCGATC ACGGACACGT GGGAGTTCTT CGACGAGATC GAGAAGCGGA CGAACGTCAA GCTCAAGCTG ACGAACATCC CGTTCTCCGA CGCGACCGAG AAGCGCTCGC TGCTCATCAG CGCCGGCGAC GCGCCGCAGA TCATCCCGCT CGTCTACACG GGCGAGGAGC GCCAGTTCGC GGCCTCCGGC GCGGTCGTGC CGCTGAGCGA CTACATCGAC TACATGCCGA ACTTCAAGAA GTACACCGAG GAGTGGGACC TCGTCGACAT GGTCGACGAC CTGCGCCAGG AGGACGGCAA GTACTACATG ACCCCGGGCC TCCAGGAGGT CTCGGTCCCG GTCTTCACGC TCATCATCCG CAAGGACGTC TTCGACGAGG TCGGCGCTCC CGAGCCCGAC ACGTGGGAGG ACCTGCAGGA GGGCCTGGCG CTCATCAAGG AGAAGTACCC GGACTCCTAC CCGCTGGCCG ACGGCTTCGA GGCGTGGTCG ATGATCAACT ACGCCGCGCA CGCGTTCGGC ACGGTCGGTG GCTGGGGCTT CGGCGACGGC GCCTGGTGGG ACGAGGAGAA GGGCGAGTTC GTCTACGCCG CGACCACCGA CGGGTACAAG GACATGGTCA CGTACTTCCG TGGCCTGCAC GACGCCGGTC TGCTCGACGC GGAGTCGTTC ACCGCGTCGA ACGACGGTGG CGGCACGGTC GTCGAGAAGG TCGCGGCCGA GAAGGTCTTC GCGTTCTCGG GCGGCTCGTG GACGGTCCAG GAGTTCGGCA CGGCTCTCGA GGCCGCCGGC GTCACGGACT ACGAGCTCGT GCAGATCGCG CCCCCGGCCG GCCCGGCGGG CAACAACGTC GAGCCGCGCA ACTTCTGGAA CGGCTTCATG CTGACGGCCG ACGCGGCGAA GGACGAGAAC TTCTGCGACC TGCTGCACTT CACGGACTGG CTGTACTACA ACCCCGAGGC CCGTGAGCTG ATCCAGTGGG GCGTCGAGGG CAAGCACTTC ACCAAGGAGG GCGGCAAGTA CACGCTCAAC CCGGAGTTCT CGCTCAAGAA CCTCAACATG AACCCGGACG CCCCGGTCGA CCTCAAGAAG GACCTCGGCT ACGCCAACGA CGTCTTCGCC GGCTCGACCG AGTCGCGCGA GCTGAAGGAG TCGTACAACG TCCCGGCGTT CGTCCAGTAC ATCGACGACG TCCAGACGAA GCGTGAGCCG CGGGAGCCGT TCCCGCCGCA CCCGCTCGAC GAGGCCGAGC TCGAGCAGTC CTCGCTGCTC GGCACGCCGC TGAAGGACAC GGTGGACACG GCGACGCTCG AGTTCATCCT CGGCCAGCGT CCGCTCTCCG ACTGGGACGC GTACGTGGCG CAGCTCGAGG GCCAGGGCCT GCAGAGCTAC ATGGACCTCA TCAACGGCGC GTACAAGCGT GCCGCCGAGG GCCAGGACTG A
|
Protein sequence | MARAGNLAGR RMRKARAAGA LTLVGALALA ACSGGGGDTN ADGEAAELEC SSEAVADQPW KAAEPREFSL LWTDWADYPI TDTWEFFDEI EKRTNVKLKL TNIPFSDATE KRSLLISAGD APQIIPLVYT GEERQFAASG AVVPLSDYID YMPNFKKYTE EWDLVDMVDD LRQEDGKYYM TPGLQEVSVP VFTLIIRKDV FDEVGAPEPD TWEDLQEGLA LIKEKYPDSY PLADGFEAWS MINYAAHAFG TVGGWGFGDG AWWDEEKGEF VYAATTDGYK DMVTYFRGLH DAGLLDAESF TASNDGGGTV VEKVAAEKVF AFSGGSWTVQ EFGTALEAAG VTDYELVQIA PPAGPAGNNV EPRNFWNGFM LTADAAKDEN FCDLLHFTDW LYYNPEAREL IQWGVEGKHF TKEGGKYTLN PEFSLKNLNM NPDAPVDLKK DLGYANDVFA GSTESRELKE SYNVPAFVQY IDDVQTKREP REPFPPHPLD EAELEQSSLL GTPLKDTVDT ATLEFILGQR PLSDWDAYVA QLEGQGLQSY MDLINGAYKR AAEGQD
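The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be normalized and sanity-checked before further use. The helper names are hypothetical, the start codon set is only a common subset of those allowed under translation table 11, and the short sample strings are toy inputs rather than the full record.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: a common subset of table 11 start codons, not the complete list.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 with a start codon first and a stop codon last."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence field, used here as a small sample.
sample = clean_sequence("ATGGCACGTG CGGGAAATCT")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.55

# Toy three-codon CDS: start codon, one sense codon, stop codon.
print(looks_like_complete_cds(clean_sequence("ATG GCA TGA")))  # True
```

Applied to the complete gene sequence field, the same helpers would allow the reported gene length and GC content to be compared against values computed from the sequence itself.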