Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cfla_0383 |
Symbol | |
ID | 9144249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cellulomonas flavigena DSM 20109 |
Kingdom | Bacteria |
Replicon accession | NC_014151 |
Strand | - |
Start bp | 412249 |
End bp | 413454 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Ricin B lectin |
Protein accession | YP_003635499 |
Protein GI | 296128249 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCGAT TGGCCCGAGG GCTGCTGCTC GCCGTCACGC TCGGCGTCGC ACTGGCCGGA GGTGTCGTGG CGCCGTCCGC GCCCGCCGCC GCCGAGAGCA ACGGGGGCAC CCGCGTGATG CCCCTCGGCG ACTCGATCAC GGAGGGCATC GGCATGTCCG GGGGCGGCGG CTACCGCGTC GGGCTGTGGC AGCGCCTCGC GCAGGGCGGC TACACGACGG ACTTCGTGGG GTCGGGCTTC AACGGGCCGT CGAGCCTGTG GGACCACGAC CACGAGGGGC ACTCGGGCTG GCGGATCGAC CAGATCGACG CGAACATCGT CAACTGGGTC CGCACCTACC AGCCGCGCAC CGTGCTGCTG CACATCGGCA CCAACGACAT CAGCCAGAAC CGCGACCTGG GCAACGCCCC GAACCGGCTC GCGGGGGTCA TCGACAAGAT CACCAGCACG TCCCCGCAGA CCGACGTCTT CGTCGCGACC CTCATCCCGG TCTCGTACGC GCAGTCGCAG GTGCAGGCCT ACAACTCCGC CATCCCCGGC ATCGTGTCGA GCAGGGCCAA CGCCGGCAAG AAGGTGCACC TGGTCAACAT GAACGGGGCC GTCCCCCTCT CCGACATGCC CGACGGCGTG CACCCCAACG CGGCGGGCTA CGACAAGATG GCGGCCGTCT GGTACAACGC GCTGCGCAGC GTGCCGGGCA GCATCGGCAA CCCGTCGGGC GGCACCACCC CCACGCCGAC GCCCACCCCG ACCCCCGACC CGGGCACGGG CCAGGTGGAC ACCTCGGCCT GGTACACGCT CGTCAACCGC AACAGCGGCA AGGCCATCGA CGTGTACAAC CTGTCCACCG CCGACGGCGC CCGCATCACC CAGTGGTCCC GCAACGGCGG CACCCAGCAG CAGTGGCAGT TCGTCAGCGT CGGCAACGGC TACTACCAGG TGAAGTCGCG GCTCTCGGGC AAGCTCCTCG ACGTGTCCGG ACGGTCCACC ACCGACGGCG CGGCGATCCA CCAGTGGACC AACCACGGCG GCACCAACCA GCAGTTCAGC CTCCAGACGA TCGACGGGTA CGTCCAGCTC ATCGCGCGGA ACAGCGGCAA GGCCGTCGAG GTGCAGGGCG CCTCCACCGC CGACGGCGGC AACATCGTCC AGTACACCGA CTGGAACGGC ACCAACCAGC AGTGGCAGCT GGTGAAGGTG GGCTGA
|
Protein sequence | MHRLARGLLL AVTLGVALAG GVVAPSAPAA AESNGGTRVM PLGDSITEGI GMSGGGGYRV GLWQRLAQGG YTTDFVGSGF NGPSSLWDHD HEGHSGWRID QIDANIVNWV RTYQPRTVLL HIGTNDISQN RDLGNAPNRL AGVIDKITST SPQTDVFVAT LIPVSYAQSQ VQAYNSAIPG IVSSRANAGK KVHLVNMNGA VPLSDMPDGV HPNAAGYDKM AAVWYNALRS VPGSIGNPSG GTTPTPTPTP TPDPGTGQVD TSAWYTLVNR NSGKAIDVYN LSTADGARIT QWSRNGGTQQ QWQFVSVGNG YYQVKSRLSG KLLDVSGRST TDGAAIHQWT NHGGTNQQFS LQTIDGYVQL IARNSGKAVE VQGASTADGG NIVQYTDWNG TNQQWQLVKV G
|
| |