Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1976 |
Symbol | |
ID | 4895564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2090876 |
End bp | 2091946 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640112570 |
Product | cellulase |
Protein accession | YP_001043852 |
Protein GI | 126462738 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.65444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.379222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAGAC GGACCATCCT GACATCGGCC GCCGCCGCGC TGATGCTGGC CCCTGCAGGA CGCCTCTTCG CGCAGTCGGG CGGAGAGGCC TTGTCTGCGG ACCACCCGCT CCAGGCGGCC TGGCGCAGCT GGAAGGATGC GTTTCTGCTG CCCGCCGGCC GCATCGTCGA CGGGCCGCAG CAGAATGCGA GCCATTCCGA AGGGCAGGGC TACGGCGCCA CGCTCGCCGC GATCTTCGGC GACGAGGAGG CCCTGCGGCG CATCGTCGAC TGGACCGAGG CGAACCTTGC GCGGCGCGAG GACAAGCTTC TGAGCTGGCG CTGGCTGCCC GGTGTGGCGC TGGCCGTGCC CGACGAGAAC AACGCCACCG ACGGCGATCT CTTCTACGCC TGGGGTCTCG CCATGGCCGC GCAGCGGTTC GGCAAAGCCG ATTACGCCGG GCGGGCGACC GAACTGGCGC GCGCCATCGC GCTGCATTGC GTGCGTCCGC ATCCGGACGG CTCCGAGCAG CTCGTGCTGC TGCCGGGGGC CAGCGGCTTC GAGACGCCGG ACGGGGTGGT GCTCAACCCC TCCTACTACA TGCCCCGCGC CCTGACCGAG CTCGCCGCCT TCAGCGGCCA GGACCGGCTG GCGCGCTGTG CCCGTGACGG GGCCGACTGG ATCGCGTCGC TCGGGCTTCC GCCGGACTGG GCGCTGGTGA CGCCCTTCGG CACGCAGCCG GCGCCGGGTC TGTCCCATAA CAGCGGCTAC GATGCGCTGC GGGTGCCCCT GTTCCTGCTC TGGTCCGGGC TGACCGCCAA TCCCGCGCTG CGCCGCGCGG TGGAGGCGGC CGGGGACGCC GCAGCCGGCG ACACGCCGGT GAGGTTCGAC CGCGACACGG GGGCGGTGCT GGAACGGTCC GCCGATCCGG GCTTCCGCGC CGTGCTCGCG CTTGGCGATT GCGCCCTTTC GGGTCGTCCG GGGGCGGCGA TCCCGCCCTT CGACGCGCGC CAACCCTACT ATCCCGCGAC GCTGCATCTG ATGGCGCTCG TGGCACAAGT GGAAGGTTTC TCCGCATGCG TTCCGATCTG A
|
Protein sequence | MRRRTILTSA AAALMLAPAG RLFAQSGGEA LSADHPLQAA WRSWKDAFLL PAGRIVDGPQ QNASHSEGQG YGATLAAIFG DEEALRRIVD WTEANLARRE DKLLSWRWLP GVALAVPDEN NATDGDLFYA WGLAMAAQRF GKADYAGRAT ELARAIALHC VRPHPDGSEQ LVLLPGASGF ETPDGVVLNP SYYMPRALTE LAAFSGQDRL ARCARDGADW IASLGLPPDW ALVTPFGTQP APGLSHNSGY DALRVPLFLL WSGLTANPAL RRAVEAAGDA AAGDTPVRFD RDTGAVLERS ADPGFRAVLA LGDCALSGRP GAAIPPFDAR QPYYPATLHL MALVAQVEGF SACVPI
|
| |