Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_0964 |
Symbol | |
ID | 5085031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 989694 |
End bp | 990764 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640482521 |
Product | cellulase |
Protein accession | YP_001167170 |
Protein GI | 146277011 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.585824 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGTC GCACCCTTCT CCGGCTTGCC GCCGCCACGG CCCTTGCCTC TCCGGCGGGG CGGCTCCTGG CTCAGGAGGG CGGTGTGCTG CCGTCCGATC ATCCGCTTCA GACCGCCTGG GAGAGCTGGA AGGCGGCCTT CCTGCTGCCG GCCGGCCGGA TCGTCGATGG TCCGCAACAG AATGCCAGCC ACTCCGAGGG GCAGGGCTAT GGTGCGGCGC TGGCGGCCAT CTTCGGCGAC GAGGCGGCGC TGCGCCGCAT CGTGGACTGG ACCGAGACGA ATCTCGCCCG CCGTGACGAC AATCTGCTGA GCTGGCGCTG GCTGCCGGGC GTGCCGCTCG CGGTGCCCGA CGAGAACAAC GCCACCGACG GCGACCTGTT CTACGGCTGG GGTCTTGCGC TTGCCGCCCA GCGGTTCGGC AACGCCGACC TTGCCAAACG CGCAACCGAG ATCGCCCGCG CCATTGCGCT GCACTGTGTC CGTCCGCACC CGGATGGCTC CGAGCGGCTG GTGCTGCTGC CGGGCGCCAC AGGGTTCGAG ACCGAGGAGG GGTTGGTGCT CAACCCGTCC TACTACATGC CGCGCGCCAT GACCGAGCTT GCCGCCTTCA GCGGACAGGA GCGGCTGGCC CGTTGCGCGC AGGATGGCGC CCTCTGGATT GGCGGGCTCG GTCTCGCGCC GGACTGGGTG CTGGTGACGT CCACGGGGGA TCTGCCGGCC AAGGGCCTGT CGGCGCACAG CGGCTATGAT GCGATGCGCG TGCCGCTCTT CCTGCTCTGG TCCGGCCTTA CGGCGAACCC CGCCCTTCGC CGCTTCATCG AGGTGCAGCG CGAGGCGGAA CCCGGAACCG GGACCCCCGT CGTCTTCGAC CGCGACACCG GCGCCCTGCT TGAGAGGTCG GCGGATCCGG GTTTTGCCTC GGTGCCCGCT CTGGCGGACT GTGCGCTGTC CGGGCGGCCC GGGGCGGCCA TCCCGCCGTT TGACGCGCGG CAGCCCTACT ATCCCGCGAC GCTGCATCTG ATGACGCTCG TCGCACAAGT GGAAGGTTTT TCCGCATGCG CTCCGATCTG A
|
Protein sequence | MRRRTLLRLA AATALASPAG RLLAQEGGVL PSDHPLQTAW ESWKAAFLLP AGRIVDGPQQ NASHSEGQGY GAALAAIFGD EAALRRIVDW TETNLARRDD NLLSWRWLPG VPLAVPDENN ATDGDLFYGW GLALAAQRFG NADLAKRATE IARAIALHCV RPHPDGSERL VLLPGATGFE TEEGLVLNPS YYMPRAMTEL AAFSGQERLA RCAQDGALWI GGLGLAPDWV LVTSTGDLPA KGLSAHSGYD AMRVPLFLLW SGLTANPALR RFIEVQREAE PGTGTPVVFD RDTGALLERS ADPGFASVPA LADCALSGRP GAAIPPFDAR QPYYPATLHL MTLVAQVEGF SACAPI
|
| |