Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_0846 |
Symbol | |
ID | 5197520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | + |
Start bp | 943030 |
End bp | 943911 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640580391 |
Product | carboxymethylenebutenolidase |
Protein accession | YP_001261351 |
Protein GI | 148553769 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0412] Dienelactone hydrolase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.269647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0925086 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACG AGGATCTGAA GGCCCGCGCG ATCGCGCTCT ACGACCGCTT CACCCATGGC GCGCAGGATC GCCGCGCCTT CATGGCCGAC ATGACCCGCC TGGCCGGCGG TGCCGCCGCC GCGCAGGTGC TGGTCGCCAC GATCGCCGCC GATCCCGCCG CCGCCGCGAT CGTCGCCGAG GACGACAAAC GGCTGACCGC GCGGATGGTC CATTGGCCGG GCGCCAACGG CCATCAGCTG TTCGGCTATA TGGCGATCCC GAAGAAGCAT GCGAAGAAGC CGCCCGCCGT CCTCGTCGTC CATGAGAATC GCGGGTTGCA GCCCTATACC AGGGACGTCG CCCGGCGGCT GGCGGTCGCC GGCTTCGTCG GCGTCGCGCT CGACTTCCTG TCGCCGCAGG GCGGCACCCC CGCCGACGAG GACAAGGCGC GCGCGATGAT CGCCGCGCTC GACATTCCCG CCGCCACCGC CGACGGGGTG GCGACGATCG ACTGGCTCGC CGCGCACAAG CTGCTGAGCG GCAAGGTCGG GGTGGTCGGC TTCTGCTGGG GCGGCGCGAT GGCCGATCGA CTGGCGGTCG CCGCAGGCCC CGCGCTGAAG GCCTCCGTCG CCTTCTACGG CCCGCCCCCG CCGCCCGCCG ACGCCGCGAA GGTGAAGGCC GCCATGCTGC TCCACTATGC GGGCAGCGAC GACCGGGTGA ACGCGGGCGC CGCGCCCTGG GTGCAGGCGC TCCAGGCCGC GCATGTCGAC GTCCGCCGGT TCGACTATCC CGGCACCCAG CATGCCTTCC ACAACGACAC GTCGGCGGCG CGCTACGATG AGGCGGCCGC GACGCTCGCC TGGGACCGGA CCATCACCTT CCTGCGGGAG AAGCTGGCAT GA
|
Protein sequence | MDDEDLKARA IALYDRFTHG AQDRRAFMAD MTRLAGGAAA AQVLVATIAA DPAAAAIVAE DDKRLTARMV HWPGANGHQL FGYMAIPKKH AKKPPAVLVV HENRGLQPYT RDVARRLAVA GFVGVALDFL SPQGGTPADE DKARAMIAAL DIPAATADGV ATIDWLAAHK LLSGKVGVVG FCWGGAMADR LAVAAGPALK ASVAFYGPPP PPADAAKVKA AMLLHYAGSD DRVNAGAAPW VQALQAAHVD VRRFDYPGTQ HAFHNDTSAA RYDEAAATLA WDRTITFLRE KLA
|
| |