Gene Information |
Locus tag | Sare_1147 |
Symbol | |
ID | 5704408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1297105 |
End bp | 1297875 |
Gene Length | 771 bp |
Protein Length | 256 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641270662 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_001536046 |
Protein GI | 159036793 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.608556 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000076771 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGTTCG GGGTCGTTGA GGGGGAGCCG GAGGCGGGCC CCCAGGGTCT GACCGTCGCT GAGGTCGAGG GGCATCCGTT CGGACGGATC ACCTTCTCCG GCGCACGGTG GGCCCTCTCC GATGTCCGCC TGCTCTCGCC GATCCTGCCG AGCAAGGTCG TCTGCGTCGG TCGCAACTAC GCCGAGCACG CCGCCGAGCA CGGCACCGAG GTACCCAAGG AGCCGTTGCT TTTCCTCAAG CCGTCCACCT CGGTAATCGG GCCGCGGGAC GCGATTCGGC TACCGGCGCT GTCGAAGCAG GTCGAGCACG AGGCCGAGCT GGCCGTGGTG ATCGGGGCTC CCGGTGCCCG GCGGGCTGAC CGGGCGGCGG CCCAGCGTGC GATCTTCGGC TACACCTGCG CCAACGATGT CACGGCACGG GACCTGCAAC GAGTGGACGG GCAGTGGACC CGGGCCAAGG GCTTCGACTC GTTCTGCCCG ATCGGCCCCT GGATCACCAC CGGTCTCGAC GTCACCGACC TGGAGATCCG GTGTGAGGTG GGTCGTGATC CGGAGGAGAT GGAGGTCCGC CAGCTCGGCC GAACCCGGGA CATGGTGTTC GACGTGCCAG CCCTGGTGTC GTACGTCTCA CATGTGATGA CGTTGCTTCC CGGCGATGTC ATCCTGACCG GCACGCCAGC CGGGGTTAGT CCGCTCGTGG ATGGGGATAC GGTCACGGTG CGGATCGAGG GGGTCGGTGA GCTGACCAAT CCGGTGGTGC CGGTCGGCTG A
|
Protein sequence | MSFGVVEGEP EAGPQGLTVA EVEGHPFGRI TFSGARWALS DVRLLSPILP SKVVCVGRNY AEHAAEHGTE VPKEPLLFLK PSTSVIGPRD AIRLPALSKQ VEHEAELAVV IGAPGARRAD RAAAQRAIFG YTCANDVTAR DLQRVDGQWT RAKGFDSFCP IGPWITTGLD VTDLEIRCEV GRDPEEMEVR QLGRTRDMVF DVPALVSYVS HVMTLLPGDV ILTGTPAGVS PLVDGDTVTV RIEGVGELTN PVVPVG
|
| |
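
The length fields in this record are mutually consistent: End bp minus Start bp plus one gives 1297875 - 1297105 + 1 = 771 bp, and 771 / 3 = 257 codons, which is 256 amino acids once the terminal stop codon (TGA here) is excluded. The following is a minimal sketch of how such checks could be scripted against the space-grouped sequence text shown above. The function names (`clean`, `gc_percent`, `span_length`, `expected_protein_length`) are illustrative assumptions and are not part of any PanDaTox or IMG tooling.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    gene_len = span_length(1297105, 1297875)
    print(gene_len)                           # 771
    print(expected_protein_length(gene_len))  # 256
```

Passing the full Gene sequence value to `gc_percent` and rounding the result should allow a comparison with the GC content field, assuming that field was computed over the same 771 bp.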