Gene Information |
Locus tag | Spro_4624 |
Symbol | |
ID | 5603959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 5100396 |
End bp | 5101430 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640940190 |
Product | cupin 2 domain-containing protein |
Protein accession | YP_001480845 |
Protein GI | 157372856 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3435] Gentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR02272] gentisate 1,2-dioxygenase |
|
|
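The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions (the record does not say so explicitly, but the reported 1035 bp only matches under that convention) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values copied from the Spro_4624 record
length = gene_length_bp(5100396, 5101430)
print(length)                     # 1035
print(protein_length_aa(length))  # 344
```

Both results agree with the Gene Length and Protein Length rows of the record.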
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCG ATATTCAGCA ACTGCAAAAG AAACATCTGT TCCCGCTATG GGAGTCACTG CATGGCCTGG TCCCCAATCA GCCAGAACCT CAGGCTACGC CCTATCAATG GCATTACGCT GAGGTCAAAG ACGCGCTGCT GGATATCGGC TCAAAGATCG ACATCGAACG TGCCGAACGC CGCGTGCTGG TCATGGAAAA TCCGTCGCTG CCACCGGGCT CGTCACGCAT CACCGACACG CTGTATGCCG GAATGCAAAT GGTGCTGCCG GGTGAAATTG CCCCATTGCA TCGCCACACC CCCACCGCGC TGCGCTTTAT TCTGGAAGCC GAAGGCGGCA ACACCACGGT GGACGGCGAA AAGACTACGC TGCACCCGGG GGATTTTATC ATTACCCCTT CATGGCGTTG GCACCAGCAC CAAAATGACA CCGATAAACC GATATTCTGG CTCGATGGCC TGGATGCCCC GCTGCTGCAC TTTCTCAAGG CCGGGTTTCG GCAAGATCGC CTGCCCGCAG GCCAAACGTT GGAGCCACGT CCTGAGGGCG ATGCTCTGGC CCGTTACGGT ACCAGCCTGA TACCGCTGGA GTACCGCCCA ACGGGGCAGA GCCAGCCCTC GCCGATATTT AACTATCCGT ATCAACGCAC CCGCGAAGCG CTGGAACAAT TAAAGCGCCA TAGCGAGATC GACCCGGCCA GCGCCATCAG CCTGCGTTAT ATCAATCCGG CCAACGGCGA TTGGGCCATT CCCACCCTCG GCACCGTCAT CACGCTGCTG CCAAGAGGCT TTACCAGCCT CTACCAACGT GGCAATGCCA GCCAGGTGCT GGTGGTGATG GAGGGAGAAC TGGAAGTCCG GTTAACCGGC GGGATTCACT TCCGGCTAAA ACCGAAAGAC ATTTTTGCGC TGCCTTCGTG GCTCAGCTAT CAGCTGTCGG CACCTGTCGG CGATACGGTC TGCTTCAGCT TTTCCGATCG CCCGGTGCTG GAAAAACTGG GGATCTGGCG GCACGAGATC ACGCCTGAGG GCTAA
|
Protein sequence | MSIDIQQLQK KHLFPLWESL HGLVPNQPEP QATPYQWHYA EVKDALLDIG SKIDIERAER RVLVMENPSL PPGSSRITDT LYAGMQMVLP GEIAPLHRHT PTALRFILEA EGGNTTVDGE KTTLHPGDFI ITPSWRWHQH QNDTDKPIFW LDGLDAPLLH FLKAGFRQDR LPAGQTLEPR PEGDALARYG TSLIPLEYRP TGQSQPSPIF NYPYQRTREA LEQLKRHSEI DPASAISLRY INPANGDWAI PTLGTVITLL PRGFTSLYQR GNASQVLVVM EGELEVRLTG GIHFRLKPKD IFALPSWLSY QLSAPVGDTV CFSFSDRPVL EKLGIWRHEI TPEG
|
| |
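The sequences above are displayed in space-separated blocks of ten characters, so the blocks need to be joined before any downstream analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the 58% reported for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base blocks of the gene sequence in this record
head = clean_sequence("ATGAGCATCG ATATTCAGCA ACTGCAAAAG")
print(len(head))                    # 30
print(head[:3])                     # ATG (start codon)
print(round(gc_fraction(head), 2))  # 0.4
```

Applying the same two functions to the full Gene sequence field would give the whole-gene GC fraction for comparison with the GC content row.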