Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4855 |
Symbol | |
ID | 5604194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 5375366 |
End bp | 5377165 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640940428 |
Product | strictosidine synthase |
Protein accession | YP_001481076 |
Protein GI | 157373087 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000966412 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAAGC GAATTTTAAA TCAGAAGAAG TTTCTGGGCA TTACGACCCT TAATTGGCTT CGAACGGATC TTGCAAGAGA TGCGTGTTTT GCTTATTGGG GAGGGCCCCA TGCCGATCTG GTCTCTCGCA ATGGTTTCAT CCGTGATTAT AATCAGCATC ACTTCAACGA TGATGTAGCG GGTTTATGGC CCTTAACACC TGAAGTGGTA ACGATTATTC CAACCGATAA ACGGATCGAT GGCGTTTCTG AAATAATTTT AGACTCTATC ATTAAACTCC CCCTGAGCAT TCGCTCTAAT CGACGTATTT TTAAAGACAA AGAGAATGCC TTCTGTCGCA CAACTTTAAA CATGACGACA TTAGGTGGAG GAAGGGTTGT TAAAACCGGC GACTATAAAA ATATTTATGC TCGATGCGTT GTTTTGATTC GTCGAAGGGA GAATGGCTCT CTAAGCGATT TCAAGCGCTT TATTCATAAC GGCCTTGCTC ATCAACTCGG CCTCAATGCC AATGTCATTG AAGTCCGCAG CCAGGTCTTT GTGCCATGGT TTAAGGCTTT GTGGAATGCA CCCAGTGCTG CTCATAACGT GGCACCCGAG TATCAGTACC ATGCATCAAT TGTCATAGGT GCGAATGATA ATGCCGGATT GCTGTCAGCT CTGGCTCAGG TAGCCAGCCT CAATAGTGAA ATTGCATATC ATTGTTCTGT AATACACGCT TATTCGGTCG CTAACACGTT GATCAATCGG GTTGACGGGC GGACAACAAT ACCGCAACTC AAACCTGGTG CGAAACCTCG ATTAGAACCT GTACGACGTG TTCTTTCTCA TCCCCCTGAG CGTACTTATC AACCCACTGG TACGACTCCA TTCAAAATGC GTGGACTACT AAAAAGTGCA GTTAAAAGGC CAGAAGACGT TATCTCTGAT ATGGCAGGCA ACATTTATTA CGGAGGAAAA GGCGGAAAAA TTTATCGGTA TGATCCTGAA ACTGAGAACG AAACCATTAT AGCGGATACG GCCGGTAGAC CGCTGGGTCT TGAATGGCTG AAATGTGGCT CTCTCTTGGT TTGTGATGCC CACCGCGGTT TGCTTAAGGT CAGATTAGAC GGCCATATAG AAACATTGGT AGAACGAGTT CATGGCCTCC CATTACGGTT CTGCTCGAAT GCCACCGCGA GCACTGATGG AACAATTTGG TTTACTCAAT CGACGAATCG CTATGATTTC GAGCATTACC AAGGGGCAAT GATTGAACAC AGGGGGTCAG GGCAGCTTCT GCGACGTGAT ACGAATGGAC AGGTTCATGT GCTATTGGAC GGTCTGCATT TCCCTAATGG CATCACTTTA GATAGTTCGG AGAGATCTGT CATTTTTGCA GAAACAGATG CCTACCGCTT ACGCAGGCTC TGGGTGAAAG GCCCGAAGGC TGGGTGTCTG GAGATTTTTG CAGATAATCT GCCGGGTTTC CCTGACAACA TTTCACGCAT GCAAAATGGT TATTTTTGGG TGGCCATGGT TACGCCACGC AATAAACGCC TGGACCGCAT GGGAACCATG CCGGGTTTCC TTCGTAAGCT CATTTGGCGC CTTCCAAAAT TTATGCTGCC TAAAACGGCT CGAACAGTGT GGGCTATGGC CTTTAATGAT GCTGGCGAAG TGTTAGCCGA TATGCAAGGC AGCGCGGATA ACTTCTTCGC AGCCACTGGA GTGGTGGAAA CAAATGGTCG GCTGTATATG GCCTCTGTTG AGGCTGACGG TATTGCTGTC CTTGATATTA CGTCGATGCC TAAACGATAG
|
Protein sequence | MSKRILNQKK FLGITTLNWL RTDLARDACF AYWGGPHADL VSRNGFIRDY NQHHFNDDVA GLWPLTPEVV TIIPTDKRID GVSEIILDSI IKLPLSIRSN RRIFKDKENA FCRTTLNMTT LGGGRVVKTG DYKNIYARCV VLIRRRENGS LSDFKRFIHN GLAHQLGLNA NVIEVRSQVF VPWFKALWNA PSAAHNVAPE YQYHASIVIG ANDNAGLLSA LAQVASLNSE IAYHCSVIHA YSVANTLINR VDGRTTIPQL KPGAKPRLEP VRRVLSHPPE RTYQPTGTTP FKMRGLLKSA VKRPEDVISD MAGNIYYGGK GGKIYRYDPE TENETIIADT AGRPLGLEWL KCGSLLVCDA HRGLLKVRLD GHIETLVERV HGLPLRFCSN ATASTDGTIW FTQSTNRYDF EHYQGAMIEH RGSGQLLRRD TNGQVHVLLD GLHFPNGITL DSSERSVIFA ETDAYRLRRL WVKGPKAGCL EIFADNLPGF PDNISRMQNG YFWVAMVTPR NKRLDRMGTM PGFLRKLIWR LPKFMLPKTA RTVWAMAFND AGEVLADMQG SADNFFAATG VVETNGRLYM ASVEADGIAV LDITSMPKR
|
| |