Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5287 |
Symbol | |
ID | 5319589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 249324 |
End bp | 251270 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640777064 |
Product | serralysin |
Protein accession | YP_001313996 |
Protein GI | 150377401 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCGA CGACGACCTA TGCCCCGACT GGTAATGCCT ATATCGACGG GCTCCTTGGC GACTGGAAAT GGGCGGTCAA GGACTTCACC TTCAGCTTCC CAACCAGCGC TTCATTCTAC GGCTCCGGTT ATGGCAATGG CGAACCGCAA AAAGGTTTTG CCGCGCTGAA CGCCGCACAG CAGACTACCG CGCGAGCTGT CTTCGATCAG TTCTCCTCCG TTGCCAAAGT ATCATTCACC GAGATTGCAG AGAGCGCGAC CAAGCACGCC GATGTCCGCC TTGCCTCGTC CGACGCTCCG AGCACCGCAT GGGCGTATTT TCCGTCGACT GCTGCAGAAG GCGGGGACGC GTGGTTCAAC AGCTCGTCCG GCTATTACAG CCGCCCGATG AAAGGAAACT ACGCCTATCT GACATTCCTC CACGAGATCG GCCACGCGCT TGGGCTGGAA CACGCCCATG AGGGCAACGT CATGCCTGCG AACCGCGACT CGATGGAATA CACGGTCATG AGTTATCGCT CCTATGTCGG AGCCTCGACC ACGACCGGCT ATATCAACGA GACCTGGGGA TATGCGCAAT CGCTGATGAT GTATGATATC GCTGCCTTGC AGCATATTTA CGGCGCGGAT TTCACCACCC AGAGCGGGAA CACGACTTAC CGATGGAGCC CGACGTCGGG CGAGATGTTC ATCAATGGCG TGGGCCAGAA CGCACCCGGC GGCAACAAGA TCCTGCTTAC GGTCTGGGAT GGCGGCGGAA CCGACACATA CGACTTCTCC AACTACACAA CTGCACTGAA GGTCGATCTC CGTCCGGGCG AATGGACGAC CACCTCGGCG GTCCAGTTGG CGAAGCTGCG CTACGACGAT TCGAAAGTGG CGATCGGCAA TATCGCCAAC GCCCTGCAAT ATCAGAGTGA TGCGCGCTCT CTTATCGAGA ATGCAAAGGG TGGCGCGGGC AATGACGTTA TCACCGGAAA CGCTGCTGCA AATGTTCTTT GGGGTAACGG CGGCAATGAC AGGCTGATCG GCGCCAACGG AAACGACAAC CTTGTTGGCG GTGCGGGCGC GGACCGGCTC GAGGGTGGCA ACGGCAACGA TCTGGCGAAC TACTACAACG CCGCGGCTGG CGTCGTTGCC GACATTTATT CTCCCGGTTC CAACAGAGGG GAAGCGGCGG GAGACACCTA TGTGTCCGTC GAGCGGCTTT ACGGTTCCGC CTTCGGCGAT ACTCTTGCCG GAGATAGGTT CGCAAACCTT CTGAACGGGC TAGCGGGTAA TGACGTGCTT CACGGGCGTG CCGGCAACGA CACTCTCATC GGCGGCGACG GAAACGACAG TCTTGTCGGC GGTGCCGGCG CCGACCGGCT CGACGGCGGC AATGGCGTCG ATCTGGCGAA TTACTACAAT GCCGCCTTGG GGCTAGTCGC TGATCTCTAT TCCCCTGTCT CGAACACCGG GGAAGCGGCC GGTGACACCT ATCTGTCCGT CGAGCGGCTC TATGGTTCAG CTTTCAACGA CAGTTTGCGC GGAGACAATA TCGCAAATCT TCTGAACGGG CTTGCCGGTA ACGATGTGCT CAACGGACGC GGTGGCAACG ACACCCTCAT CGGCGGGGAA GGCGCCGACC GGCTGATTGG CGGTGGCGGC TCGGACATGT TCGTATTTCA GACAGCGACG CAATCGCGAC CGGCTGCGAT GGACGTCATC GATGATTTTG CGCGAGGCAT TGATCGGATC GATTTGCGAT CGATCGATGC CAGCAGCAAC CTCAGCGGGG ATCAGGCGTT TCTTTTTATC GGCGGCAACG GGCTCCACGG AAAATCAGGT GAACTTAACT TCAGGAGTGG GATAGTCTCG GGTGATGTCA ATGGCGATGG CCTTGCAGAT TTTCAGATCA AGGTCATGAA TCTGTCGGCG CTTTCCGCGA GCGACTTCTT CGTCTGA
|
Protein sequence | MPSTTTYAPT GNAYIDGLLG DWKWAVKDFT FSFPTSASFY GSGYGNGEPQ KGFAALNAAQ QTTARAVFDQ FSSVAKVSFT EIAESATKHA DVRLASSDAP STAWAYFPST AAEGGDAWFN SSSGYYSRPM KGNYAYLTFL HEIGHALGLE HAHEGNVMPA NRDSMEYTVM SYRSYVGAST TTGYINETWG YAQSLMMYDI AALQHIYGAD FTTQSGNTTY RWSPTSGEMF INGVGQNAPG GNKILLTVWD GGGTDTYDFS NYTTALKVDL RPGEWTTTSA VQLAKLRYDD SKVAIGNIAN ALQYQSDARS LIENAKGGAG NDVITGNAAA NVLWGNGGND RLIGANGNDN LVGGAGADRL EGGNGNDLAN YYNAAAGVVA DIYSPGSNRG EAAGDTYVSV ERLYGSAFGD TLAGDRFANL LNGLAGNDVL HGRAGNDTLI GGDGNDSLVG GAGADRLDGG NGVDLANYYN AALGLVADLY SPVSNTGEAA GDTYLSVERL YGSAFNDSLR GDNIANLLNG LAGNDVLNGR GGNDTLIGGE GADRLIGGGG SDMFVFQTAT QSRPAAMDVI DDFARGIDRI DLRSIDASSN LSGDQAFLFI GGNGLHGKSG ELNFRSGIVS GDVNGDGLAD FQIKVMNLSA LSASDFFV
|
| |