Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5719 |
Symbol | |
ID | 5320021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 689424 |
End bp | 690701 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640777441 |
Product | epocide hydrolase domain-containing protein |
Protein accession | YP_001314373 |
Protein GI | 150377778 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCTA ATTCCTTGAG CCGACGCGGT TTTATGGCGG CTGCCGCGAT TGTCAGCGCG GCATCCATGT TGAAGCCCCC CATGTCGTTT GCTGCCAGTA GTGCGGCGGA CGAGATCCGA CCCTTTAAAG TCAATATTCC CGACAGCGCG ATACAGGAAC TCAGACAGCG GCTTGCCGTG GCTCGCTGGC CCGCAAGGGA GAACGTTGCA GATGATTCCC AAGGAGTCCA GCTCGATCGT CTGCGTAGTC TCGTCGACTA TTGGGCAGAT GGCTACGACT GGCGAAAAGT CGAAGCACAA CTCAACGCGC TGCCAATGTT TATTACAGAA ATAGAGGGCC TCGACATTCA CTTTATTCAC GTCCGCTCAA AACACGAAGA CGCCCTGCCG CTGATTATGA CGCATGGTTG GCCCGGCTCA GTGCTGGAGA TGCTCAAAGT CATTGATCCG TTGACAAATC CCACCGCGCA TGGTGGCACA GAGGAGGACG CATTCCATCT GGTAGTTCCG TCGATTCCTG GCTTTGGCTT CTCGGGAAAG CCGAACGTCC GCGGCTGGGG TTCGGACCAT ATCGGTCGCG CCTGGGGCAC GCTCATGAAC CGATTGGGAT ATGAAAGGTT CGTCTCTCAG GGCGGTGATT GCGGATCGGT CATCTCTCAG CGCATGGCGC ACCAGAACGT TCCTGGTCTT ATTGGTATTC ACCTCAACAT GCCCGCAATC GTTCCCAAAG AGATCGTGCC AATTCTGGCC GCCGGAGCTC CCGCACCCGC TGATCTCACC GATGAGGAGC GTGCATGTTT CGACAAGCTC GCGATTTTCT ATCGTGGCAG CGCCGCGTAC GCACAAATGA TGGTCACAAG ACCTCAGACA ATTGGCTATT CGCTCGTAGA TTCGCCGATC GGAATGGCCG CTTGGATGTA TGACAAGTTC GCTGAGTGGA CGTATAGCAA CAAGCAGCCC GAAAAGGTCC TTACACGCGA CGAAATGCTT GATGACATTT CGCTGTATTG GTTTACTGAG ACGGGTGCCT CGTCTGCCAA GATTTACTGG GAAGACCACA CCAACAACTT CAATGCGCTT GAAAAGATAC CTTCTGTTCC CGTCGCAGTT AGTGTCTTTC CGGGAGAAAT CTATCAAGCT CCAAAATCTT GGACTGAGCG GGCTTTTGGC AACCTCATCT ATTTCAATAA GGTTCCGAAC GGCGGCCACT TTGCCGCATG GGAGCAGCCT ACAATCTTCA CGGAGGAAGT TCGGGCGGCG TTCCGGACCA TGCGCTAA
|
Protein sequence | MSSNSLSRRG FMAAAAIVSA ASMLKPPMSF AASSAADEIR PFKVNIPDSA IQELRQRLAV ARWPARENVA DDSQGVQLDR LRSLVDYWAD GYDWRKVEAQ LNALPMFITE IEGLDIHFIH VRSKHEDALP LIMTHGWPGS VLEMLKVIDP LTNPTAHGGT EEDAFHLVVP SIPGFGFSGK PNVRGWGSDH IGRAWGTLMN RLGYERFVSQ GGDCGSVISQ RMAHQNVPGL IGIHLNMPAI VPKEIVPILA AGAPAPADLT DEERACFDKL AIFYRGSAAY AQMMVTRPQT IGYSLVDSPI GMAAWMYDKF AEWTYSNKQP EKVLTRDEML DDISLYWFTE TGASSAKIYW EDHTNNFNAL EKIPSVPVAV SVFPGEIYQA PKSWTERAFG NLIYFNKVPN GGHFAAWEQP TIFTEEVRAA FRTMR
|
| |