Gene Information |
Locus tag | Smed_3692 |
Symbol | |
ID | 5318810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 133392 |
End bp | 134396 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640775505 |
Product | ectoine utilization protein EutE |
Protein accession | YP_001312438 |
Protein GI | 150375842 |
COG category | [R] General function prediction only |
COG ID | [COG3608] Predicted deacylase |
TIGRFAM ID | [TIGR02994] ectoine utilization protein EutE |
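The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_length = span_length(133392, 134396)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1005 334
```

Both results agree with the Gene Length (1005 bp) and Protein Length (334 aa) fields.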
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGAGC ATCACTTGCG GCCATCGCCG ATCAGCGCCA CCGTCGATTT CGAGGCGGAT GGCATCCAGC ACGGCTTCCT GCGCCTTCCC TATAGCCGCG ACGATTCCGC CTGGGGTTCG GTGATGATTC CGATCAGCGT GGTCAAGAAC GGGAAGGGGC CGACCGCCCT CCTGACGGGC GGCAATCACG GCGACGAATA TGAAGGGCCG ATCGCGCTTT TCGATCTCGC CCGCAAGCTC GATGCCGCCG ATGTCGAGGG AAGGGTGATC ATCGTTCCGG CAATGAACTA CCCGGCCTTC CAGGCGCAGA CGCGGACGTC GCCGATCGAC AGGGGCAATA TGAACCGCAG CTTTCCGGGC AAGCCCGACG GGACGGTCAC CCAAAAAATC GCGGACTACT TCCAGCGTGT CCTGCTGCCG CTCGCCGACA TCGTGCTCGA CTTTCATTCA GGCGGGAAGA CGCTCGACTT CCTGCCCTTC TGCGCGGCTC ATGTCCTGCC TGACAAGGCG CAGGAGGAAA GGGCCTTCGA ATTCGTTCGG GCATTCGGGG CGCCCTATTC GATGAAGATG CTGGAGATCG ATACGGTCGG AATGTACGAC ACGGCCGCCG AGGAGATGGG CAAGATCTTC GTCACCACAG AGCTCGGAGG CGGCGGATCG GCAACCGCCC GGACAGCTTC GATCGCAAAA CGCGGTGTTG TGAATGTCCT GAAGCATGCC GGGATCCTCG AGGGCGAGGT CGAGACGGCA TCCACCCGGT GGCTCGACAT GCCGAGCGAC GATTGTTTCG CTTTCGCGGA GGATGAAGGG CTCGTCGAGT TTCTCGTCGA CCTCGGCGAA GCGGTCGCAC GCGACGCGGT GATTGCGCGG GTCTATCCGA TCGGACGCAC GGGTGTCGAA CCGGTGGAGG TCCGAGCCAG GATGAACGGC CTTCTCGTGG CCAGACATAA TCCCGGGCTG ATCAAGCCGG GCGACTGCTG CGCGGTGCTG GCAGTCGAAG TATAG
|
Protein sequence | MREHHLRPSP ISATVDFEAD GIQHGFLRLP YSRDDSAWGS VMIPISVVKN GKGPTALLTG GNHGDEYEGP IALFDLARKL DAADVEGRVI IVPAMNYPAF QAQTRTSPID RGNMNRSFPG KPDGTVTQKI ADYFQRVLLP LADIVLDFHS GGKTLDFLPF CAAHVLPDKA QEERAFEFVR AFGAPYSMKM LEIDTVGMYD TAAEEMGKIF VTTELGGGGS ATARTASIAK RGVVNVLKHA GILEGEVETA STRWLDMPSD DCFAFAEDEG LVEFLVDLGE AVARDAVIAR VYPIGRTGVE PVEVRARMNG LLVARHNPGL IKPGDCCAVL AVEV
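The sequence fields are printed in space-separated blocks of ten characters. Below is a minimal Python sketch for stripping that layout, computing GC content, and translating the CDS. It assumes the nucleotide sequence is already given in coding orientation (as listed, it starts with ATG and ends with TAG, although the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for internal codons. Alternative start codons are not handled, and the helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
first_blocks = "ATGAGGGAGC ATCACTTGCG GCCATCGCCG"
print(translate(first_blocks))  # prints: MREHHLRPSP
```

The same functions can be applied to the full Gene sequence field to compare the translation with the Protein sequence field and the GC fraction with the reported 63%.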