Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3693 |
Symbol | |
ID | 5318302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 134401 |
End bp | 135582 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640775506 |
Product | ectoine utilization protein EutD |
Protein accession | YP_001312439 |
Protein GI | 150375843 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | [TIGR02993] ectoine utilization protein EutD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAC CCAACCTCAA ATTCTCCCTT GAGGAATATG AAGCGCGGCT CGCGAAGACC CGAAAGGCCA TGGAGGCGAA AGGCGTCGAC CTCCTGATCG TGAGCGACCC CTCAAACATG GCCTGGCTCA CAGGCTATGA CGGCTGGTCC TTCTACGTAC ATCAGGCGGT CATCGTACCG CCTTCGGGCG AACCGATCTG GTTCGGCCGC GGCCAGGATG CCAATGGGGC GAAGTTTACC GCCTATCTCA AACACGAGAA CATCATCGGC TACCCCGATC ACTACGTGCA GTCGACCGAA CGTCACCCCA TGGACTATCT CTCGGGTATC CTCGCCGACC GCGGGTTCGG TTCGCTCCGC ATCGGTGTAG AAATGGACAA TTACTGGTTC TCGGCCGCTG CGTTTGCGTC GCTGCAGAAG CGCCTCCCCA ATGCGCGTTT CGTCGACACG ACGGCGCTGG TGAACTGGCA GCGTGCCGTC AAAAGCGAGA CGGAAATCAA ATATATGCGC AACGCCGCCC GCATTGTCGA AGCCATGCAT CAGCGCATCT TCGACAAGAT CGAAGTGGGC ATGCGCAAAT GCGATCTGGT GGCGGAAATC TACGATGCGG GCACGCGGGG CGTTGATGGC ATCGGCGGCG ACTATCCGGC AATCGTGCCG CTTCTCCCTT CCGGCGTCGA GGCTTCAGCG CCGCATCTCA CCTGGGACGA CCGGCCTATG AAGCGAGGCG AGGGCACCTT TTTCGAGATC GCCGGCTGCT ACAATCGCTA TCACCTGCCG CTGTCGCGCA CGGTGTTCCT GGGCAAGCCG ACCCAGGCCT TCCTCGACGC CGAGAAGGCG ACTCTCGAAG GCATGGAGGC GGGGCTCGCA GTCGCCAAGC CCGGCAATAC CTGTGAGGAC ATCGCGAACG CGTTCTTCTC CGTGTTGAAG AAATACGGGA TCGTCAAAGA CAACCGTACC GGCTATCCCA TCGGTCTTTC CTATCCGCCG GACTGGGGCG AGCGCACGAT GAGCCTGCGC TCGGGCGACA GGACGGAACT GAAGCCGGGA ATGACTTTCC ACTTCATGAC GGGCCTCTGG CTCGAGGACA TGGGTTTCGA GACGACCGAA AGCATTCTCA TCACCGAGAG CGGCGTCGAG TGCCTTGCCT CCGTGCCGCG CAAGCTGATG GTCAAGGACT GA
|
Protein sequence | MTQPNLKFSL EEYEARLAKT RKAMEAKGVD LLIVSDPSNM AWLTGYDGWS FYVHQAVIVP PSGEPIWFGR GQDANGAKFT AYLKHENIIG YPDHYVQSTE RHPMDYLSGI LADRGFGSLR IGVEMDNYWF SAAAFASLQK RLPNARFVDT TALVNWQRAV KSETEIKYMR NAARIVEAMH QRIFDKIEVG MRKCDLVAEI YDAGTRGVDG IGGDYPAIVP LLPSGVEASA PHLTWDDRPM KRGEGTFFEI AGCYNRYHLP LSRTVFLGKP TQAFLDAEKA TLEGMEAGLA VAKPGNTCED IANAFFSVLK KYGIVKDNRT GYPIGLSYPP DWGERTMSLR SGDRTELKPG MTFHFMTGLW LEDMGFETTE SILITESGVE CLASVPRKLM VKD
|
| |