Gene Smed_3693 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3693 
Symbol 
ID5318302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp134401 
End bp135582 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content61% 
IMG OID640775506 
Productectoine utilization protein EutD 
Protein accessionYP_001312439 
Protein GI150375843 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID[TIGR02993] ectoine utilization protein EutD 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAAC CCAACCTCAA ATTCTCCCTT GAGGAATATG AAGCGCGGCT CGCGAAGACC 
CGAAAGGCCA TGGAGGCGAA AGGCGTCGAC CTCCTGATCG TGAGCGACCC CTCAAACATG
GCCTGGCTCA CAGGCTATGA CGGCTGGTCC TTCTACGTAC ATCAGGCGGT CATCGTACCG
CCTTCGGGCG AACCGATCTG GTTCGGCCGC GGCCAGGATG CCAATGGGGC GAAGTTTACC
GCCTATCTCA AACACGAGAA CATCATCGGC TACCCCGATC ACTACGTGCA GTCGACCGAA
CGTCACCCCA TGGACTATCT CTCGGGTATC CTCGCCGACC GCGGGTTCGG TTCGCTCCGC
ATCGGTGTAG AAATGGACAA TTACTGGTTC TCGGCCGCTG CGTTTGCGTC GCTGCAGAAG
CGCCTCCCCA ATGCGCGTTT CGTCGACACG ACGGCGCTGG TGAACTGGCA GCGTGCCGTC
AAAAGCGAGA CGGAAATCAA ATATATGCGC AACGCCGCCC GCATTGTCGA AGCCATGCAT
CAGCGCATCT TCGACAAGAT CGAAGTGGGC ATGCGCAAAT GCGATCTGGT GGCGGAAATC
TACGATGCGG GCACGCGGGG CGTTGATGGC ATCGGCGGCG ACTATCCGGC AATCGTGCCG
CTTCTCCCTT CCGGCGTCGA GGCTTCAGCG CCGCATCTCA CCTGGGACGA CCGGCCTATG
AAGCGAGGCG AGGGCACCTT TTTCGAGATC GCCGGCTGCT ACAATCGCTA TCACCTGCCG
CTGTCGCGCA CGGTGTTCCT GGGCAAGCCG ACCCAGGCCT TCCTCGACGC CGAGAAGGCG
ACTCTCGAAG GCATGGAGGC GGGGCTCGCA GTCGCCAAGC CCGGCAATAC CTGTGAGGAC
ATCGCGAACG CGTTCTTCTC CGTGTTGAAG AAATACGGGA TCGTCAAAGA CAACCGTACC
GGCTATCCCA TCGGTCTTTC CTATCCGCCG GACTGGGGCG AGCGCACGAT GAGCCTGCGC
TCGGGCGACA GGACGGAACT GAAGCCGGGA ATGACTTTCC ACTTCATGAC GGGCCTCTGG
CTCGAGGACA TGGGTTTCGA GACGACCGAA AGCATTCTCA TCACCGAGAG CGGCGTCGAG
TGCCTTGCCT CCGTGCCGCG CAAGCTGATG GTCAAGGACT GA
 
Protein sequence
MTQPNLKFSL EEYEARLAKT RKAMEAKGVD LLIVSDPSNM AWLTGYDGWS FYVHQAVIVP 
PSGEPIWFGR GQDANGAKFT AYLKHENIIG YPDHYVQSTE RHPMDYLSGI LADRGFGSLR
IGVEMDNYWF SAAAFASLQK RLPNARFVDT TALVNWQRAV KSETEIKYMR NAARIVEAMH
QRIFDKIEVG MRKCDLVAEI YDAGTRGVDG IGGDYPAIVP LLPSGVEASA PHLTWDDRPM
KRGEGTFFEI AGCYNRYHLP LSRTVFLGKP TQAFLDAEKA TLEGMEAGLA VAKPGNTCED
IANAFFSVLK KYGIVKDNRT GYPIGLSYPP DWGERTMSLR SGDRTELKPG MTFHFMTGLW
LEDMGFETTE SILITESGVE CLASVPRKLM VKD