Gene Smed_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2098 
Symbol 
ID5322958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2158833 
End bp2159864 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content60% 
IMG OID640791036 
Productextracellular solute-binding protein 
Protein accessionYP_001327766 
Protein GI150397299 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0603168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.126386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGA TCAAGCTTCT CACGGCCGGT GTTTTTGCCG GTCTCGCCGT TACCACAGGT 
CAGGCGTCTG CCTCCGTCCT CGACACCGTC AAGCAGCGCG GCACACTGAA CTGCGGCACC
GACAACACCG CTCCCGGTTT CGGCTACCTC AATACGACCA CAGGCCAGAT GGAAGGGCTG
GACGTAGACT TCTGCAGGGC GGTTGCAGCG GCGGTCCTCG GCGACGCGTC CAAGGTCAAA
TTCGTCACCG TAACGGACAA AAGCCGCTTC GACGCCGTCC TGACGAACCA GGTGGACGTC
GTCTTCGCAC ACACGACCAT GAAGCCGGCC CGCGAATCCT CGATCGCCAT AGATTTTCTG
CCGGTCAACT TCTATGACGG CACGGGTATC ATGGTGAAGA CGGATTCCGA GGTGGTGCAG
TTCGCCGACC TCGAAGGTGC GACGTTCTGC ACGACTCAGG GTTCCGTGAC CGAAACCGTG
CTTACCAGCG CTTTCAAGGC CAATGGATGG CAGGGCTCCA AGGTTCTCAC CTACGAAAAC
CTCGAAAAGC TGTTCGCCGC GCTCAACTCC GGTCGCTGCA ACGCGATGAG CACAGACAAG
TCCGCGCTTG CGGCCTGGGC CGGCAACTCG CCGAAGCCGT CCGATTATCT CATCCTGCCG
GAAACCCTCG ACAAGTCGCC CTTCGCTGGT TTCGTCGCGG CCAATGATTC CAAATGGCGC
AATGCGCTGC GCTGGATCAC CTACGGCCTG TTCCAGGCGG AGGAGTCCGA CATCACACAG
GCCAATCTCG AAGAAAAGCT GAAGAGCGAC GACCCGTTCG TTCAGAAGTT TCTCGGCGTG
GGAGGCGGCT ACGGCAAGGA CTTCGGCCTT CCGGACGATT TCGTTGCGCA GGCCATAAAG
GCGATGGGCA ATTACGGCGA GATTTACGCC CGCAATCTCG GTCCGGATAC GAGCATGTTC
CTTGACCGGA AAGGTACGCC TAACGCTCTT TGGACAGAGG GCGGCGCGAT CTACTCGCCC
CTCTGGAACT GA
 
Protein sequence
MKLIKLLTAG VFAGLAVTTG QASASVLDTV KQRGTLNCGT DNTAPGFGYL NTTTGQMEGL 
DVDFCRAVAA AVLGDASKVK FVTVTDKSRF DAVLTNQVDV VFAHTTMKPA RESSIAIDFL
PVNFYDGTGI MVKTDSEVVQ FADLEGATFC TTQGSVTETV LTSAFKANGW QGSKVLTYEN
LEKLFAALNS GRCNAMSTDK SALAAWAGNS PKPSDYLILP ETLDKSPFAG FVAANDSKWR
NALRWITYGL FQAEESDITQ ANLEEKLKSD DPFVQKFLGV GGGYGKDFGL PDDFVAQAIK
AMGNYGEIYA RNLGPDTSMF LDRKGTPNAL WTEGGAIYSP LWN