Gene Smed_2326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2326 
Symbol 
ID5323187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2407024 
End bp2408664 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content59% 
IMG OID640791264 
Productextracellular solute-binding protein 
Protein accessionYP_001327993 
Protein GI150397526 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.294702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCACA AGCTCAATGG ACGTTTTCGA ATTCTCGCCG CCTCGGCGGC CCTCGCCATG 
GCGATGGGCG CGGCTCAGCC GGCCTTCGCG GAAACGCCGA AGGACACGCT GGTCGAGGGT
TTCGCATTCG ACGACATCAT CACCATGGAT CCCGGCGAAG CCTTCGAGCT TTCGACCGCC
GAGATGACGA GCAATACCTA CAGCCTGCTC GTGAGGCTCG ATCTCAACGA TACCTCCAAG
GTCGTAGGCG ATCTTGCGGA AAGCTGGACG GTCTCGGATG ACGGCCTGAC CTATACGTTC
AAGCTCAAGC CGGGAATGAA ATTCGCATCC GGCAATCCGA TCACCGCCGA GGACGTCGCC
TATTCGTTCG AACGTGCCGT CAAGCTCGAC AAGAGCCCGG CCTTCATCCT CACCCAGTTC
GGCCTTACCG GGGACAATGT CACGGAAAAG GCGAAGGCCG CCGACCCGGA GACCTTTGTT
TTCACGGTCG ACCAGCCCTA CGCGCCGAGT TTCGTCTTGA ACTGTCTGAC CGCGACGGTT
GCTTCCGTAG TCGACAAGAA GCTCGTGCTC GAACATGTGA AATCCGTGTC GCCGAGCGAC
GAGTACAAGT ATGACAACGA CTTCGGCAAT GAGTGGCTGA AGACCGGTTA TGCCGGTTCC
GGTCCGTTCA AGCTGCGCGA GTGGCGCGCC AATGAAGTCG TGGTGCTGGA ACGCAACGAC
AATTATTATG GCGAACCGGC GAAACTCGCC CGCGTCATCT ACCGTCACAT GAAGGAAAGC
TCGGGTCAGC GGCTCGCGCT TGAAGCCGGC GACATCGATG TCGCGCGCAA CCTCGAGCCC
GGCGACTACG ACGCAGTCGG CAAGAATGCC GATCTGGCGA CGGCCAGCGC CCCGAAGGGA
ACGGTCTACT ATATCAGCCT CAATCAGAAG AACGAAAAGC TCGCAAAACC CGAGGTACAG
CAGGCGTTCA AGTATCTTGT CGATTACGAC GCGATTGGCT CGACCCTGAT CAAGGGCATC
GGCGAGATTC ACCAGAGCTT CCTGCCGAAG GGTGTGCTGG GTGCCGTCGA CGAGAACCCC
TACACCTTCG ACGTAGCCAA GGCGAAGGAA CTGCTGGCGA AGGCCGGCTA TCCGGACGGC
TTCACCGTTA CGATGGATGT GCGTAATACC CAGCCGGTCA CCGGCATTGC CGAATCCTTC
CAGCAGACGC TGGGGCAGGC GGGCGTGAAG CTCGAAATTA TTCCAGGAGA CGGCAAGCAG
ACCCTGACCA AGTACCGCGC CCGCAATCAC GACATGTATA TCGGCCAGTG GGGCATGGAT
TATTTCGATC CGCACTCCAA TGCCGATACC TTCACCAACA ATCCGGACAA TTCCGACGAA
GGCACGAACA AGACGCTCGC CTGGCGCAAC GCCTGGGACG TTCCGGAACT CAGCAAGAAG
ACCAAGGACG CGCTCCTCGA ACGCGACAGC ACAAAGCGCG CCGAGATCTA CAAGGAGCTG
CAGAAAACGG TGCTCGAGGA CAGTCCTTTC GTCGTCATCT TCCAGCAGAC AGAGGTCGCC
GGGTTGCGCG GCATTGTCGA GGGCTTCAAG CTCGGGCCGA GCTTCGACAC CAACTACGTC
TGGAACGTCT CCAAGGAATA G
 
Protein sequence
MMHKLNGRFR ILAASAALAM AMGAAQPAFA ETPKDTLVEG FAFDDIITMD PGEAFELSTA 
EMTSNTYSLL VRLDLNDTSK VVGDLAESWT VSDDGLTYTF KLKPGMKFAS GNPITAEDVA
YSFERAVKLD KSPAFILTQF GLTGDNVTEK AKAADPETFV FTVDQPYAPS FVLNCLTATV
ASVVDKKLVL EHVKSVSPSD EYKYDNDFGN EWLKTGYAGS GPFKLREWRA NEVVVLERND
NYYGEPAKLA RVIYRHMKES SGQRLALEAG DIDVARNLEP GDYDAVGKNA DLATASAPKG
TVYYISLNQK NEKLAKPEVQ QAFKYLVDYD AIGSTLIKGI GEIHQSFLPK GVLGAVDENP
YTFDVAKAKE LLAKAGYPDG FTVTMDVRNT QPVTGIAESF QQTLGQAGVK LEIIPGDGKQ
TLTKYRARNH DMYIGQWGMD YFDPHSNADT FTNNPDNSDE GTNKTLAWRN AWDVPELSKK
TKDALLERDS TKRAEIYKEL QKTVLEDSPF VVIFQQTEVA GLRGIVEGFK LGPSFDTNYV
WNVSKE