Gene Smed_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2420 
Symbol 
ID5323281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2498105 
End bp2499367 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content60% 
IMG OID640791358 
Productextracellular solute-binding protein 
Protein accessionYP_001328087 
Protein GI150397620 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.228974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAAC TGATCACAGC CGCCCTGCTT GCCACCCTGA TGGCCGGCAG CGCCCTTGCA 
GATACGAAGC TGAAGCTCGT GGAGGTCATA ACCAGCCCCG AGCGCACCGA AACGCTGAAG
TCGATCGTCG CCAAGTTCGA AGAGGCGAAC CCCGGCACGA CCGTCGAGAT CATTTCGCTG
CCCTGGGGCG AAGCCTTCCA GAAATTCGCC ACCATGGTAT CGGCCGGCGA AATCCCCGAT
GTCATGGAAA TGCCCGACAC CTGGCTGTCG CTCTATGCCA ATAACGGCAT GCTCGAAAGC
CTCGAGCCCT ATCTCGCAAA ATGGGAGCAC ACGCCAGGCC TCACCGAGCG CGCGCTCGAA
CTCGGCCGGG ACGTCAACGA CACAGCTTAC ATGTTGCCTT ACGGCTTCTA TCTCCGGGCG
ATGTTCTACA ACAAGAAGCT GCTCTCCGAA GCGGGTGTCG CCGAACCGCC GAAGACGATG
GACGACTTCG TCAAGGCTTC CGAGGCGGTC TCCAAGCTGC CGGGCAAATC CGGTTACTGC
CTGCGCGGCG GTCCGGGCGG GCTCAACGGC TGGGTGATGT TCGGCGCGAC CATGGCCGGC
GACAACAAGT TCTTCAACGA GGACGGCACT TCCACGATGA ACAGCGAAGG CTGGATCAAA
GGCCTCACCT GGGTCATCGA CCTCTACAAG AAGGGTCTGG CGCCGAAGGA TAGCGTCAAC
TGGGGCTTCA ACGAGATCGT CGCGGGCTTC TACAGCGGCA CCTGCGCCTT TCTCGACCAG
GACCCGGATG CCTTGATCGC TATTGCCCAG CGCATGAAGC CGGAGGATTT CGGCGTGACC
ACCATGCCGA AGGGGCCGAG CGGCAAGGCC TTCACCACGA TCGGCTTCGC CGGCTGGTCG
ATCCTTGCCG CCAGCCAGAA CAAGGATCTC TCCTGGAAGC TGATCGAAAC GCTGGAAGGC
CCGGAAGGCA ATATCGAGTG GAATAAGCGC ACCGGCGCGC TGCCCGTTCA CAAGTCGGCC
GAAAAGGACC CCTTTTATGC GAGCGCGCAG TTCAAGGGCT GGTTCGACGA ACTCGCCGAC
AAGGACGTCG TGCTGACGGT CATGCCGACC TATCTCGAAG AATTCGCCTT CTTCAAAGAT
TCGCTCGCCA TCAAGACGAC CCAGGAAGCT CTCCTCGGCG ACATCACGCC GGAAGAACTT
GCCAACCAGT GGGCCGACTA CCTGACCAAG GCTCAGCAGA AATATCTCGC GAACAAGAAA
TAG
 
Protein sequence
MRKLITAALL ATLMAGSALA DTKLKLVEVI TSPERTETLK SIVAKFEEAN PGTTVEIISL 
PWGEAFQKFA TMVSAGEIPD VMEMPDTWLS LYANNGMLES LEPYLAKWEH TPGLTERALE
LGRDVNDTAY MLPYGFYLRA MFYNKKLLSE AGVAEPPKTM DDFVKASEAV SKLPGKSGYC
LRGGPGGLNG WVMFGATMAG DNKFFNEDGT STMNSEGWIK GLTWVIDLYK KGLAPKDSVN
WGFNEIVAGF YSGTCAFLDQ DPDALIAIAQ RMKPEDFGVT TMPKGPSGKA FTTIGFAGWS
ILAASQNKDL SWKLIETLEG PEGNIEWNKR TGALPVHKSA EKDPFYASAQ FKGWFDELAD
KDVVLTVMPT YLEEFAFFKD SLAIKTTQEA LLGDITPEEL ANQWADYLTK AQQKYLANKK