Gene Smed_4557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4557 
Symbol 
ID5318419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1043256 
End bp1044494 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content61% 
IMG OID640776358 
Productextracellular solute-binding protein 
Protein accessionYP_001313290 
Protein GI150376694 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.302395 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAAT CCATCCGCAA CGCTTTGATC GGCGCGACCC TTGTCGGCGC CGGCTTCACC 
GGCCATGCCC AGGCCGAAAC GACTCTGAAC GCGCTTTTCA TGGCGCAGGC CGCCTATAGC
GAGGCAGACG TCCGCGCCAT GACGGATGCC TTCGCCAAGG CCAATCCCGA CATCAAGGTC
AATCTCGAAT TCGTTCCCTA TGAGGGACTG CACGACAAGA CGGTGCTCGC GCAGGGTTCC
GGCGGCGGCT ATGACGTCGT CCTTTTCGAC GTCATCTGGC CGGCGGAATA TGCAGCCAAC
AATGTTCTCC TCGACGTGAC GGACCGCATC ACGGACGAGA TCAATCAAGG CGTCCTGCCC
GGCGCCTGGA CGACGGTGGA ATATGACGGC AAACGTTACG GCATGCCGTG GATCCTCGAC
ACGAAGTACC TGTTCTACAA CAAGGAAATC CTTGAGAAAG CCGGCATCAA GGAACCGCCG
AAAACCTGGG ACGAGCTTGC GGAACAAGCC AAAGCCATCA AGGACAAGGG ACTGCTCGAA
AACCCGATCG CCTGGAGCTG GTCTCAAGCG GAAGCGGCCA TCTGCGACTA CACCACTCTG
GTCAGTGCCT ATGGCGGAAA ATTCCTCGAT AGCGGCAAGC CGGCCTTCGC CAGCGGCGGC
GGGCTCGATG CGCTGAACTA CATGGTGACG AGCTACACCT CCGGGCTCAC CAACCCGAAT
TCCAAGGAGT TCCTCGAGGA GGATGTCCGC AAGGTCTTCC AGAACGGCGA GGCCGCCTTC
GCGCTCAACT GGACGTACAT GTACAACCTC GCCAACGATC CCAAGGAGAG CAAGGTAGCC
GGCAAGGTCG GCGTCGTTCC GGCTCCCGGC GTTGAAGGCA AAAGCGAGGT TTCGGCCGTC
AACGGCTCCA TGGGCCTCGG CATCACGACG ACCAGCAAGC ACCCCGAAGA AGCATGGAAA
TATATCGTCC ACATGACCTC GCAGGAGACG CAGAACGCCT ATGCCAAGCT GAGCCTCCCG
ATCTGGGCAT CTTCCTATGA AGACCCCGAT GTGACCAAGG GCCAGGAGGA ACTCATCGCT
GCGGCGAAGC GCGGGCTGGC CGCCATGTAT CCACGCCCAA CGACGCCGAA ATACCAGGAG
CTTTCGGCTG CCCTGCAGCA GGCCATCCAG GAGGCGCTGC TCGGCCAAGC CTCTGCGGAA
GACGCGCTGA AGAGCGCCGC TGAGAACAGT GGCTTGTGA
 
Protein sequence
MSKSIRNALI GATLVGAGFT GHAQAETTLN ALFMAQAAYS EADVRAMTDA FAKANPDIKV 
NLEFVPYEGL HDKTVLAQGS GGGYDVVLFD VIWPAEYAAN NVLLDVTDRI TDEINQGVLP
GAWTTVEYDG KRYGMPWILD TKYLFYNKEI LEKAGIKEPP KTWDELAEQA KAIKDKGLLE
NPIAWSWSQA EAAICDYTTL VSAYGGKFLD SGKPAFASGG GLDALNYMVT SYTSGLTNPN
SKEFLEEDVR KVFQNGEAAF ALNWTYMYNL ANDPKESKVA GKVGVVPAPG VEGKSEVSAV
NGSMGLGITT TSKHPEEAWK YIVHMTSQET QNAYAKLSLP IWASSYEDPD VTKGQEELIA
AAKRGLAAMY PRPTTPKYQE LSAALQQAIQ EALLGQASAE DALKSAAENS GL