Gene Smed_4286 details


Gene Information

Locus tag: Smed_4286
Symbol:
ID: 5319122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 779607
End bp: 780854
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 61%
IMG OID: 640776091
Product: extracellular solute-binding protein
Protein accession: YP_001313024
Protein GI: 150376428
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
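
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(779607, 780854)
print(length)                  # 1248
print(protein_length(length))  # 415
```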


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.683579
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.881811
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA CGGCATTCTT CGCATCGGCC GCTACCATCA TGCTGTCGGG CCTCGTAGCG 
GTTCAGGCCG CACGGGCCGA GTGCGGCATA GAAAAGGGCA CGGTGCGCAT CCTCTCGAAC
GATTTCGAAG CCCTGCGTCT CGTGGTGTCG GAAGCCGAGA AATGCGCCTC GGCGACGGTC
AAGGTCAGCA AGAACCAAAC GAGCGAGCAC AAGAACCTGC AAGTGCCGGC GCTCAAGATC
AATCCGGCCC AATATACTGT CGCGGTGATT GCCAACAATT CGATCGTTCC GCTGCTGAAC
GAAGGTCTGC TGCGCCCCCT TGACGATCTC GTTGCCAAGT ACGGCCAGGA TCTCCAGCCG
ACCCAGCTCA TCAAACTCGA CGGCAAGGTT ATGGCGATCG CCTTCATGGG CAACTCGCAG
CATCTATTCT TCCGCAAGGA CATCCTCGAA AAGGCCGGGC TTCAACAGCC GAAGTCCTAT
GAGGACGTGC TCGCAGCGGC CAAGGCGATC AAGGAGAAGG GGCTGATGCA GTATCCGCTC
GCCGCCTCCA ACAAGGCCGG CTGGGATCTC GCGGCCGAAT TCGTCAACAT GTATCTCGGC
TATGGCGGAG AGCTCTTCGC GGCCGGTTCG GCAGCCCCCG CCATCAACAA CGAGAAGGGT
CTCGCCACTC TGAAGACGAT GAAGGCGATG ACCGAGTACA TGAACCCCGA CTACATGACC
TACAATGCCG ACGAGATCGT CAAGGGCTAT GCGGCCGGCA AGACGGCGAT CATCAATGCG
TGGGGCTCGC TTGCGGGCGG CGTGATCGAT CCGGTCAAAA CGCCGGCCGA GATCGCCGAC
AACACGGTTC TCGGCGCAGC GCCGACCGTT GGGGGCGGCT CGATCCCTGC AGCGGCCCTC
TGGTGGGACG GTTTCTCCAT CGCCAAGAAC ATTTCCGACG AAGATGCGGA GGCATCTTTC
CGCGCCATGG TCCATGGCAT CCGGCCGGAA GTTGGCCAGA AACATCCGGC TCTTGCGACC
TGGCTGACCA AGGGGTACCA GCCTGGACCC AACGCGGTCG GGGTTCTCGC GACGGCGAAT
GGCGGCGCGA AGCCCTATCC GATGCTGCCT TATATGGGCC TGCTGCACTC GGCTCTCGGC
TCCGAACTGG CGGAGTACAT GCAGGGGCGT GAAAGTGCAG AGGAGGCGCT GAAAGATGTC
GAGGCATCCT ACAGCGCGGC AGCGAAAGAG GGCGGATTCC TGAAATGA
 
Protein sequence
MKKTAFFASA ATIMLSGLVA VQAARAECGI EKGTVRILSN DFEALRLVVS EAEKCASATV 
KVSKNQTSEH KNLQVPALKI NPAQYTVAVI ANNSIVPLLN EGLLRPLDDL VAKYGQDLQP
TQLIKLDGKV MAIAFMGNSQ HLFFRKDILE KAGLQQPKSY EDVLAAAKAI KEKGLMQYPL
AASNKAGWDL AAEFVNMYLG YGGELFAAGS AAPAINNEKG LATLKTMKAM TEYMNPDYMT
YNADEIVKGY AAGKTAIINA WGSLAGGVID PVKTPAEIAD NTVLGAAPTV GGGSIPAAAL
WWDGFSIAKN ISDEDAEASF RAMVHGIRPE VGQKHPALAT WLTKGYQPGP NAVGVLATAN
GGAKPYPMLP YMGLLHSALG SELAEYMQGR ESAEEALKDV EASYSAAAKE GGFLK
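
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch, in plain Python with no external libraries, of how the gene sequence could be cleaned, its GC fraction computed, and the coding sequence translated. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 (the two differ in which codons are permitted as starts).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, copied as displayed in the record.
first_bases = clean_sequence("ATGAAGAAAA CGGCATTCTT CGCATCGGCC")
print(translate(first_bases))  # MKKTAFFASA
print(round(gc_fraction(first_bases), 2))
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content field of the record.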