Gene Smed_4395 details


Gene Information

Locus tag: Smed_4395
Symbol:
ID: 5319160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 890916
End bp: 892178
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 59%
IMG OID: 640776199
Product: extracellular solute-binding protein
Protein accession: YP_001313132
Protein GI: 150376536
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
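
The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 890916
end_bp = 892178

# Inclusive coordinates: the span covers both endpoints.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1263 420
```

Both values agree with the Gene Length and Protein Length fields of the record.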


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.87283
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCATC TTTTGAAAAC ACTGGCAGGT ATGACCGTCA TTGCCGTCGT ATCCGCCTTT 
CCCGCAAAGG CGGATACCGT TTCGATGTTT TGCTCGGCGA CCGACTACGA GCTTTGCGAG
AAGGCTGTCC AGAAATGGAC GAAAGAAACC GGCCATGACG TGAAACTCAA CCGGATGCCG
CAGAACCTCG ACGACGCCAT TCCGATCTAT CAGCAATTGT TCGCGGCCCA GTCGACCGAC
ATGGATGTCC TCTACATCGA CGTCATCTGG CTGGGTATGT TCAAGGATCA TCTCCTCGAC
CTGACGTCTC TCGTACCTGA GGAGGAGGTG AAGGCGCATT TCGCCTCTGC CGCGGATGCG
GCGCGCCTCG ACGGCAAGCT CCTGTCGATG CCTTTCTACA TCGACACCGG CCTGATGTTC
TATCGTAAGG ACCTGCTGGA GAAGTACGGC AAGCAGCCTC CGAAGACCTG GGACGAACTG
ACGGCGACCG CCAAGGAGAT TCAGGATGCG GAACGCAAGG TCGGCAGTCC GGATATATGG
GGCTATGCCT GGCAGGGCCG GAGCTATGAG GGCCTGACCT GCGATGCGCT GGAGTGGATC
GCTTCGGCCG GCGGCGGCAC GATCCTTTCC GACGACGGAG AGGTGACAAT CAACAATCCC
AAGACGGAGG CGGCTTTGAG CCGTGCGCGC GGCTGGATCG GGACGATTTC GCCTGAGGGG
GTTCTGAACT ACGATGAGGA AAACTCGCGT GCCCTCTTCG AGAGCGGCAA TGCCGTCTTT
CACCGGAACT GGCCTTATGT GTGGGGAACG TCGCAGGCCG AAGGCGGCAA GCTCGTCGGC
AAGGTCGGGG TGAGCGCGCT TCCGGTGGGT GCGGAAGGCC AGAAGTCGAG CGGTGCGCTC
GGTACCGCCT ATCTCGGCGT TTCCAAATAT TCCAAGAATC CGGAGCTTGC AGCGGAGCTG
CTGCGCTACA TGGTAGGTGC GGAAGACCAG AAGATGCGTG CAATCGAAGG CGGCTACAAT
CCGACCGTGG AGGCGCTCTA CGAAGACGCC GATGTGCTGG CGAAGATTCC GTTCCTCGGC
ATGGCGAAGA CCGCGTTCGA AGAATCGGTC GCGCGTCCCT CGGCAGCCAC GGGCAAGAAC
TATAATCGTG TTTCCCGTAC CTTCTACCGG GCGGTTCACG ACATCATCTC CGGCAAGGAC
GATGTCGCGA AGGAACTCGC CGATCTCGAG CGACGCCTCG AACGCGACGT TAAAGCGAAA
TGA
 
Protein sequence
MKHLLKTLAG MTVIAVVSAF PAKADTVSMF CSATDYELCE KAVQKWTKET GHDVKLNRMP 
QNLDDAIPIY QQLFAAQSTD MDVLYIDVIW LGMFKDHLLD LTSLVPEEEV KAHFASAADA
ARLDGKLLSM PFYIDTGLMF YRKDLLEKYG KQPPKTWDEL TATAKEIQDA ERKVGSPDIW
GYAWQGRSYE GLTCDALEWI ASAGGGTILS DDGEVTINNP KTEAALSRAR GWIGTISPEG
VLNYDEENSR ALFESGNAVF HRNWPYVWGT SQAEGGKLVG KVGVSALPVG AEGQKSSGAL
GTAYLGVSKY SKNPELAAEL LRYMVGAEDQ KMRAIEGGYN PTVEALYEDA DVLAKIPFLG
MAKTAFEESV ARPSAATGKN YNRVSRTFYR AVHDIISGKD DVAKELADLE RRLERDVKAK
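
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that, under stated assumptions: it uses the standard codon assignments (translation table 11 shares its amino-acid assignments with the standard table and differs in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical, introduced only for illustration. It is demonstrated on the first 30 bases of the gene sequence; the full sequence can be pasted in the same way.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence, copied from the record.
first_block = "ATGAAGCATC TTTTGAAAAC ACTGGCAGGT"
print(translate(first_block))  # MKHLLKTLAG
```

The output matches the first ten residues of the protein sequence. Calling `gc_percent` on the full gene sequence gives a value to compare with the GC content field.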