Gene Smed_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3752 
Symbol 
ID5318742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp195709 
End bp196737 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content60% 
IMG OID640775565 
Productextracellular solute-binding protein 
Protein accessionYP_001312498 
Protein GI150375902 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.236177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTTTGG TTAAACTGGC CGCGGCAGCG CTCGCCGCGG GCGCGACGTT TTTTGCCTAT 
TCGGCGAAGG CCGATGGAAA TCTCAATCTG ATCTGTTCGG CCGATGTGGT GATCTGCGAA
CAGATGAAGG GCGCCTTCGA AAAGAAGTCC GGCGTCTCCG TCAACATGGT GCGCCTCTCT
TCGGGCGAAA CCTACGCGAA GATCCGGGCC GAGGCGCGCA ACCCCAAGAC TGACATCTGG
TGGGCCGGGA CGGGTGATCC CCATCTGCAG GCCGCTTCGG AGGGGCTGAC GGTCGAGTAC
AAGTCGCCGA TGCTCGGCGA ATTACACGAA TGGGCGGTGA AGCAGGCCGA GAGCGCCGGC
TATCGCACGG TCGGCGTATA TGCCGGCGCA CTCGGCTGGG GATACAACAC CGAGATCCTC
AAGCAGAAGA ACCTGAAGGA GCCGAAATGC TGGGCGGATC TGCTCGATCC CTCCTTCAAG
GGCGAAGTGC AAATCGCCAA TCCCAACTCT TCCGGAACCG CCTATACCGC GCTCGCGACT
CTCGTCCAGA TCATGGGTGA GGAAGAGGCT TTCGATTACC TGAAGAAGCT GAACGCGAAC
GTCTCGCAAT ATACGAAATC CGGCTCCGCA CCGGTGAAAG CCGCAGCACG GGGAGAGACC
GCAATCGGCA TCGTATTCAT GCATGATGCT GTGGCGCAGA CGGTCGAAGG GTTTCCGGTA
AAGTCGGTGG CGCCGTGCGA AGGCACCGGC TACGAAATCG GCTCGATGTC GATCATCAAG
GGCGCAAAAA ACCTCGACAA TGCCAAGAAA TGGTATGACT GGGCTCTCTC CGCCGACGTG
CAGTCCAGCA TGAAGGAGGC GAAATCGTTC CAGCTTCCTT CGAACAAGAC GGCCAAGGTG
CCCGAGGAGG CCCCGAAGTT CGAAGACATC AAGCTGATTG ACTACGACTT CAAGACCTAT
GGCGATCCGG CCAAGCGCAA GGAGCTCCTG GAGCGGTGGG ACCGGGAGAT CGGCGCGGCC
GCGAACTGA
 
Protein sequence
MGLVKLAAAA LAAGATFFAY SAKADGNLNL ICSADVVICE QMKGAFEKKS GVSVNMVRLS 
SGETYAKIRA EARNPKTDIW WAGTGDPHLQ AASEGLTVEY KSPMLGELHE WAVKQAESAG
YRTVGVYAGA LGWGYNTEIL KQKNLKEPKC WADLLDPSFK GEVQIANPNS SGTAYTALAT
LVQIMGEEEA FDYLKKLNAN VSQYTKSGSA PVKAAARGET AIGIVFMHDA VAQTVEGFPV
KSVAPCEGTG YEIGSMSIIK GAKNLDNAKK WYDWALSADV QSSMKEAKSF QLPSNKTAKV
PEEAPKFEDI KLIDYDFKTY GDPAKRKELL ERWDREIGAA AN