Gene Smed_4877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4877 
Symbol 
ID5318924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1382989 
End bp1384011 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content60% 
IMG OID640776662 
Productextracellular solute-binding protein 
Protein accessionYP_001313594 
Protein GI150376998 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID[TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily
[TIGR03261] putative 2-aminoethylphosphonate ABC transporter, periplasmic 2-aminoethylphosphonate-binding protein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.713524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.38801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTTCT CAAAGGCCTT TCTGGGCGCC GCGACCGCCT TCCTTCTCGC ATCCACCGCC 
GCCTATGCCG AAGCCGAACT TACGGTCTAC ACGTCTGTCG AGGCGGTTGA CCTCGACCGT
TACAAGGAGA CTTTCGAGAA GGCTCACCCC GACATCAAGA TCAACTGGGT CCGCGACTCG
ACAGGCGTGA TGACCGCCAA GCTACTGGCG GAGAAGGATA ATCCGCAGGC GGACGTTGTG
TGGGGCGTGG CTGCGACATC GCTGCTGCTC CTGAAGTCCG AGGGCATGCT CGAACCCTAT
GCCCCGAAGA ATGTCGAGGC CCTGGATCCG AGATTCGTCG ATGGCGACAA GCCGCCGAGC
TGGGTAGGGA TGGACGCATA TGTGGCGGCT CTCTGCTACA ACACGGTGGA GGCCGAGAAG
CTCGGCCTGA CGCCGCCAAC CAGCTGGAAG GATCTGACCA AGCCCGAATA CAAGGGTCAC
GTCGTGATGC CCAACCCCAA TTCCTCCGGA ACAGGCTTCC TCGACGTCTC CGCCTGGCTT
CAGACGTTCG GCGAAGAGGA AGCCTGGTCC TTCATGGACG CCCTGCACGA GAACATTGCC
GCCTATACCC ATTCGGGTTC CAAGCCTTGC AAGATGGCAG CGTCCGGCGA AACCGTCATC
GGCGTCTCCT TTGAGTTTCC GGGCGCCAAG GCGAAAACGT CGGGCGCGCC GATCGACATC
ATTTTTCCCG CTGAAGGATC GGGCTGGGAA GCAGAGGCCA CGGCGATCAT TGCAGGAACG
GCCAATCTCG AGGCGGCGAA AACGCTGGTC GACTGGTCGA TCAGCAAGGA AGCCAACGAG
ATGTACAATG TCGGTTATGC AGTCGTGGCT TATCCGGGAG TCGCCAAGCC GATCGAGAAT
CTTCCCGACG ACGTTGCCGA GAAGATGATC GAGAACGACT TCGAGTGGGC CGCGAACAAC
CGTGCCCGTA TTCTGAAGGA ATGGCAGAAG CGTTACGACG CAAAGTCCGA GCCCAAGTCC
TGA
 
Protein sequence
MPFSKAFLGA ATAFLLASTA AYAEAELTVY TSVEAVDLDR YKETFEKAHP DIKINWVRDS 
TGVMTAKLLA EKDNPQADVV WGVAATSLLL LKSEGMLEPY APKNVEALDP RFVDGDKPPS
WVGMDAYVAA LCYNTVEAEK LGLTPPTSWK DLTKPEYKGH VVMPNPNSSG TGFLDVSAWL
QTFGEEEAWS FMDALHENIA AYTHSGSKPC KMAASGETVI GVSFEFPGAK AKTSGAPIDI
IFPAEGSGWE AEATAIIAGT ANLEAAKTLV DWSISKEANE MYNVGYAVVA YPGVAKPIEN
LPDDVAEKMI ENDFEWAANN RARILKEWQK RYDAKSEPKS