Gene Smed_0327 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0327 
Symbol 
ID5321160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp356839 
End bp357879 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content60% 
IMG OID640789262 
Productextracellular solute-binding protein 
Protein accessionYP_001326020 
Protein GI150395553 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0824216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATTT CGAAAGCGCT GATCGGAGCG CTGTCCGTAG CGACGGCATT CCTGGGTTCT 
ACGGCGGCCC ATGCCGAAGG AGAAGTCAAC ATCTACTCCT ACCGGCAGCC GGAACTCATT
CAGCCGCTAT TGGACGCCTT CACCAAGGAG ACGGGCATCA CGACCAACGT GCTCTTCCTC
GACAAGGGCC TTGTCGAGCG CATCCAGGCC GAAGGCGCGA ACTCTCCTGC CGACGTAATC
CTGACTGTGG ACATCAGCCG CCTGACCGAA GCCAAGGACG CGGGCGTGAC GCAGCCGGTG
GTGAACGAGA CCATCAACAA GGACATTCCC GAGCATTTCC GCGATCCGGA CGGCAATTGG
TTCGGCCTGA CGACGCGCGG CCGCGTCGTC TATGCCTCCA AGGAGCGCGT CGCACAGGAC
GAGATCACCT ATGAAGACCT CGCCGATCCG AAGTGGAAGG GCAAGATCTG CACCCGTGAC
GGTCAGCACT CCTACAATGT CGGCCTCTTC GCTTCCATGA TCGCCCACCA CGGTGAAGCC
GAAACGGAAA AATGGCTCGC CGGCCTCAGG GACAATCTTG CCCGGAAGCC GGACGGCGGC
GACCGCGATC AGGCCAAGGC GATCTTCGCC GGCGAATGCG ACGTGGCACT CGGCAACAGC
TATTATGTCG GCCGGATGAT GACCAACGAG AAGGAGCCCG AACAGAAGGA TTGGGCCGCG
GCCATCAAGG TTCTTTTCCC GAACGCCAAG GACCGGGGCA CGCATGTGAA CATCTCCGGC
ATGGCGCTCG CCAAGAACGC GCCGAACAAG GAGAATGCCC TGAAGCTCAT GGAATTCCTG
TCCGAAGGCG AAGCGCAGAA GATCTATGCC GAACAGGTCT TCGAATATCC TGTTCTGCCC
GGCGTCGAAA CATCGGAAGT CGTCAAATCC TTCGGTGAGA TCAAGCCGGA CACGCTGCCG
CTTGCCAAGA TCGCGGCAAA CCGCAAGAAG GCCTCGGAAC TCGTCGACAA GGTCGGCTAC
AACGAAGGTC CGCAGGATTG A
 
Protein sequence
MQISKALIGA LSVATAFLGS TAAHAEGEVN IYSYRQPELI QPLLDAFTKE TGITTNVLFL 
DKGLVERIQA EGANSPADVI LTVDISRLTE AKDAGVTQPV VNETINKDIP EHFRDPDGNW
FGLTTRGRVV YASKERVAQD EITYEDLADP KWKGKICTRD GQHSYNVGLF ASMIAHHGEA
ETEKWLAGLR DNLARKPDGG DRDQAKAIFA GECDVALGNS YYVGRMMTNE KEPEQKDWAA
AIKVLFPNAK DRGTHVNISG MALAKNAPNK ENALKLMEFL SEGEAQKIYA EQVFEYPVLP
GVETSEVVKS FGEIKPDTLP LAKIAANRKK ASELVDKVGY NEGPQD