Gene Smed_2134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2134 
Symbol 
ID5322994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2200942 
End bp2202036 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content63% 
IMG OID640791072 
Productextracellular solute-binding protein 
Protein accessionYP_001327802 
Protein GI150397335 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.575498 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.605089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAC GCCTGCTCTC GCTTTCGACC GCCATGCTGC TCGCTACGAC GGCAGTTGCC 
GTTGCCGAGC CGAGCGAGGA GCTCATCGCA GCCGCCAAGA AGGAAGGCAC GCTGACGACA
ATCGCGCTCC CGCACAATTG GTGCGGGTAT GGAGACGTGA TCGCCGGCTT CAAGGCCAAA
TACGGTATCG AAGTCAACGA ACTGAACCCG GATGCGGGCT CCGGCGACGA GATCGAGGCG
ATCAAGGCCA ACAAGGGCAA CACCGGCCCG CAAGCACCCG ATGTCATCGA CGTCGGCCTG
TCCTTCGGCC CCTCGGCCAA GGCCGAAGGC CTGATCCAGC CCTACAAGGT TTCGACCTGG
GATACGATTC CGGACACGGC GAAGGACCCG GAGGGCTATT GGTACGGCGA TTACTACGGC
GTTCTCTCCT TCGTGGTGAA CACCGATATC GTCAAGGATG TGCCGAAGGA CTGGGCGGAC
CTCAAAAAGT CCGACTATGC GAATTCGGTT GCGCTCGCCG GCGATCCGCG GGCATCCAAC
CAGGCGGTAC AGGCGGTCTA CGCAGCCGGC CTCGCGGCCG GTGAGACGGA TGCGGCCAAA
GCGGGCGAAG CCGGTCTTGC CTTCTTCGCC GAGGTCAACA AGGCCGGCAA CTTCGTCCCT
GTGATCGGCA AGTCCGCCTC CCTTGCGCAA GGATCGACCC CGATCATCAT CGCCTGGGAC
TATAACGGCC TCTCCTGGCG CGACAGCCTG AACGGCAACC CGCCGGTCGA GGTCGTCGTT
CCAGCCTCCG GTGTCGTCGC CGGCGTCTAC GTCCAGGCGA TCTCGGCCTT CGCGCCGCAT
CCGAACGCGG CCAAGCTCTG GATGGAATAT CTGTATTCGG ACGAAGGTCA GCTCGGCTGG
CTGAAGGGCT ATTGCCACCC GATCCGCTTC AACGACCTCG TCAAGAACGG CAAGGTTCCG
CAGGAAATGC TCGACAAGCT GCCGCCGGCG GCATCCTACG AGAAGGCCGT CTTCCCGACG
CTCGAAGAGC AGGAAGCCGG CAAGGCAGCG ATCACCACGA AGTGGGATAG CGTCGTCGGC
GCGAGCGTAC AGTAG
 
Protein sequence
MTQRLLSLST AMLLATTAVA VAEPSEELIA AAKKEGTLTT IALPHNWCGY GDVIAGFKAK 
YGIEVNELNP DAGSGDEIEA IKANKGNTGP QAPDVIDVGL SFGPSAKAEG LIQPYKVSTW
DTIPDTAKDP EGYWYGDYYG VLSFVVNTDI VKDVPKDWAD LKKSDYANSV ALAGDPRASN
QAVQAVYAAG LAAGETDAAK AGEAGLAFFA EVNKAGNFVP VIGKSASLAQ GSTPIIIAWD
YNGLSWRDSL NGNPPVEVVV PASGVVAGVY VQAISAFAPH PNAAKLWMEY LYSDEGQLGW
LKGYCHPIRF NDLVKNGKVP QEMLDKLPPA ASYEKAVFPT LEEQEAGKAA ITTKWDSVVG
ASVQ