Gene Smed_5193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5193 
Symbol 
ID5319495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp148533 
End bp149474 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content59% 
IMG OID640776971 
Productextracellular solute-binding protein 
Protein accessionYP_001313903 
Protein GI150377308 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.441941 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCCG CACCGGGCAA AGGACGTCAA ACGGGACGAC GGATAGTCAG ATTTGGGCTG 
ACGCTAGGAG CAGTGACCGT CAGCATGTGG GCGACCGCAG AAGCACAGAC GCTCGATCGC
GTCCGCAGCA GCAGCACCGT CAAGCTGGGC TACGATGCGA CCGCGCGGCC ATTCTCTTTC
AAGGCCGAAG GAGAAAGCGC CACCGGCTAC GCCGTCAGCC TGTGCATGGA GGTGACCGAG
GAATTGAAAC GTGAACTTGG GATTGCCGAT CTTGCGGTCG AGTGGATTGA GCTCACCAGG
GATGCTGCCG ACAACGCCAT ACGACAAGGT TCGGCCGATC TCTTCTGTGG TGCGTCGCCC
GTGACCTTGA CGCGCCGAAA GGAGGTTTCG TTCTCGATAC CGATCTTTCC GAGCGGAACG
GGTGCGGTAC TGAGTGCGAG CGCACCACTT GCGTTGCGTG AGGTTCTGAC GCAGGGACGC
CCTTCTGACC GGCCGATTTG GCGGGGGTCC CCCGCAAGAA CCGTGCTCAA TCAGAAGACA
TTTTCCCCGA TCGCAGGTAC TACCAGTGAG GATTGGCTTG CGGAGCGGAT AAAGACGTTT
CAGCTTTCAG CGACCATCGC TGCTGTGGAG AACTATGATC AGGGAATCGC CAATATTCTC
AACGGCGAGT CCGACGTACT CTTCGGCGAC CTGCCGCTCT TGCTCGACGC CGCCGCGCGC
GGCGAAAATT CCGGCGATCT CATCGTACTG AAGCGCCATT TCACCTACGA ACCGCTTGCG
CTTGTGCTGG CGCGCAATGA CGAGGATTTT CGAATCGTCG TTGACCGAGC CTTGAGCCGC
ACCTACCGAT CGGAAGATTT CCCGGCATTC TTCAGCGAGT GGTTCGGACC TCCTGACGAT
ACGATCGTGA CCTTCTTCCG GCAAACGACC CTGCCTGAGT GA
 
Protein sequence
MQAAPGKGRQ TGRRIVRFGL TLGAVTVSMW ATAEAQTLDR VRSSSTVKLG YDATARPFSF 
KAEGESATGY AVSLCMEVTE ELKRELGIAD LAVEWIELTR DAADNAIRQG SADLFCGASP
VTLTRRKEVS FSIPIFPSGT GAVLSASAPL ALREVLTQGR PSDRPIWRGS PARTVLNQKT
FSPIAGTTSE DWLAERIKTF QLSATIAAVE NYDQGIANIL NGESDVLFGD LPLLLDAAAR
GENSGDLIVL KRHFTYEPLA LVLARNDEDF RIVVDRALSR TYRSEDFPAF FSEWFGPPDD
TIVTFFRQTT LPE