Gene Smed_5345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5345 
Symbol 
ID5319647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp306693 
End bp307865 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content60% 
IMG OID640777118 
Productextracellular ligand-binding receptor 
Protein accessionYP_001314050 
Protein GI150377455 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.040092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.403221 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAGA CTTCCAAACT GTTTCTTGCT GCGGCAGGCA TCGCGCTTGC CTCTTCCGCA 
TCCTTTGCCG ACGACGACAA GATCGTGATC GGCGGAGCAC TCTCGATGAC CGGCATCCAG
GCGCCGCTCG ACACGCCCGG CTACAACGGC GCGCAGGTCG CCGTAAAATT TCTCAACGAC
AATGGCGGCG TCCTCGGCAA GCAGATCGAA TTCATCAACA TCGATGGCAA ATCGGATCCG
GTAACAGTCG GAAACGTCGC GGTGGAATTG ATCGACAAGG GTGCGCAGGT CATCGTGGCG
CCTTGCGATT TCGACTTCGG CTCGCCCGCA AGCCGCGAGG CGCAGTCGGC CGGCCTCGTC
GGTATTTCGA CCTGCGCCTC CGACCCGCTC TATTCCTCCT GGTCCCTCGG TGACAAGCAG
TTCACGCTTT CCATGTGGAA CACCACGATG GGCGCGACGG CCGCCGATTT CGCCGTGAAG
GAAAAGGGCT GGAAGACTGC TTACGTCGTC ACCGACCAGT TCATCGCCTA TACCAAGTCG
CTCTCGAAAT ATTTCGTGGA GCAGTTCAAG GCCAATGGCG GCGAAATCCT CCTCGAAGAC
ACCTACACCA ACGGCGACAA CAATTTCTCC GCCCAGCTCG CCCGCCTCCA GGCGCTCGGC
AAAAAGCCGG ACGTGATCTT CGTCTCCTCC TATGGAACGG ACATCGGCGT GATCATCCGC
GCGCTTCGCG AAGTCGGCTA CGATGCGCCG GTTCTGGGCG GCGACGCCTA TGACGACCCG
GCCATGCACC AGGCACTCGG CGAACAATTC GGCAATCACG TCTACTTCGT TACTCATACC
TGGATGGGCC CGGAAGCCCA CCCGGAGATG CCGAAATTCA TCGAACTCTA TACGGAGATG
TTCGGCAAGG CACCGGACAC ATCCTTCGTC TCGACGGGCT GGGACACGAT CATGCTGCTC
GCGGAAGCAA TCAAGGCGGC CGGGACGACC GAAGGCGCAG CGCTTGCCAA GGCACTGGAG
GATGGCCAGT TCAAACTGTT GACCGGCGAT CTCGACTACG GCACGAACGA GGAAGGCCAT
GTACCGAACA AGGCCGCGGC CGTGATCGAG CTCAAGGCCG GAAAGCCGAG CTTCGTCGGA
TGGCGCAAGC CGGAATCCCT GCCGAAGCCC TGA
 
Protein sequence
MIKTSKLFLA AAGIALASSA SFADDDKIVI GGALSMTGIQ APLDTPGYNG AQVAVKFLND 
NGGVLGKQIE FINIDGKSDP VTVGNVAVEL IDKGAQVIVA PCDFDFGSPA SREAQSAGLV
GISTCASDPL YSSWSLGDKQ FTLSMWNTTM GATAADFAVK EKGWKTAYVV TDQFIAYTKS
LSKYFVEQFK ANGGEILLED TYTNGDNNFS AQLARLQALG KKPDVIFVSS YGTDIGVIIR
ALREVGYDAP VLGGDAYDDP AMHQALGEQF GNHVYFVTHT WMGPEAHPEM PKFIELYTEM
FGKAPDTSFV STGWDTIMLL AEAIKAAGTT EGAALAKALE DGQFKLLTGD LDYGTNEEGH
VPNKAAAVIE LKAGKPSFVG WRKPESLPKP