Gene Smed_4184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4184 
Symbol 
ID5319350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp662918 
End bp664210 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content60% 
IMG OID640775989 
ProductABC transporter periplasmic solute-binding protein precursor 
Protein accessionYP_001312922 
Protein GI150376326 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.11561 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATCA GGGCACGCGT CACAGGCGCA TTCCTAGCCG CCGTCCTTTC GACGACCGCG 
TTCAACGGCG CCTTCGCGGC GGAGGACACG ATCAAGGTCG GCATCCTGCA CTCGCTTTCG
GGAACCATGG CGATATCCGA AACGACGCTC AAAGACGCAA TGCTGATGCT GATCGAGGAG
CAGAACAAGA AGGGCGGCGT GCTCGGCAAG AAGCTTGAAG CGGTCGTAGT CGATCCGGCC
TCGGACTGGC CGCTCTTCGC CGAGAAGGCC CGTGAGCTCG TTTCCGTCAA CAAGGTCTCC
GCCGTCTTCG GCTGCTGGAC ATCGGTGTCG CGCAAATCCG TATTGCCGGT CTTCGAAGAG
CTGAACTCGA TCCTCTTCTA TCCCGTCCAG TACGAAGGCG AAGAGAGCCA GCGCAACGTC
TTCTACACGG GCGCCGCTCC TAATCAGCAG GCGATCCCCG CGGTGGACTA CCTGATGGAA
AACGAGGAAG TCGAGCGCTG GGTGCTCGCC GGTACCGACT ATGTCTATCC GCGCACGACA
AACAAGATCC TCGAGGCCTA CCTCATCTCA AAGGGCGTCA AGCCTGAGGA CATCATGATC
AACTACACGC CATTCGGTCA TTCCGACTGG CAGACGATCG TCTCCGATAT CAAGAAGTTC
GGCTCCGCCG GGAAGAAGAC GGCGGTCGTC TCGACGATCA ACGGCGACGC CAATGTGCCC
TTCTACAAGG AGCTCGCGAA CCAGGGCGTC AAAGCCGAGG ACATCCCGGT CGTGGCCTTC
TCGGTCGGCG AGGAGGAGCT TGCGGGCCTC GATACCGGTC CGCTCGTCGG GCATCTTGCC
GCGTGGAACT ATTTCCAGTC GGTCGACAGT CCGGCCAATG CCGCGTTCAT CGAAACCTGG
AAGGCTTATA CCAAAAACGA CAAGCGGGTC ACAAACGACC CGATGGAAGC CCATTATATC
GGCTTCAACA TGTGGCTGAA GGCAGTCGAG AAGGCTGGGA CCACCGACAC GGATGCCGTG
CTCGACGCGA TGATCGGCGT GTCGGTGCCG AACCTTTCGG GCGGTTATTC CACCATGATG
CCGAACCACC ATATCACCAA GCCGGTGCTG ATCGGCGAGA TCCAGTCTGA CGGCCAGTTC
GAAACAGTCT GGGAAACGCC CGGCCTCGTC CTCGGCGACG AATGGTCGGA CTATCTGCCG
GACTCGAAGG ACCTGATTTC CGATTGGCGC GCGCCCATGT CATGCGGCAA CTTCAATGTT
GCCACAGGAA AGTGCGGCGG CAAAGGTTCC TGA
 
Protein sequence
MTIRARVTGA FLAAVLSTTA FNGAFAAEDT IKVGILHSLS GTMAISETTL KDAMLMLIEE 
QNKKGGVLGK KLEAVVVDPA SDWPLFAEKA RELVSVNKVS AVFGCWTSVS RKSVLPVFEE
LNSILFYPVQ YEGEESQRNV FYTGAAPNQQ AIPAVDYLME NEEVERWVLA GTDYVYPRTT
NKILEAYLIS KGVKPEDIMI NYTPFGHSDW QTIVSDIKKF GSAGKKTAVV STINGDANVP
FYKELANQGV KAEDIPVVAF SVGEEELAGL DTGPLVGHLA AWNYFQSVDS PANAAFIETW
KAYTKNDKRV TNDPMEAHYI GFNMWLKAVE KAGTTDTDAV LDAMIGVSVP NLSGGYSTMM
PNHHITKPVL IGEIQSDGQF ETVWETPGLV LGDEWSDYLP DSKDLISDWR APMSCGNFNV
ATGKCGGKGS