Gene Smed_5662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5662 
Symbol 
ID5319964 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp628249 
End bp629883 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content60% 
IMG OID640777394 
Productextracellular solute-binding protein 
Protein accessionYP_001314326 
Protein GI150377731 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.76313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCAA GACAGACTGG TACTTTGAAG CCGACGCGCC GCGCGTTTTT GCTTTCCTCC 
GTTGCACTCG CGACCGCTAT TGCCGTGCCA TTGCTGCCGG TACCTTCGTT CGCCGCCGCC
GAAGAGCCGG CGCGCGGCGG CAGCGTGTCG ATCAATATTG GCACCGAGCC GCCGGTTCTG
GTGCTCATCG CTCACAGTGC CGGCGCTGCC TATTACATTA GCGGCAAGGC AACCGAGAGC
CTGCTGACCT ATGACAAGGA GTTCAATCCT CAGCCGTTGC TCGCCACCGA GTGGACGGTG
AGCGAAGACG GCCTGCGCTA CTGGTTCAAG CTGCGCCAGG GCGTTCGTTG GCACGACGGC
AAGGATTTCA CCGCCGAAGA CGTCGCCTTC TCGATCTTGG CGCTGAAGGA AAACCATCCG
CGCGGCCGCG CGACCTTTGC CCATGTGAAA GAGGCCAATG TTCTCAATTC GCATGAGGTG
GAACTGGTCC TTGCCAAGCC GGCACCCTAC CTGCTGACGG CATTTGCAAG CTTTGAGGCG
CCGATCGTTC CCAAACATCT CTACGAGGGC ACGCAGATTG CGGAGAACCC GCACAACGTC
GCGCCGATTG GCACCGGCCC CTACAAGTTC GTCGAATGGG TTCGCGGCAG CCACGCACTT
TTCGTGCGCA ACGAAGACTA TTGGGGTTCG CCCAAGCCTT ACCTCGACCA GATCATCTTC
CGGTTCATCG TCGATCCCGC CGCAGCTGTG GCCGCGATTG AGACCGGCGA GGTGCAGGTC
TCGACGGCCA ACCTGCCGCT GACCGATATC GACCGCCTGA AGGCCAATCC GAACCTCGTC
GTCGATACCG ACCCGGCGCC GTATTCACCG AGCATTGCCC GCGCGGAGTT TAACCTTGAG
AACAAGTATC TGGCCGACAT CAAGGTGCGA CATGCGATCG CGCATGCGAT CGACAAGGAT
TTCATCGTCA ACACCGTCTA CCTCGGCTAC GCCACCCGCC TCGACGGACC GGTCAGCCCC
GACCTTGCGA AGTTCTACTC GCCGGACCTT CCCAAGTACG AGTTCGACCC GGCGAAGTCC
GAGAAGCTTT TGGATGAGGC AGGTTACTCG CGCGGCGCCG ATGGTGTTCG CTTCAAGCTA
TTCATCGATC CGACCCAGCC GTCCGGTCCT CCCAAGCAGA CCGCGGAATA CATCGCACAG
GCGCTTGCCA AGGTCGGCAT CAAGGTGGAA CTGCGAACGC AGGACTTCGC GACTTTCGTC
AAGCGCGTCT TCACTGATCG AGACTTCGAC ATCGCCATCG AGGGCATGAG CAATCTCTAC
GACCCGACCG TCGGCGTCCA GCGCCTCTAC TGGTCGAAAA ACTTCAAGCC GGGCGTCCCC
TTCACCAATG GTTCCAAATA TTCGAATCCC GAAGTCGACC GGCTGCTCGA AACGGCCGCC
GTCGAGATCG ACCCGAAGAA GCGGTTGGAA CTCTTCAACG AGTTCCAAAA GCTGGTGGTG
GAGGACTTGC CGACGCTCGA TATCGTCACG CCGGCGGTGA TCACCGTCTA CGACAAGCGG
TTGAAGAACC TGAAACTTGG TGTCGAGCAT CTCTGGGGCA ACGGGGCGGA CATCTATCTC
GACGGACAAT CGTAA
 
Protein sequence
MTSRQTGTLK PTRRAFLLSS VALATAIAVP LLPVPSFAAA EEPARGGSVS INIGTEPPVL 
VLIAHSAGAA YYISGKATES LLTYDKEFNP QPLLATEWTV SEDGLRYWFK LRQGVRWHDG
KDFTAEDVAF SILALKENHP RGRATFAHVK EANVLNSHEV ELVLAKPAPY LLTAFASFEA
PIVPKHLYEG TQIAENPHNV APIGTGPYKF VEWVRGSHAL FVRNEDYWGS PKPYLDQIIF
RFIVDPAAAV AAIETGEVQV STANLPLTDI DRLKANPNLV VDTDPAPYSP SIARAEFNLE
NKYLADIKVR HAIAHAIDKD FIVNTVYLGY ATRLDGPVSP DLAKFYSPDL PKYEFDPAKS
EKLLDEAGYS RGADGVRFKL FIDPTQPSGP PKQTAEYIAQ ALAKVGIKVE LRTQDFATFV
KRVFTDRDFD IAIEGMSNLY DPTVGVQRLY WSKNFKPGVP FTNGSKYSNP EVDRLLETAA
VEIDPKKRLE LFNEFQKLVV EDLPTLDIVT PAVITVYDKR LKNLKLGVEH LWGNGADIYL
DGQS