Gene Smed_0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0102 
Symbol 
ID5320930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp113078 
End bp114103 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID640789034 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001325797 
Protein GI150395330 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0193785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGA CTGTTCTTTC TGCCGCATTC GGCGCGCTTG CACTCGGCGT GGCCTTCGCA 
TCGCCTTCGC AGGCGGCCGA CGTGTCGGCC TGCCTCATCA CCAAAACCGA CGCCAATCCT
TTCTTTGCGA AAATGAAGGA AGGCGCGACC GCCAAGGCCA AGGAACTGGG CGTGGCCCTG
AAGTCCTATG CCGGTAAGAT CGATGGAGAT TCCGAGAGCC AGGTTGCCGC GATCGAGACA
TGCATCGCCG ACGGTGCGAA AGGTATTCTG ATCGCCGCCT CCGACACCCA GGGCATCGTG
CCTCACGTCA AGAAGGCGCG GGACGCCGGT CTCCTGGTCA TCGCACTCGA TACGCCGCTC
GAGCCGCTCG ACTCCGCCGA CGCGACCTTT GCAACGGACA ACCTGCTCGC CGGCAAGCTG
ATCGGGCAAT GGGCTGCCGC AACGCTCGGC GACGCCGCCA AGGACGCCAA GGTGGCATTC
CTCGACCTTA CGCCGTCTCA GCCTTCCGTC GACGTGCTGC GCGACCAGGG CTTCATGATC
GGCTTCGGCA TCGACCCCAA GGACCCGAAC AAGATCGGCG ACGAGGATGA TCCGCGCATC
GTCGGCCATG ACATCACCAA CGGCAACGAA GAAGGCGGCC GGTCTGCAAT GGAGAACCTC
CTCCAGAAAG ATCCGACCAT CAATGTCGTC CACACGATCA ACGAACCGGC GGCCGCCGGC
GCCTACGAGG CGCTGAAGGC TCTCGGCCGC GAGCAGGACG TGCTGATCGT TTCCGTCGAT
GGCGGTTGCC CGGGGGTCAA GAACGTCGCC GAGGGTGCAA TCGGAGCGAC GTCGCAGCAA
TACCCGCTGA TGATGGCGGC GCTCGGCATC GAGGCAATCA AGAAGTTCGC TGACACCGGC
GAAAAGCCGG TGCCGACAGA GGGCAAGGAT TTCGTGGACA CGGGAGTCTC GCTCGTCACC
GACAAGCCGG TTGATGGTCT GGAATCGATC GACACCAAGA CCGGCCTGGA GAAGTGCTGG
GGCTGA
 
Protein sequence
MKKTVLSAAF GALALGVAFA SPSQAADVSA CLITKTDANP FFAKMKEGAT AKAKELGVAL 
KSYAGKIDGD SESQVAAIET CIADGAKGIL IAASDTQGIV PHVKKARDAG LLVIALDTPL
EPLDSADATF ATDNLLAGKL IGQWAAATLG DAAKDAKVAF LDLTPSQPSV DVLRDQGFMI
GFGIDPKDPN KIGDEDDPRI VGHDITNGNE EGGRSAMENL LQKDPTINVV HTINEPAAAG
AYEALKALGR EQDVLIVSVD GGCPGVKNVA EGAIGATSQQ YPLMMAALGI EAIKKFADTG
EKPVPTEGKD FVDTGVSLVT DKPVDGLESI DTKTGLEKCW G