Gene Smed_4908 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4908 
Symbol 
ID5317885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1417869 
End bp1418831 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content60% 
IMG OID640776692 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001313624 
Protein GI150377028 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.852955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCA TGAAGGCACT TGCGAGTGCA ACGATCCTTG CTGCCTGCAC TTTCGGCAGC 
GCTTCGGCCG CGGATCTCGT CGTCGGCTTT TCTCAGATCG GATCGGAGTC CGGCTGGCGC
GCAGCCGAGA CGACGCTGAC GAAACAGCAG GCAGAAGAGC GCGGCATCGA CCTCAAATTT
GCCGATGCGC AGCAGAAACA GGAAAACCAG ATCAAGGCAA TCCGTTCCTT TATCGCCCAG
GGCGTGAACG CGATTCTTCT GGCCCCGGTC GTGGCGACCG GCTGGGATGA AGTGCTGGAA
GAGGCGAAGG ATGCGGAAAT CCCGGTCATA CTGCTCGACC GAACCGTCGA CGCTTCAAAG
GATCTTTATC TGACTGCAGT CACGTCCGAT CTCGTTCACG AAGGCAGCGT GGCCGGCAAA
TGGCTTGTCG ACACCGTTGC GGGCAAGCCG TGCAACGTCG TCGAACTCCA GGGCACCACC
GGCTCCTCGC CGGCCATCGA CCGCAAGAAG GGCTTTGAGC AGGCGCTCTC CGGCAACGAC
AATCTGAAGA TCGTGCGTAG CCAGACAGGC GATTTCACCC GCACGAAGGG CAAGGAAGTG
ATGGAAAGCT TCCTCAAGGC CGAGGACGGC GGCAAGAACA TCTGTGCGCT CTACGCCCAT
AACGACGATA TGGCGGTGGG CGCGATCCAG GCGATCAAGG AAGCCGGCCT GAAGCCCGGC
AAGGACATCC TCGTCGTCTC AATCGACGCT GTGCCCGACA TCTTCCAGGC TATGGCCGCC
GGAGAAGCAA ATGCGACGGT CGAGCTCACG CCAAACATGG CAGGCCCTGC CTTCGATGCA
CTTGCAGCCT ACCTCAAGGA CGGCAAAGAG CCTCCGAAGT GGATCCAGAC GGAATCGAAG
CTCTACACCC AGGCCGACGA TCCGATGAAG GTCTACGAAG AAAAGAAGGG TCTCGGTTAC
TGA
 
Protein sequence
MKLMKALASA TILAACTFGS ASAADLVVGF SQIGSESGWR AAETTLTKQQ AEERGIDLKF 
ADAQQKQENQ IKAIRSFIAQ GVNAILLAPV VATGWDEVLE EAKDAEIPVI LLDRTVDASK
DLYLTAVTSD LVHEGSVAGK WLVDTVAGKP CNVVELQGTT GSSPAIDRKK GFEQALSGND
NLKIVRSQTG DFTRTKGKEV MESFLKAEDG GKNICALYAH NDDMAVGAIQ AIKEAGLKPG
KDILVVSIDA VPDIFQAMAA GEANATVELT PNMAGPAFDA LAAYLKDGKE PPKWIQTESK
LYTQADDPMK VYEEKKGLGY