Gene Smed_5883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5883 
Symbol 
ID5320185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp846809 
End bp847879 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content66% 
IMG OID640777578 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001314510 
Protein GI150377915 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.92391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCAGA ACCCGCCCCG ATCCACATCC GTCACCGTTG CCGACGTCGC ACGCAAGGCA 
GGCGTCTCCA AGGCGACGGC AGCGCGCGTG CTGGGCGGCT ACGGAACGGT GAGCGACCCG
GTGCGCGACG CCGTGACAGC CGCTGCCCGG GCACTCGATT ACCGTCCGAA CGAGCTCGCC
CGCAGCATGA CGACGGGCCG CTCCGGCACG ATCGGCGTCG TGGTGGGCGA CATAGAGAAC
CCATTTTTCA GTCTTGCCAT GCGGGGCATC ACCGACGTGG CCCGCCAGGC CGGCTTTACG
GTCATCCTCA TAAATTCCGG CGAGGACGTG GCTGTCGAGA AGGCGGCCAT TCGCACCCTG
CTGGCCAAGC GCGTGGACGG CTTGATCGTC TCGCCGGCCA AGGAAAGCAA TGTCGATCAC
CTTCAGGAAG CCGCCCGCTC GGGCCGGCCG CTGGCGCTGC TCGACCGCGG CAGTGAGACG
CTCGACGTTG ACACCGTTAT TGCCGATGAC AGACACGCCG CCGAAGGCAT CACGCGACGG
CTCATTGCGC TCGGCCATCG CCGCATCGCC TATATCACTG CGTGCGACAC ACCGGATCAT
GTTTTCCGCG TGCCCTCAGA CGTAAATACG GGCTCGGTGC GCCGGCGCGT CGAAGGTTTT
CTTGGCGTCT GCCGGGAGGC TGGCCTTCAG GGAATGGAAG GCTGGGTGCG TGTGGGCGCG
ATCACGCCGG ACCATACGCG GGGCATCGTC TCGGCGATGT TGCAGTCGAG CGAGCGCCCG
ACCGCGATCA TCGCCTCCGA CAGCGTGATC GGCCTCGAAG TTTTCAAGAC CAGCCGCGCA
GCCGGCATTG CTATCCCGGA CGAGCTGTCG CTCGTATCGT TCCATGACGC CGATTGGACC
TCGGTCACCT CGCCCCCTGT GACGGTGGTG AGGCAACCCG TCTATCGCCT GGGCGAAACA
GCCGCGAAAC TGCTGGTCGA GCGGCTGAAC GGATATGAAG CAAGTGCCCG CCGAGTCGTG
CTGCAAACCG AACTCATCGA ACGGGCTTCC GTCGCCGACG CGCCGGCATG A
 
Protein sequence
MDQNPPRSTS VTVADVARKA GVSKATAARV LGGYGTVSDP VRDAVTAAAR ALDYRPNELA 
RSMTTGRSGT IGVVVGDIEN PFFSLAMRGI TDVARQAGFT VILINSGEDV AVEKAAIRTL
LAKRVDGLIV SPAKESNVDH LQEAARSGRP LALLDRGSET LDVDTVIADD RHAAEGITRR
LIALGHRRIA YITACDTPDH VFRVPSDVNT GSVRRRVEGF LGVCREAGLQ GMEGWVRVGA
ITPDHTRGIV SAMLQSSERP TAIIASDSVI GLEVFKTSRA AGIAIPDELS LVSFHDADWT
SVTSPPVTVV RQPVYRLGET AAKLLVERLN GYEASARRVV LQTELIERAS VADAPA