Gene Smed_4146 details


Gene Information

Locus tag: Smed_4146
Symbol:
ID: 5319142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 619137
End bp: 620054
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 65%
IMG OID: 640775951
Product: PaaX family transcriptional regulator
Protein accession: YP_001312884
Protein GI: 150376288
COG category: [K] Transcription
COG ID: [COG3327] Phenylacetic acid-responsive transcriptional repressor
TIGRFAM ID: [TIGR02277] phenylacetic acid degradation operon negative regulatory protein PaaX
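
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 619137
end_bp = 620054

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the computed values agree with the listed Gene Length (918 bp) and Protein Length (305 aa).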


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.85173
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCAGGGGC AGGGCGACAA CTCGGCAAAG GCGGGCTCAA AGGCCATCCG GGAAATTCTG 
GATGAAGTGC CCCTCCGAGC CGCGAGCTTC ATCGTCACGA TCTATGGGGA CGTGGTCGAG
CCACGCGGCG GCGCGATTTG GATCGGCAAC CTGATCGAGA TCTGCGCCGG TGTCGGAATC
AGCGAAACGC TTGTGAGAAC CGCCGTATCC CGCCTCGTTG CAGCCGGGCG GCTTGCCGGA
GAGCGGGAGG GACGGCGCAG CTTTTACCGC CTCACCGATG CCGCGCGTAC GGAGTTCGCC
GGCGCCGCGC GGGTTATTTT CGGGCCTCCG GAGGAGGCGA GCTGGCACTT CGTGCAGCTG
ATGGGTTCAT CGGCCGAGGA CCGGATGCTG ATGCTCGAAC GATCCGGTTA TGCGCGCCTG
AGCCCTCGGC TTGCAATCGG CGTGCGGCCT TTCCCGACCG CGATCATGCC CGCGCTGGTC
TTCAGAGCGG AGATTGCCGA AGGTGCGGAA GAGTTGAAGG CGTTCGCCTC GGCCTCGTGG
GATCTCGCAC CCCATGCAGA GGCGTACCGG CGATTTCTCG GCTCCTTTCA ATCGCTTGCA
GCACCTGCGG ATGCTGGGTC GATGACACCT GCGGATTGTC TTACCGCGCG CCTCCTGCTG
GTGCATCAGT TCCGCTTTGT GGCGCTGCGC GATCCGCGCC TGCCTGCCGA AACCCTCCCG
GAAGGCTGGC CGGGTGACGA GGCCCGCCGG CTGTTCGCCA GGCTCTACCT CAATCTTTCT
CATCCGGCTG ACCTGCATGC GGCGCAGCAC TTCGTTGCGG CTTCGGGGCC GCTGGCAGCC
TCCACGGGAG CGACCGAGGA ACGGTTTCGG ATGCTGCGGA GGGAAGGTGT TCCCGCGAGC
GGCTGTAAAT CCGTTTGA
 
Protein sequence
MQGQGDNSAK AGSKAIREIL DEVPLRAASF IVTIYGDVVE PRGGAIWIGN LIEICAGVGI 
SETLVRTAVS RLVAAGRLAG EREGRRSFYR LTDAARTEFA GAARVIFGPP EEASWHFVQL
MGSSAEDRML MLERSGYARL SPRLAIGVRP FPTAIMPALV FRAEIAEGAE ELKAFASASW
DLAPHAEAYR RFLGSFQSLA APADAGSMTP ADCLTARLLL VHQFRFVALR DPRLPAETLP
EGWPGDEARR LFARLYLNLS HPADLHAAQH FVAASGPLAA STGATEERFR MLRREGVPAS
GCKSV
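
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a rough sketch rather than the method the database itself uses: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns amino acids the same way as the standard code and differs only in which start codons it allows.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line fragment of the gene sequence, as formatted in the record.
print(translate("ATGCAGGGGC AGGGCGACAA CTCGGCAAAG"))
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and GC content fields of the record.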