Gene Smed_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1503 
Symbol 
ID5322361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1583967 
End bp1585385 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content64% 
IMG OID640790450 
ProductGntR family transcriptional regulator 
Protein accessionYP_001327182 
Protein GI150396715 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0921845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACTT GGCTGCCAGA CATCGAACAG GGCCACGGCC CGCTTTACGC GCGCATCGCC 
GACCAGATTG AAGAGGCGAT AGGCAACGGC ACCCTGCCCG TCGGCACGAA GTTGCCGCCC
CAACGCAATC TTGCATTCGA TGTCGGCGTG ACGATCGGCA CGATCGGACG CGCCTATGGG
ATTGTCCGCG AGCGAGGCCT GGTCAGCGGC GAGGTCGGGC GCGGTACCTA CGTTCTCGAC
CACCCGGAGA GCCGGCCGCC GGAACAGTCG GACCCGCTGA CGACGTCGCT GTCAGGGACG
CGCCCGCTCA TCGCCCCTGC CGGAAAGCTC CGCTTCGACA GCACGGCCGC GCCCGACATA
GGACAGGGTG ACATTCTGGC TAGATTGCTC AGCGAGATCA GCCGCGAGCA TCATCGGGAC
ATTGCGAGCT ATGCCCGCAA TTTTCCAGAG CATTGGTTCG AGGCAGGGTC TCAATGGCTC
GCACGGGAGA GCTTCCGCCC GGCGCCGGAA ACGGTGGTTC CGACGCTCGG CGCCCACGCC
GCAGTCGTCG CGGTAATCTC CGCCGTCACC TCGCCTGGCG ATCGCATTGC CTTCGAGACT
CTGACCTACT CCCAGATCAG CCGCAGCGCA GGCCTCATCG GCCGGCGGAT CGCACTGGTC
GAGAGCGACG AGTTCGGAAT GCGGCCGGAA GACTTCGAGC GCGTCTGCGC ACAACAGCAC
CCGAAACTCG CCTTTCTCAT GCCCGGCGCC CAGAATCCGA CCGTCGCCGT CATGCCCCTC
GACCGGCGCC GGGCGATTGC CGATATAGCG CGCAAGTACG GCGTCTGGCT GATCGAGGAC
AACCTCTACG GCTCGATGAT CGGAGACCCG CTTCCGCTGC TCGTGGAGCT TGCGCCCGAG
CGGACTTTTC TTGTCGGCGG GCTCTCGAAG TCCGTTGCAG CCGGCGTACG CGGCGGCTGG
GTCGCTTGCC CGCCGCATTT CAGTCAACGT ATTCGCGTGG CCCATAAGAT GGTGAGCGGC
GGCCTGCCTT TCATTCTCGC AGAACTATGC GCCCGCCTGG TCCTCTCGGG ATCCGCATCC
GTATTGCGTA ATCGCGGCGT GGAGGAAATC GGTGCGCGCG TAGCGTTGGC TCGCGAAATC
TTTTCCGGGT TCGAGTTCAA CTCGCATTCC AAGATCCCGT TTTTCTGGCT GAAACTGCCC
GAGCCGTGGC TTTCCGGAAC ATTCAAGCAG GCCGCTCTTC AGGAAGGCGT GCTCATCGAC
GACGAGGACG AGTTCAAGGC CGGACGTTCC GACCGGGTTT TCCATCGCAT CCGCGTCGGC
TTCTCCTCTC CCGTCGACCG ATCGGAGGTG AAGCGAGGCT TCGACGTTCT GCGGCGTCTG
CTCGACAGTG GACGCGTCGG ATACGACAGT TTCGATTGA
 
Protein sequence
MTTWLPDIEQ GHGPLYARIA DQIEEAIGNG TLPVGTKLPP QRNLAFDVGV TIGTIGRAYG 
IVRERGLVSG EVGRGTYVLD HPESRPPEQS DPLTTSLSGT RPLIAPAGKL RFDSTAAPDI
GQGDILARLL SEISREHHRD IASYARNFPE HWFEAGSQWL ARESFRPAPE TVVPTLGAHA
AVVAVISAVT SPGDRIAFET LTYSQISRSA GLIGRRIALV ESDEFGMRPE DFERVCAQQH
PKLAFLMPGA QNPTVAVMPL DRRRAIADIA RKYGVWLIED NLYGSMIGDP LPLLVELAPE
RTFLVGGLSK SVAAGVRGGW VACPPHFSQR IRVAHKMVSG GLPFILAELC ARLVLSGSAS
VLRNRGVEEI GARVALAREI FSGFEFNSHS KIPFFWLKLP EPWLSGTFKQ AALQEGVLID
DEDEFKAGRS DRVFHRIRVG FSSPVDRSEV KRGFDVLRRL LDSGRVGYDS FD