Gene Smed_1974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1974 
Symbol 
ID5322833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2024258 
End bp2025802 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content66% 
IMG OID640790912 
ProductGntR family transcriptional regulator 
Protein accessionYP_001327643 
Protein GI150397176 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTTCC CCGCGGACCG GGCCGGCGCG CGTGCCTTTA CGGGGCGAAT ACCGAGGCGA 
CCGCTCTTGC GCCGGCTGCA GCCTCCATGC AACATCCATA CATTGAACAA TCGCAAGGTT
TCGATGAACG AGGATCGCGT TGAGGTTTCC TGGGTGCCGG CGCTCGGCAG AGCGAAGGGG
CCTCTTTATC TTGCCATTGC AGACGAGATC GCCGCCGACA TCGCCGCAGG CCGGCTGGCA
AACGGCATGC GCCTGCCGCC GCAACGGGTT CTGGCGGCGG CGCTCGGTAT CGACTTTACC
ACCGTCAGCC GCGCCTACAA CGAAGCGCGT CGGCGCGGCT TGGTCGAGGG CCGCGTGGGG
CAGGGTACCT ATGTCAGGAC CCGTGAGAGG AGTTCGGGCC GTCCGTCAGC GGATCGCCTC
GCGGCTGGTC TTGTCGACAT GAGCATGAAT CTGCCGCCGC TCTTCGACGA CCCGGCTCTG
TCGGCGAAAA TGTGGGCGGA TGTCGGGGCA CTCGGTCATG GCGGGCTTGA CCAGCGGAGC
CCTGAACTGT TGATGCGCTA TCAGCCCGTC GGCGGCACGG AGCGGGACAG GTCTGCAGGG
GCGGCCTGGC TCAAGCCGCG GCTCGGTGGA CTTCAGGCCG ATCGGATGGT CGTCTGCACG
GGCGCGCAAG GAGCATTGCT TGCGAGCGTC GGCATGCTGG CGACGAAGGG TGACAGGGTC
TGCGCCGAAG CGCTGGCCTA TCCGGGCCTC CGTTCGCTGG CCGCCTATCT GGGCATCGAA
CTCGTCAGTG TTGGGATCGA CCGCAACGGG ATCTTGCCCG AAGCCTTCGA GGAGGCGTGC
GTCCTCCATA GGCCGAAGGC TCTCTATTGT AATCCCACGC TTCACAATCC GACGACCGCC
ACGCTCCCGC TCGATCGCCG CGAAGCTATC GTGGAGATCG CACGCCGCCA CGGCGTAGCC
ATTATCGAGG ACGACGCCTA CGGGGCCTTG CCCGCAAGTC CCGTTCCGCC ATTGGCCGCG
CTCGCGCCTG ACCTCGTCTA TCATGTCGCA GGTCTTGCCA AGTGCCTCTC GCCGGCGCTT
CGCATCGCCT ATCTGGTCGC GCCTGACCGA TCGGCGGCCA TCCGCCTGGA AGGCGCGGTT
CGGGCGACGG CCGGGATGAC GTCGCCTCTT TCGGTCGCCA TCGCAACGCG TTGGCTGGAG
GAAGGAACGG CGCAGGCGGT CCTGGACGCA ATCCGCACCG AGGCCGGCGC ACGCCAGCAG
ATCGCGAGCA GGAGCCTCGA GGGCGCGGAC CTTTCGAGCG ATCGCGAGGG CTTCCATCTC
TGGCTGAAGC TTCCTTCCGG CTGGAATCGC GGTGAGTTTA CCGCACAGTT GCGAGCGGCC
GGGATCGGCG TCGTCGCAAG CGATGCCTTT GCGATATCGG ATCCACCCGA AGCTGTGCGG
CTCGGTCTCG GCGCCGCCCG AACGCGCGAT GACCTGCGGG AGAGTCTTGA CGTCATCGCC
GGCCTTCTTG CCCGCTCGCC GGCTGCACAC AATCTCATTG TCTGA
 
Protein sequence
MPFPADRAGA RAFTGRIPRR PLLRRLQPPC NIHTLNNRKV SMNEDRVEVS WVPALGRAKG 
PLYLAIADEI AADIAAGRLA NGMRLPPQRV LAAALGIDFT TVSRAYNEAR RRGLVEGRVG
QGTYVRTRER SSGRPSADRL AAGLVDMSMN LPPLFDDPAL SAKMWADVGA LGHGGLDQRS
PELLMRYQPV GGTERDRSAG AAWLKPRLGG LQADRMVVCT GAQGALLASV GMLATKGDRV
CAEALAYPGL RSLAAYLGIE LVSVGIDRNG ILPEAFEEAC VLHRPKALYC NPTLHNPTTA
TLPLDRREAI VEIARRHGVA IIEDDAYGAL PASPVPPLAA LAPDLVYHVA GLAKCLSPAL
RIAYLVAPDR SAAIRLEGAV RATAGMTSPL SVAIATRWLE EGTAQAVLDA IRTEAGARQQ
IASRSLEGAD LSSDREGFHL WLKLPSGWNR GEFTAQLRAA GIGVVASDAF AISDPPEAVR
LGLGAARTRD DLRESLDVIA GLLARSPAAH NLIV