Gene Smed_4858 details

Gene Information

Locus tag: Smed_4858
Symbol:
ID: 5318843
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1358503
End bp: 1359978
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 63%
IMG OID: 640776643
Product: GntR family transcriptional regulator
Protein accession: YP_001313575
Protein GI: 150376979
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
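
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the usual convention for such records, but not stated here) and that the final codon of the CDS is a stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1358503
end_bp = 1359978
gene_length_bp = 1476
protein_length_aa = 491

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; assuming the last one is a stop
# codon, the protein has one residue fewer than the codon count.
codons = gene_length_bp // 3

print(span == gene_length_bp)           # True
print(codons - 1 == protein_length_aa)  # True
```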


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0612909
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGGCT CGACGGTATC CGAAACGATC TTCTTTCTGG ACCGAACGGG CCATGCGGGC 
TTGCAGGCGC AGATCCGGGA AACCATCGTG TCCGCGGTAT TGTCCGGCCG CCTGGCAGCG
GGTGCCCGGC TCCCTTCCAG CCGCAAGCTT GCGGCCTATC TGAACATTTC GCGGATCACC
GTGACGCTCG CCTATCAGGA GCTCGCCTCG CAAGGCTATA TCGAGGCGGA GAAGCGCAGC
GGCTACCGTG TCGCCGGCAA GCCACCGACG ACAGGCATTG CAATAGGGGC CGCGCACACT
GAAGCCGATA CGATCGACTG GTCGGCCAAG TTGTGCTCGA CCTTCATAGT GGCCAAGCAG
ATGCGCAAAC CCCTCGACTG GCGCGCCTAC CCCTACCCGT TCCTCTACGG CCAGATGGAT
CCATCGCTGT TCGACCTCAC CGCATGGCGC GACTGCGCGC GCCGGGCGCT CGGGCGAGAG
GATTTCGAAC TGATGGCGGG CGATTTCGCG GCATCGGATG ACGTCCAACT TGTCAACTAT
ATCTGCTCGC GTACATTGCC GGGGCGCGGC ATTCGTGCGA GCCCCGACGA GATCCTCGTG
ACGGTCGGCG CGCAAAACGC CCTGTGGATC GTCATCCAAC TGCTGTTGCG CCAGGGCTCA
CACGCGGTCT GCGAGAATCC ATGTCATCCC GACATGAGCG CCTCGCTCCG CTTGAGCGGC
GCCGAAGTCA CCGTCATCGA TGTCGACAAG GAAGGCCTGC CGCCCACCGC TCTTCCCGAT
AAGGTCGATG CGGTATTCGT TACGCCCAGC CATCATTCTC CTACCGGGGC GACGATGCCG
CTCGACCGGC GTGCTGCATT GCTCGAAGCC GCTGCCTCGA AGGACTTCAT AATCGTCGAA
GACGATTATG AGTTCGAGAT GAGCTTTCTG GCGCCCCCAT CCCCGGCTCT CAAGGCCTTC
GACCGCAGCG GACGCGTCTT CTATATCGGC AGTTTTTCCA AGTCGCTGTT TCCCGGTCTG
CGGCTCGGTT ATCTGGTGGC GCCGGCCGCC GTGATCCGGG AGGCGCGGGC GCTTCGCGCG
CTGATGCTAC GCCATCCGCC GGGGCACCTC CAGAGAACGG CCGCCTATTT CCTGGCACTC
GGCCACTATG ATGCCGTGCT GCACCGCATG CGCGAGGAAT ATCATCGCCG GCACATCATC
ATGGCGGCAG CACTTCGGGA TGCCGGCTTG CGGATCGCCG GGTCGGCGGC TTTCGGCGGC
ACCTCCTTCT GGATCGAAGG GCCGGATGAC CTCGATGCGG ATCTGCTGAT GAACGCATTG
CGGTTGGAAG GGGTGCTTAT AGAATCCGGA TCGCCGTTCT TCCCCAAGGA CGACGGTCCG
TGCCGCTTCT TCCGCATGGG TTACTCGTCG ATCGCCCGCA GCCGCATAGC CGACGGTGTC
GCCTTGACTG CGGCACGGAT CGCGGATCGA GCCTGA
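
The gene sequence is displayed in space-separated blocks of ten bases. A minimal sketch of how such a listing might be joined into one string and its GC fraction computed is shown here; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as input, so the printed percentage is not expected to reproduce the whole-gene GC content field.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (six blocks of ten bases).
raw = "ATGGTGGGCT CGACGGTATC CGAAACGATC TTCTTTCTGG ACCGAACGGG CCATGCGGGC"
seq = clean_sequence(raw)

# Print the length and the rounded GC percentage of this fragment.
print(len(seq), round(100 * gc_fraction(seq)))
```

Pasting the full listing into `raw` (line breaks included) would give the figure for the whole gene.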
 
Protein sequence
MVGSTVSETI FFLDRTGHAG LQAQIRETIV SAVLSGRLAA GARLPSSRKL AAYLNISRIT 
VTLAYQELAS QGYIEAEKRS GYRVAGKPPT TGIAIGAAHT EADTIDWSAK LCSTFIVAKQ
MRKPLDWRAY PYPFLYGQMD PSLFDLTAWR DCARRALGRE DFELMAGDFA ASDDVQLVNY
ICSRTLPGRG IRASPDEILV TVGAQNALWI VIQLLLRQGS HAVCENPCHP DMSASLRLSG
AEVTVIDVDK EGLPPTALPD KVDAVFVTPS HHSPTGATMP LDRRAALLEA AASKDFIIVE
DDYEFEMSFL APPSPALKAF DRSGRVFYIG SFSKSLFPGL RLGYLVAPAA VIREARALRA
LMLRHPPGHL QRTAAYFLAL GHYDAVLHRM REEYHRRHII MAAALRDAGL RIAGSAAFGG
TSFWIEGPDD LDADLLMNAL RLEGVLIESG SPFFPKDDGP CRFFRMGYSS IARSRIADGV
ALTAARIADR A
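
The relationship between the two sequences can be illustrated by translating the start of the gene. This is a rough sketch using the codon assignments of NCBI translation table 11, which match the standard code for internal codons; it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that does not matter here), and `translate` is a hypothetical helper, not part of any library.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored so block-formatted listings can be pasted directly.
    Ambiguous bases (e.g. N) are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGTGGGCT CGACGGTATC CGAAACGATC"))  # MVGSTVSETI
```

The output matches the first ten residues of the protein sequence listed above.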