Gene Smed_4096 details


Gene Information

Locus tag: Smed_4096
Symbol:
ID: 5318911
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 558595
End bp: 560019
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 63%
IMG OID: 640775903
Product: GntR family transcriptional regulator
Protein accession: YP_001312836
Protein GI: 150376240
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
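
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(558595, 560019)
print(length, protein_length(length))  # 1425 474
```

Under these assumptions the result agrees with the gene length of 1425 bp and the protein length of 474 aa listed in the record.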


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAGAG AGCCGGTACA GATGCAGCGG AATGCGGCGA GTTCGCTGGG CGAGAGCCTC 
ATTGCCAAGG TCATGGGTAC GGTGCAGCAG CGGATTGCGG CACGGAGTTT GACGCCGGGT
TCCCGGCTCC CTTCCATTCG CTCTTTCGCC TTGTCGATGA AAGTCTCAAA GTCCACCGTC
GTCGAGGCCT ATGAACGCCT TCAGGCCGAA GGCCTGATTC GTTCGCGGCC GGGCTCCGGT
TTTTACGTAG CGGCGCCGTT AGCGCCGCTG ACGCTCGCCG AGATCGGTCC GCCTGTCGAC
CGGGCCGTCG ATCCCTTATG GATCTCGCGC CAGGCGCTGG AGCCGGGGGA GGGCGTGCTC
CGGCCCGGCT GCGGTTGGCT GTCGCCGTCC TGGATGCCGG AGGAAGGATT GCGACGCGCC
TTGCGCGGCA TTGCCCGCGG GAGCAGCGCG ACGCTCGTTG ATTATGGTGC TCCGCTTGGA
TTGCAGCCCT TGCGCCAGCT TCTGGCACGG CGCGTGGCGC AGCACGGCAT CGAGGCATCG
CCGGACCAGA TATTGCTCAC CGAATCCGGC ACGCAGGCAA TCGATTTGCT TTGCCGCTTC
CTGCTGAAGC CCGGAGATGC GGTGCTCGTA GACGACCCCT GCTATTTCAA TTTTCACGCA
CTCCTCAGGG CGCATCAGGC AAGAATCGTC GGCGTTCCAT ACACGCCCAG CGGACCGGAT
ATAGGCCGCT TCGCCGAAGC GCTCACTGAG CACAAGCCGC GACTTTACAT TACCAATTCA
GCCCTTCACA ATCCGACCGG CGCAACCCTG TCGCCGCTCG TCGGCCATCG CCTGCTCAAG
CTCGCCGAAC AATCCGGGTT GACGATCATC GAGGATGATA TCTTCGCCGA TTTTGAAGAA
GCGCCCGCGC CGCGTCTTGC GGCCTTCGAC GGCCTCGAAC GGGTGGTGCA CATCGGCAGC
TTCTCGAAAA CGCTCTCGGC GGCCGTTCGG TGCGGTTTCA TCGTGGCGCC TCGTGATTGG
GTGGAGGCCT TGACCGACCT GAAGATCGCG ACGTGCTTCG GTGCAGCCGG GTTCTCTTCG
GAATTGGTGA TGGCGCTCTT GAAGGACGGC AGCTATCGCA AACATCTTGA GGCGGTGCGC
CAGCGCCTCG CTAAGGCGAT GGCCGACGTT GCGGAAAGGC TCGCCCGGAT CGGCATCACG
CCCTGGATAG AACCGCAGGC CGGCATGTTC CTCTGGTGCC GCCTGCCGGA TGGCATAGAC
GCCGCCAAAC TCGCAAGGGA ATCTCTCGGC AAAGGCATCG TGCTTGCACC AGGCAATGTC
TTCAGTCACG CACAAACGGC TGCCGGCTTT TTGCGCTTCA ACGTCGCCCA GTCGCAAGAC
GATCGGCTCT TCCGGGAGCT CGAAGCGCTG ATGGCCCTTG CATGA
 
Protein sequence
MEREPVQMQR NAASSLGESL IAKVMGTVQQ RIAARSLTPG SRLPSIRSFA LSMKVSKSTV 
VEAYERLQAE GLIRSRPGSG FYVAAPLAPL TLAEIGPPVD RAVDPLWISR QALEPGEGVL
RPGCGWLSPS WMPEEGLRRA LRGIARGSSA TLVDYGAPLG LQPLRQLLAR RVAQHGIEAS
PDQILLTESG TQAIDLLCRF LLKPGDAVLV DDPCYFNFHA LLRAHQARIV GVPYTPSGPD
IGRFAEALTE HKPRLYITNS ALHNPTGATL SPLVGHRLLK LAEQSGLTII EDDIFADFEE
APAPRLAAFD GLERVVHIGS FSKTLSAAVR CGFIVAPRDW VEALTDLKIA TCFGAAGFSS
ELVMALLKDG SYRKHLEAVR QRLAKAMADV AERLARIGIT PWIEPQAGMF LWCRLPDGID
AAKLARESLG KGIVLAPGNV FSHAQTAAGF LRFNVAQSQD DRLFRELEAL MALA
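
The sequences in this record are displayed in blocks of ten characters, so the spacing has to be removed before any computation. The following is a rough sketch of how the block formatting could be stripped and a GC percentage computed; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first line of the gene sequence, so its result is not the whole-gene GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, used only as a short demo.
first_line = "ATGGAGAGAG AGCCGGTACA GATGCAGCGG AATGCGGCGA GTTCGCTGGG CGAGAGCCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` would allow the cleaned length and GC percentage to be compared with the values listed under Gene Information.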