Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4858 |
Symbol | |
ID | 5318843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1358503 |
End bp | 1359978 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776643 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001313575 |
Protein GI | 150376979 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0612909 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGGCT CGACGGTATC CGAAACGATC TTCTTTCTGG ACCGAACGGG CCATGCGGGC TTGCAGGCGC AGATCCGGGA AACCATCGTG TCCGCGGTAT TGTCCGGCCG CCTGGCAGCG GGTGCCCGGC TCCCTTCCAG CCGCAAGCTT GCGGCCTATC TGAACATTTC GCGGATCACC GTGACGCTCG CCTATCAGGA GCTCGCCTCG CAAGGCTATA TCGAGGCGGA GAAGCGCAGC GGCTACCGTG TCGCCGGCAA GCCACCGACG ACAGGCATTG CAATAGGGGC CGCGCACACT GAAGCCGATA CGATCGACTG GTCGGCCAAG TTGTGCTCGA CCTTCATAGT GGCCAAGCAG ATGCGCAAAC CCCTCGACTG GCGCGCCTAC CCCTACCCGT TCCTCTACGG CCAGATGGAT CCATCGCTGT TCGACCTCAC CGCATGGCGC GACTGCGCGC GCCGGGCGCT CGGGCGAGAG GATTTCGAAC TGATGGCGGG CGATTTCGCG GCATCGGATG ACGTCCAACT TGTCAACTAT ATCTGCTCGC GTACATTGCC GGGGCGCGGC ATTCGTGCGA GCCCCGACGA GATCCTCGTG ACGGTCGGCG CGCAAAACGC CCTGTGGATC GTCATCCAAC TGCTGTTGCG CCAGGGCTCA CACGCGGTCT GCGAGAATCC ATGTCATCCC GACATGAGCG CCTCGCTCCG CTTGAGCGGC GCCGAAGTCA CCGTCATCGA TGTCGACAAG GAAGGCCTGC CGCCCACCGC TCTTCCCGAT AAGGTCGATG CGGTATTCGT TACGCCCAGC CATCATTCTC CTACCGGGGC GACGATGCCG CTCGACCGGC GTGCTGCATT GCTCGAAGCC GCTGCCTCGA AGGACTTCAT AATCGTCGAA GACGATTATG AGTTCGAGAT GAGCTTTCTG GCGCCCCCAT CCCCGGCTCT CAAGGCCTTC GACCGCAGCG GACGCGTCTT CTATATCGGC AGTTTTTCCA AGTCGCTGTT TCCCGGTCTG CGGCTCGGTT ATCTGGTGGC GCCGGCCGCC GTGATCCGGG AGGCGCGGGC GCTTCGCGCG CTGATGCTAC GCCATCCGCC GGGGCACCTC CAGAGAACGG CCGCCTATTT CCTGGCACTC GGCCACTATG ATGCCGTGCT GCACCGCATG CGCGAGGAAT ATCATCGCCG GCACATCATC ATGGCGGCAG CACTTCGGGA TGCCGGCTTG CGGATCGCCG GGTCGGCGGC TTTCGGCGGC ACCTCCTTCT GGATCGAAGG GCCGGATGAC CTCGATGCGG ATCTGCTGAT GAACGCATTG CGGTTGGAAG GGGTGCTTAT AGAATCCGGA TCGCCGTTCT TCCCCAAGGA CGACGGTCCG TGCCGCTTCT TCCGCATGGG TTACTCGTCG ATCGCCCGCA GCCGCATAGC CGACGGTGTC GCCTTGACTG CGGCACGGAT CGCGGATCGA GCCTGA
|
Protein sequence | MVGSTVSETI FFLDRTGHAG LQAQIRETIV SAVLSGRLAA GARLPSSRKL AAYLNISRIT VTLAYQELAS QGYIEAEKRS GYRVAGKPPT TGIAIGAAHT EADTIDWSAK LCSTFIVAKQ MRKPLDWRAY PYPFLYGQMD PSLFDLTAWR DCARRALGRE DFELMAGDFA ASDDVQLVNY ICSRTLPGRG IRASPDEILV TVGAQNALWI VIQLLLRQGS HAVCENPCHP DMSASLRLSG AEVTVIDVDK EGLPPTALPD KVDAVFVTPS HHSPTGATMP LDRRAALLEA AASKDFIIVE DDYEFEMSFL APPSPALKAF DRSGRVFYIG SFSKSLFPGL RLGYLVAPAA VIREARALRA LMLRHPPGHL QRTAAYFLAL GHYDAVLHRM REEYHRRHII MAAALRDAGL RIAGSAAFGG TSFWIEGPDD LDADLLMNAL RLEGVLIESG SPFFPKDDGP CRFFRMGYSS IARSRIADGV ALTAARIADR A
|
| |