Gene Smed_3043 details

Gene Information

Locus tag: Smed_3043
Symbol:
ID: 5323921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3192219
End bp: 3193196
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 61%
IMG OID: 640791992
Product: DeoR family transcriptional regulator
Protein accession: YP_001328704
Protein GI: 150398237
COG category: [K] Transcription
COG ID: [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain
TIGRFAM ID:
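
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. The following is a minimal sketch in Python, assuming inclusive 1-based coordinates and a final stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3192219, 3193196)
print(length_bp)                  # 978
print(protein_length(length_bp))  # 325
```

Under these assumptions, 978 bp corresponds to 326 codons, of which 325 encode residues and one is the stop codon.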


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGTCGGAAG ATGATGACGC GCTGATGGTG CGGGCGGCCT GGTTTTACTA TGTCGGCGGC 
CTCAACCAGG AAATGACCGC CGCCCGGCTG GGCCTCACCC GTGCCAGGGT CAATAAAATG
CTGGCCGAGG CGCGCGAGAG CGGGCTTGTC AGCATTTCGA TCGACCATCG GCGGGTCGGT
GTCCTGCCGC TCGAGGACAG GCTGCGCACG CGCTTCGGGC TTGATTTCTG CATCTCCACA
CCGGCCTTCG GGTTTCACGA TACGTCCAAG CAAGACAGCG AAGTGCGCAA GCAGATCGCC
TTCCGAGCCG TGGGCGTGGC TGCCGCCAAT CACTTGAAAA CATTGCTTTC CGAGAACGAT
TCCCTGACGG TCGGGACCGG TTGGGGCCGA ACGATCGAAC AGATGACCTT GCATCTGGCC
GGCGTTCGGG CGCCGCATGC GCGCTTCATC TCGATCATGG GGTCGTTGAC GGCAAACAAT
GCCTATAACC CGTTCGAAGT CGTGCACAGC CTCGCGAGAC GCACCGGCGG CGAAGGTTAT
TTTCTTCCGG TGCCCTTCAT CGCCGACTCG GTAGATGACA AAAAGGTCCT CATCTCACAA
CGTTCCGTGG TCAAGGCATT GGAAATCGCC CGCAGTGCTT CCGTCTGCTT CATCAGCGCG
GGTGAATTGA CGGAGGAATC GCTTTTACGG CGCCAGGGCA TGATCAGCGG CACCGAACTC
GAAAGCCTGC GGCAGGCCGG CGCCGTCGGC GACACCAACG GCATTTTCTT CGACAGCGAA
GGGAGGCAGG TCGACCATGA GCTCAACGAA CGGACCATCG CACTGGGTTT CGAAGAGCTG
AAGGCTCTGC CGGTGCTCCT GCTGATCGCC GGCCTGGAGA AAATCCAGGC GGCCCGGGCC
CTGCTTCGCA GCGGCGTCGT CAACGGTCTG ATCATAGACG GCGATGCGGC CGAGGCGTTG
GCAGCGTTGG GCGAATAG
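
The GC content reported for this gene can be recomputed from a sequence listing like the one above. The following is a minimal sketch in Python, assuming the value is the percentage of G and C bases over the full sequence after the spaces between 10-base blocks are removed. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence (60 bases); pass the whole
# listing instead to check the gene-wide figure.
first_row = "ATGTCGGAAG ATGATGACGC GCTGATGGTG CGGGCGGCCT GGTTTTACTA TGTCGGCGGC"
print(round(gc_content(first_row)))
```

Because whitespace is stripped first, the blocked layout of the listing can be pasted in unchanged.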
 
Protein sequence
MSEDDDALMV RAAWFYYVGG LNQEMTAARL GLTRARVNKM LAEARESGLV SISIDHRRVG 
VLPLEDRLRT RFGLDFCIST PAFGFHDTSK QDSEVRKQIA FRAVGVAAAN HLKTLLSEND
SLTVGTGWGR TIEQMTLHLA GVRAPHARFI SIMGSLTANN AYNPFEVVHS LARRTGGEGY
FLPVPFIADS VDDKKVLISQ RSVVKALEIA RSASVCFISA GELTEESLLR RQGMISGTEL
ESLRQAGAVG DTNGIFFDSE GRQVDHELNE RTIALGFEEL KALPVLLLIA GLEKIQAARA
LLRSGVVNGL IIDGDAAEAL AALGE
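
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in Python, assuming the gene sequence is given in the coding orientation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCGGAAG ATGATGACGC GCTGATGGTG"))  # MSEDDDALMV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` allows the same comparison over the whole protein.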