Gene Smed_3606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3606 
Symbol 
ID5318440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp34753 
End bp35757 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content61% 
IMG OID640775420 
ProductDeoR family transcriptional regulator 
Protein accessionYP_001312353 
Protein GI150375757 
COG category[K] Transcription 
COG ID[COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACG CCAAACCGAA ATCCACATCC GGCCCGCGCG AGGAAATCGT CATCGCCAGG 
CAGATGCACC AAGCCCTGGT GCTGCATTTC CTGGAAGGCT TGACGCAGGC GCAGATCGCC
GATCAGCTCG GCATCTCACA TGCGACCGTC AATCGCCTGA TCAAGCGCGG CCGACAGCTC
GGCCTCGTCG AGATAAAGAT CAAATCGCCT GTGGAGCCGT TGATCGACAT CGAGGAAAGA
TTGCTTGCGC TTGGCGGCAT CAGCCGGGCG GTGGTCGTGC CGACAGCCTC CGACAATCCG
CAGACCGCCT TGCAAGCGGT CGGCGAGGCC GCAGCAAGAC TGATGCTCGA GGAGATCGCC
GATGGCGACA CGATCTGCAT CACCGGTGGC AAAGGCGTGA GCGCCGTCGT TGCCGGTCTC
CACCCGCCGC GCCGGTACGA TATCGAGGTC ATTCCCGCGA CAGGCTGCGT TCAGGGCAAG
CACTATACCG ACGTTAATCA CGTCTCAACC CTGATGGCGG ATCGGCTCGG CGGCCATTCT
TTCCAGATCC ATGCGCCTCT TTTTGCCGAC AGCGAAGCGG AACGAAGAAT GCTGCTGGGC
ATGCGCGCAG TCGCCGACGT CTTCAAGCAG GCGCGTGAAG CAAAGATTGC CGTGGTCGGC
ATCGGCTCGA TCCTTTCGGA CGACTCCAGC TATTACGACC TGCATCCCTC CTCCAGTACC
GACCGCGCGG CGATCGAACA GTCCGGTGCA TCCTGCGAGC TGCTCGCGCA TCTCCTCGAT
GATCAAGGGC GCGTCTGCGG CTATGGCCTC AACCAGCGCC TCGTATCGCT GACGCTCTCG
GAATTCGCTT CCATCCCCAT GAAGATTGGC GTCGCAAGCG GTCCGAGCAA GGCGGGGCCG
ATCCTGAGCG TCATGCGCGG CAAACATCTG GACACACTCG TTACCGATCA GGCAACGGGC
TCGCGCATAC TCGAACTGGC CAAGGAAGTC GGAGAACATT CATGA
 
Protein sequence
MPNAKPKSTS GPREEIVIAR QMHQALVLHF LEGLTQAQIA DQLGISHATV NRLIKRGRQL 
GLVEIKIKSP VEPLIDIEER LLALGGISRA VVVPTASDNP QTALQAVGEA AARLMLEEIA
DGDTICITGG KGVSAVVAGL HPPRRYDIEV IPATGCVQGK HYTDVNHVST LMADRLGGHS
FQIHAPLFAD SEAERRMLLG MRAVADVFKQ AREAKIAVVG IGSILSDDSS YYDLHPSSST
DRAAIEQSGA SCELLAHLLD DQGRVCGYGL NQRLVSLTLS EFASIPMKIG VASGPSKAGP
ILSVMRGKHL DTLVTDQATG SRILELAKEV GEHS