Gene Smed_3680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3680 
Symbol 
ID5318787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp118089 
End bp120449 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content59% 
IMG OID640775493 
Productstage II sporulation E family protein 
Protein accessionYP_001312426 
Protein GI150375830 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0105855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGA TGGGCGAAAA CACCGCTCGT TTGGGCTTGC GTTCGTTCCG CGCCAAGTTC 
GTTCTGGTCG TCGGAGGCGC CGTGCTGTTC GATCTGCTCG TCACCGGCGG TCTTGCGCTC
TGGAATGTAC AAAGGCTCTC GCGCGATGCA GCAGCGGAAG TCGGGCGGGG CCTGGAGGAG
GCGAACCAGG ATTATATCCG CTCCTATGCG GAAACGACGG CCGCCCGCGT CAATCTTCTG
CTCAACCAGG TTCATTCGGA TGTAGGCACA CTGGCGGGGG TTCTCCAGGA GCAAATCGAC
CGGCCGGTGC GAAAATCGCA AATAGGGGAA GCAATGGCGC GTGGTGCGCC GGGAACCGTG
GCGGTTGCCT ACGACGAGAA AGGCGGGTGG GCTCAAAATC TGCCAGGTCC GCCGTCGGTC
GTCAGCGTCT GGGGGTATCT TCTCGATCAA AATCACTTTC CGCTTCCCGA TGTGCAGACC
GACATCGAGA ACAGCGCGGT CCTGGACCTC GTCGCGCCGA GCCTTCTGGC GAATGGCCAG
TCGAAGCTGC AGATGTACTA CATCGGCCCA AAGGAGCGGC CGATCTTCAG GACGGCACCC
TATACGGATC AGGCTCAGAC TTTCGACAAA CTGTATCCGG GACACAATGC GGCGGAGTTC
TGGTCTTTTT TCTTTCCGGG TCTTTACGAA TCCTGGGAGC AATGGGCGCG TGAGCCGGAG
TCACGTCCTG TGAATGACGA CATTACCCAG ACGGCGCCCT ATACCGACGC GATAACCGGC
AAGCTCATCG TCAGTTTCTT CCAGCCGCTA TGGTCGCGCG ACCGCAGCCG GGTGCTAGGC
GCTGCCGGCA CCGATATCAC GCTCGACCAA CTGGCCGAGA TCGTCGAGAA CGTCAAAGTG
GCGGAAACCG GTTTCGGTTT CCTGACGATG TCGGACGGCA ACGTTGTTGC CATTAATCCG
GTCGGAGAAA AGGTTATTGG CCTGCGGGCG TCGAACGCTG CCGCCAGCGG CGGCGTGACA
GGACTGGATC GCTCCTTGCG ACGGAGCATT CAGCCGGCGA TTGCGCAGTT GCCCCTGGAA
GAGGACGGAC TGCTCAAGCA CATACTGCTC GAGGAGAACG GCGAGAGGGT GCCGTACCTG
GTCGTCCTAA AACACTTGCA GCCAACCAAC CTCTGGGCGT CGGGCCCGGT CAGCCGCGAA
GCCATGTTGC TCGGAATCGC CGTGCCGGAG CGGGAAATCT ACGCTTCGCT TTTCGCCGCC
CAGGCCGGGA TCTCGAAGGC GACGAACAGA ATCCTGATCT ATCAGGTCCT CGCGCTTCTC
GTATCCCTCT TCTTCGTCAT CGCCGCCGTA TTCGCAATTT CCAAGCGCAT CACCGGCGGC
ATAAGCGCGC TCGCCAATGC CGCCAAGCGC ATTGAGGCCA AGGACTATTC AGTAAGGGTC
GACATTCCGA CGCGCGACGA GGTCGGCGAG GCCGGCGCAG CCTTCAATCG TATGGCCGAA
GAGATCAGCT TTCACACAGA GAACCTCGAG CAGCTGGTCG AGGACAGGAC GAAGGAGATC
GAAGAGGCGA ACCTCCAGAT CTCAGCTCTG AACCAGCAGT TGCGCAGCGA AAACTTGCGC
CTGGGCGCGG AACTCGCAGT CGCCCGACAT ATCCAGATGA TGGTTCTGCC TAAGGCCGGC
GAACTCGAAG AAATCGCCGA GCTCGAAATC GCCGCTTATA TGCGGCCCGC CGACGAGGTT
GGAGGCGACT ACTACGACGT TCTCAGGGAC GGAAACCGAT TGAAGATTGG AATTGGCGAT
GTGACCGGGC ACGGTCTCGA GAGCGGCGTG CTGATGCTGA TGGTTCAATC CGTGGCCCGT
GGCTTGCAGG AAGTGGGCGA GATGGAGCCG GCGCAATTCC TCAACCGCGT CAACCGGGCA
ATCTACAAGA ATCTGGTCAG GACCAGCACC AACAAGCACC TCTCCCTTGC ATTTCTCGAC
TATGACGGCG CAAGGCTGAT CCTTTCTGGC CAGCACGAGG AGTTGATCGT CATCAGAGAT
ATCGAAAAGG TGGAGCGGAT CGACACCCTC GATCTCGGAC TTCCCATCGG CTTGGAGCCT
GATATCTCTC CATTCGTCGC AACGCGCGAA ATCTCCTTCG GCAGCGGCGA TATGATCGTC
CTCCATACCG ATGGCGTGAC GGAGGCGGAG AGCGGAAGCG GTGAGCTCTT CGGCATCGAA
AGGCTCTGCG AAAGCGCGCG CTGTCGCTAT GGCAGCAGCG CTGAGGAGGT CAAGTCCGGC
ATTATAGAGG ATTTAATGAC ACATATAGGC ACCCAGAAAA TTCACGACGA TATCACTCTG
GTGGTGATGA GACACAGGTG A
 
Protein sequence
MSVMGENTAR LGLRSFRAKF VLVVGGAVLF DLLVTGGLAL WNVQRLSRDA AAEVGRGLEE 
ANQDYIRSYA ETTAARVNLL LNQVHSDVGT LAGVLQEQID RPVRKSQIGE AMARGAPGTV
AVAYDEKGGW AQNLPGPPSV VSVWGYLLDQ NHFPLPDVQT DIENSAVLDL VAPSLLANGQ
SKLQMYYIGP KERPIFRTAP YTDQAQTFDK LYPGHNAAEF WSFFFPGLYE SWEQWAREPE
SRPVNDDITQ TAPYTDAITG KLIVSFFQPL WSRDRSRVLG AAGTDITLDQ LAEIVENVKV
AETGFGFLTM SDGNVVAINP VGEKVIGLRA SNAAASGGVT GLDRSLRRSI QPAIAQLPLE
EDGLLKHILL EENGERVPYL VVLKHLQPTN LWASGPVSRE AMLLGIAVPE REIYASLFAA
QAGISKATNR ILIYQVLALL VSLFFVIAAV FAISKRITGG ISALANAAKR IEAKDYSVRV
DIPTRDEVGE AGAAFNRMAE EISFHTENLE QLVEDRTKEI EEANLQISAL NQQLRSENLR
LGAELAVARH IQMMVLPKAG ELEEIAELEI AAYMRPADEV GGDYYDVLRD GNRLKIGIGD
VTGHGLESGV LMLMVQSVAR GLQEVGEMEP AQFLNRVNRA IYKNLVRTST NKHLSLAFLD
YDGARLILSG QHEELIVIRD IEKVERIDTL DLGLPIGLEP DISPFVATRE ISFGSGDMIV
LHTDGVTEAE SGSGELFGIE RLCESARCRY GSSAEEVKSG IIEDLMTHIG TQKIHDDITL
VVMRHR