Gene Smed_4278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4278 
Symbol 
ID5319040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp767954 
End bp771037 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content59% 
IMG OID640776083 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001313016 
Protein GI150376420 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG1352] Methylase of chemotaxis methyl-accepting proteins
[COG2201] Chemotaxis response regulator containing a CheY-like receiver domain and a methylesterase domain
[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATACG AAGATAACAG ATTCCCGATT GTTGGCATCG GCGCGTCGGC CGGTGGTATT 
CCTGCAATGC AGGGGTTCTT CAAGGGTCTT CCGGTCAATT CCAACATGGC CTTCGTGATC
GTGACGCATC TCAGCCCCGA GCATGAGAGC CAACTTCACG AGGTGGTCGC GCGCTACACA
GACTTGCCGG TGTTGGTGGC GGAGGATGGG ATGGCGGCCC AGCCAAATCA TGTCTATGTC
ATGCCGCAGA ATGCCATCCT CTCGATCGCA GGCGGGCACT TGCGTCTGCG CAAGCCCGAC
GCCGCCACGC GGGAGCGCAA GCCGATCGAT ATCTTTCTGA GCACGCTGGC GGAAGATCAA
GGCGAGCAGG CGGTCGGTAT CATTCTTTCC GGCGGCGACA GCGATGGAAC GCTGGGGGCG
AAGGCGATCA AGGAGCATGG AGGGCTGACA CTCGCGCAGG CGAGCGACGG GTCAGGTCCG
CGTAACCCGG ACATGCCGCA AAGCGCAATC TCCAGCGGCG TGATCGACCT GGCGGTTACA
GCCGAAGAGA TGGGCGAGAA GCTTATCGCC TATGCCAACG GTTTCATGGC TGCCCAAGAC
GTTGCGACAG AGGAGACGCC CGACGCCCGG GAAGCTCGCT CGGAGATCTA TGGACTTCTC
CGCAGGCATT CGGGACATGA TTTCTCAGGG TATAAATCAA AAACGTTTCT ACGCCGCGTA
AGAAGACGCA TGCAGATTCT TCAGGTTCGA GCGCTGCCGG ATTACTTGAA GCTGCTCAGG
CGCGACCCCT CAGAGGTGAC GAACCTCTTC CGCGACCTAT TGATCAACGT CACCAATTTC
TTCCGCGACC AGGATGCGTT CAAGGCACTT GAGCAACTGG TGATACCGCG CCTGTTCGAA
GGCCGAAAAG CGGGCGAGCC CGTGCGTGTA TGGGTTCCAG GCTGCGCGAC GGGGGAAGAA
GTCTATTCCC TTGGCATCCT CATGCGCGAA CGCATGGAGA AGCTGGCGGA CATGCCGCGG
GTTCAGATTT TCGCCACAGA CATTGATGAA TCCGCTCTCG GTGTCGCCCG GGCCGGCCGT
TACCCTGAAG CGCTGCTGCA AGGCCTGTCA CCTGAGCGCA TAGCAAGACA CTTCAATTGC
GACGGCGCCA GCTTCGTGAT CAGCAGTGAT GTGCGCGAGC TCTGCATCTT CTCCCCGCAT
AGCGTTACGC GCGATCCGCC GTTCTCGCGC ATGGATCTGG TATCCTGCCG CAATCTGCTG
ATCTATCTGG ATCAAGAGAC GCAGAAACGC GTCATCTCCA CATTCCATTA TGCGCTGAAG
CCGGGTGGCT ATCTTTTTCT CGGGACGTCC GAGAGTATCG GTCAGCACGG GGAGCATTTC
TCGACCATTG ACAAGAAGAA TCGCATTTTC CAGGCGCGGG GGCATGATGC AGTGCCCCGT
GTTCCAGCCC TCGTTGGTCC TCCCAAAACG GTGAACACTG CCGATGGCCG TATCCACGGT
CTCAGGATCG GCACCTATCC GCTGCGGCAG GTGGTCGAAG CGCATGTTCT CGAGCGCTTT
GCCCCCGCGC ATGTCGTCGT CAACGCCGAC GGCGAGGCGG TCTACTACTC CGCTCGCACG
GGAAAATACC TCGAAGTTCC CCAGGGCGCC CCGAGCCGGC AGATTCTTAC GACTGCAAGA
CGGGGCCTGC GGCTCGACTT GCGGGCGGCG CTGCGCGAGG CCGCCTCGGC ACGAAAAGTT
GTGGTACGCG AAAATGCTTT GCTGGAGGAC GACGACGATC GGGTGCAACG GGTTAATCTA
ACCGTCGAGC CATTGGCGGA CGGCGCCGGT GGCGAGCCTC TTTATCTGGT CGTGTTCGAT
CCCATCGGGC CTCTACAAAG CCGGGCGGAG GCCGAGCATT CCGGATATGA CGCGGATACG
GCGGCTATCC TGGAGAGTGA GCTTCGCGAG ACGCGCGAGC GACTGCAATC CACCATCGAA
GAATACGAGA CGGCTCTTGA AGAGCTCAAG TCCTCTCACG AAGAGCTTGT TTCCGTCAAC
GAGGAGGCGC AGTCAACCAA TGAGGAGCTG GAAGCCTCCA AGGAGGAAAT GCAGTCGCTC
AACGAGGAAC TGAGCACGAT CAATGCGGAA CTGACCGCAA AGGTCGAGGA ACTGGACCGT
GCGAACAGCG ACCTGAAGAA CCTGTTCGAG AGCATGCAGA TCGCAACGGT GTTTCTCGAC
CGCAACCTCG TTATCCGCAA TTTCACACCG GCTGCCTCTT CGTTCTTCAA CATCCGGCCG
TCCGACGTGG GGCGGCCCCT TACCGAGCTT TCAAGCAAGC TCGACTATCC GGAACTCAAG
GAACAGATCG CTTCGGTCTT CAAGACCGGA GAAGCCGTGG AACACCATCT TGCCCGCGAT
CAGAACGGAA AGCACTTTCT TGTACGCCTG ATCCCCTATC GCGACGACGC AGCACGCGTC
GATGGTGTCG TGGTCACGCT TGTCGATGTG ACGAAGATGG CCGAGGCCGA GGCGCACCAG
CAGATGCTGG TCTCGGAACT CAATCACCGG GTCAAGAACA TGCTTGCAGT GGTGATCAGC
ATCGCCAATC ATAGCATGCA GACCGCCCGG TCACCGAACG AGTTCAATCA GGCGCTGATC
GGCCGTCTGC AGGCGATGGG GCGGGCTTAT GGACTGTTGA CCGAGACGCA CTGGACCGCT
GCTCCCGTCG ATCACCTCAT CAGGCAGGAA ATCGAAGCCT TCGGCACAGG TCGCTTCGAA
GTGAACGGGC CTGACATCCA CCTCGAACCT CAACAGGGGC TTTCTATAGG CATGGTCATA
CACGAACTGG CGACCAATGC GTCTAAATAC GGAGCCCTGA GCAAATCCGA GGGGAAGGTA
CTGGTTGGCT GGCAATCGGC GAATGGCGTC TTCCGACTCA CCTGGCAGGA AAGTGACGGT
CCGCCGGTGA GTGAACCGGA GCGGGAGGGC TTCGGCCTGT CTCTGCTCAA AGGAGAAATA
GGCTACAGGC TCGATGGCGA GGTGGAAACG TTCTTCCGTC CGGAAGGTCT GTTCGTGCGC
ATAGCTTTTC CGTTCGAAAG GTAG
 
Protein sequence
MTYEDNRFPI VGIGASAGGI PAMQGFFKGL PVNSNMAFVI VTHLSPEHES QLHEVVARYT 
DLPVLVAEDG MAAQPNHVYV MPQNAILSIA GGHLRLRKPD AATRERKPID IFLSTLAEDQ
GEQAVGIILS GGDSDGTLGA KAIKEHGGLT LAQASDGSGP RNPDMPQSAI SSGVIDLAVT
AEEMGEKLIA YANGFMAAQD VATEETPDAR EARSEIYGLL RRHSGHDFSG YKSKTFLRRV
RRRMQILQVR ALPDYLKLLR RDPSEVTNLF RDLLINVTNF FRDQDAFKAL EQLVIPRLFE
GRKAGEPVRV WVPGCATGEE VYSLGILMRE RMEKLADMPR VQIFATDIDE SALGVARAGR
YPEALLQGLS PERIARHFNC DGASFVISSD VRELCIFSPH SVTRDPPFSR MDLVSCRNLL
IYLDQETQKR VISTFHYALK PGGYLFLGTS ESIGQHGEHF STIDKKNRIF QARGHDAVPR
VPALVGPPKT VNTADGRIHG LRIGTYPLRQ VVEAHVLERF APAHVVVNAD GEAVYYSART
GKYLEVPQGA PSRQILTTAR RGLRLDLRAA LREAASARKV VVRENALLED DDDRVQRVNL
TVEPLADGAG GEPLYLVVFD PIGPLQSRAE AEHSGYDADT AAILESELRE TRERLQSTIE
EYETALEELK SSHEELVSVN EEAQSTNEEL EASKEEMQSL NEELSTINAE LTAKVEELDR
ANSDLKNLFE SMQIATVFLD RNLVIRNFTP AASSFFNIRP SDVGRPLTEL SSKLDYPELK
EQIASVFKTG EAVEHHLARD QNGKHFLVRL IPYRDDAARV DGVVVTLVDV TKMAEAEAHQ
QMLVSELNHR VKNMLAVVIS IANHSMQTAR SPNEFNQALI GRLQAMGRAY GLLTETHWTA
APVDHLIRQE IEAFGTGRFE VNGPDIHLEP QQGLSIGMVI HELATNASKY GALSKSEGKV
LVGWQSANGV FRLTWQESDG PPVSEPEREG FGLSLLKGEI GYRLDGEVET FFRPEGLFVR
IAFPFER