Gene Smed_4180 details


Gene Information

Locus tag: Smed_4180
Symbol:
ID: 5319346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 657612
End bp: 660992
Gene Length: 3381 bp
Protein Length: 1126 aa
Translation table: 11
GC content: 63%
IMG OID: 640775985
Product: integral membrane sensor hybrid histidine kinase
Protein accession: YP_001312918
Protein GI: 150376322
COG category: [F] Nucleotide transport and metabolism
              [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
        [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID:
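
The coordinate and length fields above are consistent with each other, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon. A minimal sketch of that arithmetic (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(657612, 660992)
print(length)                     # 3381
print(protein_length_aa(length))  # 1126
```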


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0796198
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCAC GCCAGCGCAT CATTCCCGTT CGAAGAGAAT ATAATCGCTG GGTCGCCGAC 
CAGACGCTCG AAGACTATGC GCTGCGCTTT ACCGCAAAGA GCGCGCGGCG CTTCTCCTCC
GCGCGCATTT CACAGACGGC GATCGGCGCG ATCTCATTCC TGGCGCTGGA AGCTATCGGC
GGCGCCATCA CCATGTCCTA CGGCACCACC AACGCGATCG TCGCGATCCT CGTCGCAAGC
CTGATGATCC TTATCGTCGG CCTGCCGATC AGCCGCTATG CCATCCGCCA TGGCGTCGAC
ATAGACCTTC TGACGCGCGG CGCGAGCTTC GGCTATATCG GCTCGACGAT AACCTCGCTC
ATCTATGCGA GCTTCACCTT CATCCTCTTC GCAATAGAAG CCTCGATCAT GTCCGGCGCG
CTGGAGCTGG CGCTCGGGGT CCCGCTCTGG GCCGGCTACA TCGTCAGCGC CGTCGTGGTG
ATCCCGCTGG TCACCCATGG CGTCCGACTG ATCAGCCGGT TTCAGCTCGT CACGCAGCCC
TTCTGGATCG TGTTGAACAT CCTTCCCTTC GCCTTCATCG CGCTCGCGGA CTGGGAGAAG
GTGGGCCTGT GGCTCGCCTA TGCGGGCATT CATCACACTT CGGGTCCCTC CGGCACCGTC
GCGCCCTTCG ATCTCATCGA GTTCGGCGCG GCTTCCGCGG TCATCTTCGC GCTGATGGCG
CAGATCGGCG AACAGGTGGA TTTCCTGCGC TTTCTGCCGC CCGACGGTCA GCGGAAGCTC
CGCCACCGCA TAGCGGTCTT TCTCGCCGGC TCCGGCTGGG TCATCGTCGG CGCGCCGAAG
CTGCTTGCCG GATCCTTCCT GGTCGTCCTG GCGCTCAGCG CCGGCGTGCC TTCCACCAGA
GCCGCAGATC CGGCGCAAAT GTACTACACG GCCTTCGGCT ATATCGTCCC ATACGAGACG
GCAGCCCTCC TGCTGATGGC AGCCTTCGTC GTCGTTTCAC AGCTCAAGAT CAACGTGATG
AACGCCTATG CCGGGTCGCT CGCCTGGTCG AACTTCTTTT CGCGCCTCAC CCACAGCCAT
CCGGGCCGGG TGATCTGGCT GGTCTTCAAC GTGGCGATCG CACTCCTCCT GATGGAACTT
GGCATCTATC GGCTTCTCGA GGAGACGCTC GGCATCTTTT CGATCATTGC CATGGCCTGG
CTCTGCACGA TCTCGGCCGA TTTGTTCGTC AACAAGCCGC TCGGCCTGGC GCCGGCCGGC
ATCGAGTTCA AACGCGCCCA TCTCTATGAC ATCAATCCCG TCGGGCTCGG CACCATGGGC
GTGTCGACGC TGTTCGCTCT GGTCGCGCAT TTCGGCGCGA TGGGCGAGAT CGCAAGCTCG
CTCGCTCCCT TTATCGCGCT CGCCACGGCC TTCCTCGTCT CGCCGCTGAT CGCCTGGTGG
ACGGAGGGGA AATTCTACCT TGCCCGCAAG CCGCGCAAGA GCTGGTTCTC CGAGAGCGAG
ATCACCTGCT CAATCTGCGA GCACCCGTTC GAACCGGAGG ACATGGCCTG GTGCCCGGCT
TATGCGGCGC CGATCTGTTC GCTCTGCTGC TCGCTCGACG GCCGCTGCCA TGACATGTGC
AAACCGAAGG CGCGACTGAG CTTTCAGATC GCGACTGTAG CCAAGGCACT GCTGCCCGAA
ACGGTCGTCG CAAAACTTGC GACGCGGCTC GGCCGCTACG CCATCGCGGC GGTGATCTCC
ATCACCGGTA TCGGAGCGAT CCTCGCGATG ATCGCGCACC AGACTACCGC CGCTTCTCCC
GAAACCGCCG CCGTGGTCGA GCGTACCGTT GCCATCGTCT TTTTTGTTTT CGCCATCCTC
GCCGGCATCG TGTCCTGGTT CTACGTGCTT GCCCATGACA GCCGTGTCGT CGCGGAAGAA
GAATCCTCGC GGCAGAACAC GGCTCTCCTG AAGGAGATAG CGGCCCACAA GAAGACCGAC
GCGGCACTGC AGGACGCGAA AGAGCGGGCG GAGGCGGCCA ACCGTGCGAA GAGCCGCTAT
GTCGTGGGGC TGAGTCACGA GTTGCGAACG CCGCTCAATG CGGTGCTCGG CTACGCCCAG
ATCCTCGAAC GCGACGAGAC GATACCGCCG TCGCGACAGA ACGCAATCAA AGCGATCAGG
CGCAGCGCCG ACCATCTCTC CGGACTGATC GATGGGCTGC TCGACATATC CAAGATCGAA
GCCGGCAAGC TGCAGGTCTA TTCCAACGAG ATCAATATCC AGGATTTCCT CGACCAGATC
GTCGACATGT TCCGTCCGCA GGCGCAGGCC AAGGGGATCA CCTTCGAACA CAGCCGCGCG
GCGGCGCTGC CGCAATATGT GCGAACGGAC GACAAGCGCC TGCGGCAGAT CCTCGTCAAT
CTGCTTTCCA ATGCCTTGAA GTTCACGGAA CGGGGCCACA TCCGCTTCGA CGTCGCTTAT
CGCAGTCAGG TCGCGAGTTT TACGGTCGAA GACAGCGGCC GCGGCATCAG CGAGAGGGAT
CTGCCGAAGA TCTTCGAACC ATTCCAGCGC GGCGAAGCGG AATATCGCAT GCCGGGCCTC
GGCCTGGGCC TGACGATCAC GCGCCTTCTC ACCCAGACGC TCGGTGGCGA AATCTCCGTC
ACCAGCGAGA GGGATAAGGG CACGGTCTTC AAGGTGCGGC TTATGCTCTC CGCAGTAGAC
CGTCCGGCCG CGCGCAAGGA CGCGGTGCGC AAGATCAGGT CCTACCAGGG ACCGCACCGC
GCGATCGTCG TGGTCGACGA CAATCCCGAG CACCGCGAGC TGATGCGCGA AGTACTGACG
CCGCTCGATT TCACCGTCAT CCCCGCGGCG AGCGGAGCGG ATTGCCTGAC GCTCATCGAA
GACACGAAAC CGGATCTGTT CCTGATCGAC ATATCCATGC CCGGAATGAA CGGCTGGGAC
TTGGTGACGC GGCTGAGGGA GGCCGGCCAG ACAGCGCCGG CAATCATGCT TTCGGCCAAT
ATCGGCGACG GTTCGGCTGC CGGCGCCTTC GGCCATAACG ATACGCTTGC CAAACCCTTC
CGTGTCCGCC AGCTCACCGA CAAGCTCGCG ATTCAGCTCG GCCTCGAATG GATCTACGAC
GACGGGCCCA AGGAAACGCG TGGACCGGAC AAGGCCGGTG CAGTCATAAG CCCCGGCAAT
CATCACATCC GGGAATTGAT CGAACTCGGC GAAATCGGCT ACGTGAGGGG CATCGAAGCG
AAACTGACGG AGATCGCCCT GAACCCGGAT AACCGAGCCT TCGCCGAGGA AGCGCGCGCC
TATGTCCAGG CATTCGACCT TTCGGGTTAC GACGCTTTTC TGAAACGCCT GCTCGCCGGG
GAGGCGAATG CCCGTGCCTG A
 
Protein sequence
MAARQRIIPV RREYNRWVAD QTLEDYALRF TAKSARRFSS ARISQTAIGA ISFLALEAIG 
GAITMSYGTT NAIVAILVAS LMILIVGLPI SRYAIRHGVD IDLLTRGASF GYIGSTITSL
IYASFTFILF AIEASIMSGA LELALGVPLW AGYIVSAVVV IPLVTHGVRL ISRFQLVTQP
FWIVLNILPF AFIALADWEK VGLWLAYAGI HHTSGPSGTV APFDLIEFGA ASAVIFALMA
QIGEQVDFLR FLPPDGQRKL RHRIAVFLAG SGWVIVGAPK LLAGSFLVVL ALSAGVPSTR
AADPAQMYYT AFGYIVPYET AALLLMAAFV VVSQLKINVM NAYAGSLAWS NFFSRLTHSH
PGRVIWLVFN VAIALLLMEL GIYRLLEETL GIFSIIAMAW LCTISADLFV NKPLGLAPAG
IEFKRAHLYD INPVGLGTMG VSTLFALVAH FGAMGEIASS LAPFIALATA FLVSPLIAWW
TEGKFYLARK PRKSWFSESE ITCSICEHPF EPEDMAWCPA YAAPICSLCC SLDGRCHDMC
KPKARLSFQI ATVAKALLPE TVVAKLATRL GRYAIAAVIS ITGIGAILAM IAHQTTAASP
ETAAVVERTV AIVFFVFAIL AGIVSWFYVL AHDSRVVAEE ESSRQNTALL KEIAAHKKTD
AALQDAKERA EAANRAKSRY VVGLSHELRT PLNAVLGYAQ ILERDETIPP SRQNAIKAIR
RSADHLSGLI DGLLDISKIE AGKLQVYSNE INIQDFLDQI VDMFRPQAQA KGITFEHSRA
AALPQYVRTD DKRLRQILVN LLSNALKFTE RGHIRFDVAY RSQVASFTVE DSGRGISERD
LPKIFEPFQR GEAEYRMPGL GLGLTITRLL TQTLGGEISV TSERDKGTVF KVRLMLSAVD
RPAARKDAVR KIRSYQGPHR AIVVVDDNPE HRELMREVLT PLDFTVIPAA SGADCLTLIE
DTKPDLFLID ISMPGMNGWD LVTRLREAGQ TAPAIMLSAN IGDGSAAGAF GHNDTLAKPF
RVRQLTDKLA IQLGLEWIYD DGPKETRGPD KAGAVISPGN HHIRELIELG EIGYVRGIEA
KLTEIALNPD NRAFAEEARA YVQAFDLSGY DAFLKRLLAG EANARA
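
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the opening codons for comparison with the protein sequence. It is an illustration under stated assumptions, not the database's own method. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not handle specially.

```python
# Standard amino acid assignments in T, C, A, G order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First display line of the gene sequence above.
first_line = "ATGGCGGCAC GCCAGCGCAT CATTCCCGTT CGAAGAGAAT ATAATCGCTG GGTCGCCGAC"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MAARQRIIPVRREYNRWVAD
print(gc_fraction(dna))
```

The translated prefix matches the first twenty residues of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence would give the figure to compare with the GC content field.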