Gene Smed_0641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0641 
Symbol 
ID5321477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp689473 
End bp691791 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content63% 
IMG OID640789577 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001326332 
Protein GI150395865 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.544021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGACG CGCGACGGGG AAGCGCAGCC GAAGGGCGGT TGCGACTTAA ATTCCCGGGC 
CTTTCGGGCC TCGAACGAGA GTTCATCGAG CAGGCAAAGA TCTTTGCCGA ACCGGCATCG
CAGAAGCTCG CTATGGCCGA GCCCATGCTC AAGCGATCCA TTCCGGTCCT CATCATCGCT
TTTCTTCTCA TCGTGGCGAT TTCACGAATG TCGGGAATGA TGGACGAGCA TGCTCGCATG
GAGGCCTCCT CCCGCCGCAC CGTCAGTCTT ACGGCCGCAG CAGCAGCCGG CGCTTTGCAA
CCGGACGGGC CGGAGATCTT CACGGACGTC CGCCGCTGGG AGATCGAACA GCGGCTCACG
GAATTTCTGC CCGCCGACAT GCTGGATGCC GGGATCATCC TGCTGACGGT CCGCAGGGAC
GGTCGCGTCT TTGCGAGCAC GCAGGATGGC GCCCACCTCG TTGGGCGCAC GCTCTCCGCA
GTGGCCCCCG AGGTTGCCAC GCTGCAATAT TACGGCGAGG GGGCCGGGAT CGTGAGAGCG
TCGGTTGATG GTGTCGATCA CTTCATTGCG CTGCGGCAGC TGCCCCACTC CGGCGGCGCG
GTTCTGGCGG CAACGTCGAT TGCCTCGTTC GAGGCAGCTT GGCGTGATGA AATTTCTCTC
AATGTCACGC TGTTTGCCGG CGTTTCTGCG ATCCTGATGG TCGTCCTTTA CGCCTACTAC
ATCCAGGCGA AACGCGCCCG CGACGCCGAT GCGATCTTTG CCGAATCGAA CCTGCGCGTC
GAAACGGCGC TCTCCCGCGG GCGATGCGGT CTCTGGGACT TCGACCTCGA CAACAGGCGC
CTCTTCTGGT CACGTTCAAT GTACGAAATG CTCGGCATGC CGGGCGATGC GAGCGTGCTG
TCGTTCGGCG ACGCCGCCCG GCTCATGCAT GTAGACGACC GGGGCATATA CCGCGTCGCG
CGGGCGATCG CGAGGGGCAG CGAGCGCCAG ATCGATCAGG TATTCCGCAT GCGCCATGCG
GACGGCCATT ATGTATGGCT TCGGGCCCGC GCGCAGGTGA TCCGTACCGT CTCAGGCCGG
ACGCACCTCA TCGGCATCGC GATGGACGTG ACCGAACAGC ATAGGCTCGC CCAGCGCTAC
GCCGAAGCCG ATCAGCGCCT TGCGGACGCC ATCGAGTGCA CGTCGGAGGC CTTCGTACTC
TGGGACAAGC ACGATCGCCT GGTCATGTGC AACACTCACT TCCAGCAGGC CTGGCAGTTG
CCCGACCACG TGCTCGTGCC CGGCACCGAA CGCACGATCG TCCAAGCGGC GGCGGCGCGG
CCGGTCGTCG AACGGCGCAT CGCCGATCCG GACCGGAGCA ACCACTCCCA GACGAGCGAG
GTGCAGCTTG CCGACGAACG CTGGCTGCAG ATCAATGAGC GGCGCACGCG CGACGGCGGC
CTCGTCTCGG TGGGCACGGA CATCACGCTC CTCAAGCGGC ACCAAGTGCG GCTACGCGAA
TCGGAGCGGC GGCTGATGGC CACCATCGGC GATCTCTCCG CCTCGCGCAT AACGCTCGAG
CAGCAAAAGG CCGAACTCTC CGTCGCCAAC GCGAATTACC AGGCGGAGAA AGAGCGCGCC
GAGGCCGCGA ACCGGGCGAA GTCCGAGTTC CTGGCCAACA TGTCGCACGA GCTGAGAACG
CCTCTGAACG CTATCCTCGG CTTCTCCGAA ATCCTGCAGG ACCAGATGTT CGGACCGCTC
GGCTCGGAGA AGTACCACGA ATATTCCCGC GACATTTTCG AAAGCGGCAA GCACCTGCTC
AACGTCATCA ACGACATTCT CGACATGTCG AAGATCGAGG CAGGCCACAT GCGCATCACG
CGCGAGAGGA TAGATCTCGC GCCGCTTATC GAGGAGACGC TCCGTTTCAC CACAATTCCG
GCGGAACAGA AGAACATCCG CGTCGTCCAG CAGGTATCTT CCGGTCTGAC GATGTTCGCA
GACCGCCGGG CGATGAAGCA GGTCCTGCTC AACCTGCTGT CCAATGCCGT CAAGTTCACC
AATGAGGGGG GGCGCATTTC GCTCCGGGCC CGAAAGGTCA GAGGCGCGGT CACCCTCACC
ATCGCCGATT CCGGGATCGG TATTCCCAGG GATGCCTTGC AAAAGATCGG CCAGCCATTC
GAACAGGTGC AGAGCCAATA TGCCAAGAGC AAGGGCGGTT CGGGGCTCGG GCTTGCCATC
TCCCGCTCGC TTACCCGCCT TCACGGCGGC AGCATAAAGA TCCACTCGAC AGAGAATGTC
GGCACCATCA TTTCGGTCAG AATCCCCGAC CGCGCCTGA
 
Protein sequence
MADARRGSAA EGRLRLKFPG LSGLEREFIE QAKIFAEPAS QKLAMAEPML KRSIPVLIIA 
FLLIVAISRM SGMMDEHARM EASSRRTVSL TAAAAAGALQ PDGPEIFTDV RRWEIEQRLT
EFLPADMLDA GIILLTVRRD GRVFASTQDG AHLVGRTLSA VAPEVATLQY YGEGAGIVRA
SVDGVDHFIA LRQLPHSGGA VLAATSIASF EAAWRDEISL NVTLFAGVSA ILMVVLYAYY
IQAKRARDAD AIFAESNLRV ETALSRGRCG LWDFDLDNRR LFWSRSMYEM LGMPGDASVL
SFGDAARLMH VDDRGIYRVA RAIARGSERQ IDQVFRMRHA DGHYVWLRAR AQVIRTVSGR
THLIGIAMDV TEQHRLAQRY AEADQRLADA IECTSEAFVL WDKHDRLVMC NTHFQQAWQL
PDHVLVPGTE RTIVQAAAAR PVVERRIADP DRSNHSQTSE VQLADERWLQ INERRTRDGG
LVSVGTDITL LKRHQVRLRE SERRLMATIG DLSASRITLE QQKAELSVAN ANYQAEKERA
EAANRAKSEF LANMSHELRT PLNAILGFSE ILQDQMFGPL GSEKYHEYSR DIFESGKHLL
NVINDILDMS KIEAGHMRIT RERIDLAPLI EETLRFTTIP AEQKNIRVVQ QVSSGLTMFA
DRRAMKQVLL NLLSNAVKFT NEGGRISLRA RKVRGAVTLT IADSGIGIPR DALQKIGQPF
EQVQSQYAKS KGGSGLGLAI SRSLTRLHGG SIKIHSTENV GTIISVRIPD RA