Gene Smed_0215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0215 
Symbol 
ID5321047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp239780 
End bp240880 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content63% 
IMG OID640789150 
ProductHPr kinase 
Protein accessionYP_001325909 
Protein GI150395442 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.623305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCC GCGTCTCCGA TGGGGCACGA TTTTTTCTGC TTGGACAACG CAAGACTATT 
TTCGTTGAAG CGTCGCAGCA GATTTTTGAG GTCGATGATC TCACGGCCTA TCTCACCTGC
GTGCTCGCCG TGCCTGCGTC TCAGCGCCAG CTCGAAGTCG ATCTCGTCGC GCGTGGCGCC
GAACGGGCCG AGGCCCGGAG ATCGGTGCGG GAATATCTTC ACTACTGGTG CCGCCATGGT
CTGCTGGAGA TTGCCTTCGA CGCCGAGGAG GGGGAGCCGC TGCACACGCA TGTGCTTGAT
TTACAGGGAG CTGCCGCCTC GATAGCCTAT CACGACAAAG GGCTCCTCGA TCTTCTCCTG
CCCATATTCG GTCATCTGGC ATCACCGGGC CTCAAGCCTT CCGTCTCCTA TGGCGTGGCA
AGGTTCGGCA GCCAAGCGTG CATCAGCCGC AACCGTTCCC CAGGCCGAAT CGTCCGGGTT
GAAGAGGTGG CGCCGGTGCT GAAAGCGCTG CTGACGGAGG ATGTGCTGGC AAGCCTCGGC
CCTGACGTCG CGCTTCACGC CGCTCTTCTG GTCAGGAACG CAAAGGGCCT TCTGATTTGC
GGCGCGCCCG GGGCGGGCAA ATCGACGCTA ACGCTTGCGC TCCTCGAAGC AGGCTTTGCC
TGCGGCGGTG ACGATATCAC GCTGATGAGG CCAGACGGCC TGCTTCAGGG CGTGCCCTTT
GCACCCGCCC TGAAGCGCGG CTCCTGGCGC CTTCTCGAAA ACATGCGCGC TTCGGTCGAG
GCGGCGCCGG TCCACCGTCG CCTCGACAAC AGACATGTCC GCTATCTCGC GTCGATCCCC
TTCGCATCCG ACGATCCCGT CAAGCTCGGC ACTATCGTGC TTCTGCGCCG CCGCAAGGGA
CGGGCGGCAA TTGCCGCCGT CGAGCCGGCG CGGGTTCTGT CGGAACTCTT TCGCGGCGCT
TTCACTCCGG CACGCGGGCT CGGTCTGCCG CAATTCGACG CTTTGCTGAG TGCTGTCCGC
GGCGCCAGCG CCATCGAGCT ATCCTATACG CGGCTGGATG AGGCCGTAGA GATGCTGAGC
AGCCATCATG AAGGCGCGTA G
 
Protein sequence
MKFRVSDGAR FFLLGQRKTI FVEASQQIFE VDDLTAYLTC VLAVPASQRQ LEVDLVARGA 
ERAEARRSVR EYLHYWCRHG LLEIAFDAEE GEPLHTHVLD LQGAAASIAY HDKGLLDLLL
PIFGHLASPG LKPSVSYGVA RFGSQACISR NRSPGRIVRV EEVAPVLKAL LTEDVLASLG
PDVALHAALL VRNAKGLLIC GAPGAGKSTL TLALLEAGFA CGGDDITLMR PDGLLQGVPF
APALKRGSWR LLENMRASVE AAPVHRRLDN RHVRYLASIP FASDDPVKLG TIVLLRRRKG
RAAIAAVEPA RVLSELFRGA FTPARGLGLP QFDALLSAVR GASAIELSYT RLDEAVEMLS
SHHEGA