Gene Smed_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0041 
Symbol 
ID5320868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp42491 
End bp43546 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content62% 
IMG OID640788972 
ProductPhoH family protein 
Protein accessionYP_001325736 
Protein GI150395269 
COG category[T] Signal transduction mechanisms 
COG ID[COG1702] Phosphate starvation-inducible protein PhoH, predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000833124 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAACGGAC ACGAACTGAT TTCTTCCTCA CCGCGCCAGT CGAAGACATC CGCGACGGAC 
GCCAATCACT TCGTGCTCAC CTTCGAGAAC AACCGCTTTG CGAGCGAACT CTTCGGCCAG
TTCGACCAGA ATTTGAAGCT GCTTGAAGAG CGGTTGCGCA TCGATGCCCG ACCGCGGGGA
AACTCCGTCG CAATCTCCGG TGACGTGGTT GCCACCAATC AGGCGCGCCG CGCTCTCGAC
TATCTCTACG GAAGGCTGCA GAGTGGCGCT TCGATCGATA CATCAGATGT CGAAGGGGCG
ATCCGCATGG CGGTCGCCGC CGACGATCAG CTACAGTTGC CGACGATGGA GCGCAAAGCC
AAATTGACAA TGGCCCAGAT TTCGACGCGC AAGAAGACCA TCGTTGCGCG CACTCCGATG
CAGGATGCCT ATATCCGCGC GATGGAGCGG TCGGAACTCG TCTTCGGCGT CGGCCCGGCC
GGCACCGGCA AGACCTACCT TGCCGTCGCT CATGCCGCCC AGCTGCTGGA GCGTGGCGCA
GTCGACCGTA TCATTCTCTC AAGGCCGGCG GTCGAAGCGG GCGAGCGTCT CGGCTTCTTG
CCGGGCGACA TGAAGGAGAA GGTCGATCCC TATCTCAGAC CTCTCTATGA CGCGCTCTAT
GACATGATGC CGGGCGACAA GGTGGAGCGG GCAATCACCG CAGGTGTAAT CGAGATCGCG
CCGCTTGCCT TCATGCGCGG GCGCACGCTC GCCAATGCCG CCGTTATCCT GGATGAGGCA
CAGAACACGA CATCGATGCA GATGAAGATG TTCCTGACGC GTCTGGGCGA AAACGGCCGG
ATGATCATCA CGGGTGATCC GAGTCAGGTC GACCTGCCGC GCGGCGTGAA GTCGGGCCTG
GTGGAGGCGC TGCAGATACT CAAGGGAGTA GAGGGCGTCT CGGTGATCCG CTTCAAGGAC
GCCGACGTCG TCCGCCATCC GCTGGTGGCG CGGATCGTCA GAGCCTATGA CAGCCAGACG
GCGGTTCACG ACGAGAGCGA GCAGGGCGAT CGTTGA
 
Protein sequence
MNGHELISSS PRQSKTSATD ANHFVLTFEN NRFASELFGQ FDQNLKLLEE RLRIDARPRG 
NSVAISGDVV ATNQARRALD YLYGRLQSGA SIDTSDVEGA IRMAVAADDQ LQLPTMERKA
KLTMAQISTR KKTIVARTPM QDAYIRAMER SELVFGVGPA GTGKTYLAVA HAAQLLERGA
VDRIILSRPA VEAGERLGFL PGDMKEKVDP YLRPLYDALY DMMPGDKVER AITAGVIEIA
PLAFMRGRTL ANAAVILDEA QNTTSMQMKM FLTRLGENGR MIITGDPSQV DLPRGVKSGL
VEALQILKGV EGVSVIRFKD ADVVRHPLVA RIVRAYDSQT AVHDESEQGD R