Gene Smed_0527 details

Gene Information

Locus tag: Smed_0527
Symbol: ispH
ID: 5321361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 568944
End bp: 569954
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 62%
IMG OID: 640789461
Product: 4-hydroxy-3-methylbut-2-enyl diphosphate reductase
Protein accession: YP_001326218
Protein GI: 150395751
COG category: [I] Lipid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0761] Penicillin tolerance protein
TIGRFAM ID: [TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming)
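
The coordinates and lengths in this record are mutually consistent, which a short sketch can confirm. It assumes 1-based, inclusive coordinates (the record does not state the convention) and that the final codon is a stop codon excluded from the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(568944, 569954)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the values come out as 1011 bp and 336 aa, matching the Gene Length and Protein Length fields.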


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.49504
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGCCT CATCCGTAGC AAAAACCCCG ATCACCATCC GCCTCTGCGG GCCGCGTGGC 
TTCTGCGCGG GCGTCGACCG GGCGATCCAG ATCGTGGTGC TTGCGCTCAA GGAGTTCGGC
GCGCCGGTCT ATGTCCGTCA TGAGATCGTG CACAATCGCT ATGTCGTGGA AGGTCTTGAA
GCCAAGGGTG CGATTTTCGT CGAGGAGCTC GACGAGATTC CACCCGAGCA TCGCAAGCAG
CCCGTCGTCT TCTCCGCCCA CGGCGTGCCG AAATCCGTCC CCGCGGATGC GGATGCGCGC
AATCTCTTCT ATCTCGATGC CACATGCCCG CTCGTCTCGA AGGTGCACAA ACAGGCTATG
CGCCATCACC GTATGGGGCG CCATGTGGTG CTGATCGGCC ATGCCGGCCA TCCCGAGGTT
ATCGGCACCA TGGGGCAACT GCCGGAGGGG ACGGTCTCCC TCATCGAGAC TGTCGAGGAT
GTCGACGTTT ATACGCCCCC GGATCCGGAC AATCTCGGCT TTGTTACGCA GACGACGCTC
TCGGTGGATG ATACCGCCGG CGTCATCAAG CGGCTCCATG AGCGCTTTCC GAACCTGACT
GCGCCTGCCG CCGACTCGAT CTGCTACGCC ACCACGAACC GGCAGGAAGC GGTGAAGCAG
GCTGCACCCG GCTGCGATCT TTTCCTCGTC GTCGGCGCCC CCAATTCTTC GAACTCGAAG
CGCCTGGTGG AAGTAGCGCT GAGGGCCGGG GCAAAGAAAG CCGTTCTGGT TCAGCGGGCT
TCTGAAATTG ACTGGGCGAC GATCGGGGAA ATCTCGACCG TCGGGTTGTC CGCCGGTGCC
TCGGCGCCGG AGGTGATCGT CAATGAGATC ATCGAAGCCT TCCGCGAGCG CTACGACGCC
GCGGTCGAGC TTGCCGACAC GGTGGAGGAG AACGAGCACT TCCTCGTCAA CCGCGAGCTC
AGGCATGTCG AACTGACCGG CGCCGACATG GCTTTCGTCA ATGGTGAATA G
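
The gene sequence is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. The following is a minimal sketch of that cleaning plus a GC-fraction calculation. The helper names are hypothetical, and the GC figure in this record may have been computed by the database with a different rounding rule.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups bases into blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First block line of the gene sequence above, used as an example input.
example = "ATGATGGCCT CATCCGTAGC AAAAACCCCG ATCACCATCC GCCTCTGCGG GCCGCGTGGC"
print(len(clean_sequence(example)), round(gc_fraction(example), 2))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the 62% listed under GC content.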
 
Protein sequence
MMASSVAKTP ITIRLCGPRG FCAGVDRAIQ IVVLALKEFG APVYVRHEIV HNRYVVEGLE 
AKGAIFVEEL DEIPPEHRKQ PVVFSAHGVP KSVPADADAR NLFYLDATCP LVSKVHKQAM
RHHRMGRHVV LIGHAGHPEV IGTMGQLPEG TVSLIETVED VDVYTPPDPD NLGFVTQTTL
SVDDTAGVIK RLHERFPNLT APAADSICYA TTNRQEAVKQ AAPGCDLFLV VGAPNSSNSK
RLVEVALRAG AKKAVLVQRA SEIDWATIGE ISTVGLSAGA SAPEVIVNEI IEAFRERYDA
AVELADTVEE NEHFLVNREL RHVELTGADM AFVNGE
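
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino-acid string in NCBI codon order. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, a case this sketch does not handle (this gene starts with ATG). The function name `translate` and the stop-at-first-stop behaviour are choices made for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Codons containing anything other than A, C, G, T
    raise KeyError; a trailing partial codon is skipped.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATGGCCT CATCCGTAGC AAAAACCCCG"))  # MMASSVAKTP
```

The output corresponds to the first ten residues of the protein sequence listed above.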