Gene Smed_4290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4290 
Symbol 
ID5318453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp784009 
End bp785316 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content62% 
IMG OID640776095 
Producthistidinol dehydrogenase 
Protein accessionYP_001313028 
Protein GI150376432 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.682921 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTGA CCTATCTCAA GCGCGGCAAG CCCGAGGCCC AGCGATCGGA GGAAGATGCG 
AAGGTCCGCG GGATCGTCGA GTCGACGCTT AAGGACATCG AGACGCGCGG CGACCAGGCG
GTTCGAGAAC TTTCTGAGAA GTTCGACAGG TTTTCGCCGC CGTCTTTCCG GCTCAGCCCA
TCGGAGATCG AGGCGGCCAT GTCCAGGGTC TCAACGCGCG ACATGACCGA TATCGACTTC
GCCCAGACGC AGATCCGCCG CTTCGCCGAG GCGCAACGCG CCTCGATGAC GGATATCGAG
ATCGAGACGA TTCCGGGGGT AATCCTCGGT CATCGCAACA TTCCCGTACA GTCGGTCGGG
TGTTACGTGC CCGGGGGCAA GTTCCCGATG GTGGCCTCCG CCCATATGTC GGTCCTGACG
GCCGCGGTTG CCGGCGTGCC GCGCATCGTT GCCTCCGCTC CTCCCCAGAA GGGTGCGCCG
CATCCGGCGA TCGTGGCGGC GATGCACAAA GCCGGTGCCC ACGAAATCTA CGTGCTCGGC
GGCATGCAGG CGGTCGGCGC GATGGCGCTC GGAACCGAGA CGATCAAGCC CGTCGACATG
CTGGTGGGTC CGGGAAATGC CTTCGTTGCC GAAGCCAAAC GGCAGTTGTA CGGCCGCGTC
GGAATAGATC TCTTCGCCGG TCCGACCGAG ACGATGGTGA TTGCCGACGA GACGGTGGAT
GCGGAGATAT GCGCAACCGA TCTCCTCGGT CAGGCCGAGC ATGGTTACAA TTCTCCGGCG
GTGCTTGTGA CCAATTCACG CAGGCTTGCC GATGAGACGC TTGCGGAAAT CGGCCGGCTT
CTTTCGATCC TGCCGACGGC GGACACCGCC AGTGCCTCAT GGCGCGACTA CGGCGAAGTG
ATCGTCTGCG ACACCTATGA GGAAATGCTC GACGTCGCCA ATGAAATCGC CTCCGAGCAC
GTGCAGGTCA TGACCGATCG CGATGATTGG TTCTTGGAGA ACATGCATTC CTACGGTGCG
CTTTTCCTTG GGCCACGCAC CAATGTCGCC AATGGCGACA AGGTCATCGG AACCAACCAC
ACCCTGCCGA CCAGGAAGGC GGGGCGCTAT ACGGGTGGCC TCTGGGTCGG CAAGTTCATG
AAGACGCATT CCTACCAGAA GGTGCTGACA GACGAGGCGG CTGCGGAAAT CGGCGCCTAT
TGCTCGCGCC TGTGCCTGCT GGAGGGCTTT ATAGGCCATG CGGAGCAGGC CAATGTCCGG
GTTCGCCGAT ACGGCGGACG CAATATCGGC TATGGCGGCG CGGCGTAG
 
Protein sequence
MTVTYLKRGK PEAQRSEEDA KVRGIVESTL KDIETRGDQA VRELSEKFDR FSPPSFRLSP 
SEIEAAMSRV STRDMTDIDF AQTQIRRFAE AQRASMTDIE IETIPGVILG HRNIPVQSVG
CYVPGGKFPM VASAHMSVLT AAVAGVPRIV ASAPPQKGAP HPAIVAAMHK AGAHEIYVLG
GMQAVGAMAL GTETIKPVDM LVGPGNAFVA EAKRQLYGRV GIDLFAGPTE TMVIADETVD
AEICATDLLG QAEHGYNSPA VLVTNSRRLA DETLAEIGRL LSILPTADTA SASWRDYGEV
IVCDTYEEML DVANEIASEH VQVMTDRDDW FLENMHSYGA LFLGPRTNVA NGDKVIGTNH
TLPTRKAGRY TGGLWVGKFM KTHSYQKVLT DEAAAEIGAY CSRLCLLEGF IGHAEQANVR
VRRYGGRNIG YGGAA