Gene Smed_3123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3123 
Symbol 
ID5324002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3271524 
End bp3272579 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content62% 
IMG OID640792073 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001328784 
Protein GI150398317 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGAT TCTGGTCGCC GATCGTTGCC GCCTTGAAGC CTTACGTTCC CGGTGAACAG 
CCGCGGATGG CGAACCTCGT CAAGCTCAAC ACCAATGAGA GCCCCTATGG CCCCTCGGAA
AAAGCGCTGG AGGCGATTCG GGCGGCGGCT GACATGGATC TCAGGCTTTA TCCGGATCCC
GTCGCTCTCG GGTTGCGGGA GGCGATCGCC AGGCGCTACG GCATGTCCGT CGGCGAGGTC
TTCGTCGGCA ATGGTTCCGA CGAGGTTCTG GCCCACACCT TCGCGGCGCT TTTGAAACAC
GATCGGCCGC TGCTTTATCC TGACATCTCC TACAGCTTCT ATCCGACCTA TGCGGGCCTT
TTCGGCATCG AAGCGGTCGA GATTCCGCTG AACGCCGACT TCCGCATCGA AATTGCAGGC
TATCGCCGCC CTGCCGGCGC CATCATCCTG CCGAATCCGA ATGCGCCGAC GGGAATCGGC
CTGCCGCTTT CCGACATTGA AAGGCTGGTG AGCGAGCATT CCGACCAGCC CGTGGTCGTC
GACGAGGCCT ATATCGACTT CGGCGGCGAA TCCGCGATCG CACTCGTTCC GAAATACGAG
AACCTGCTCG TCGTTCAGAC CTTCTCGAAG TCACGTGCGC TTGCCGGTCT TCGCGTCGGC
TTCGCGATAG GCCAAAGGCC GCTGATCGAG GCGTTGGAGC GCGTCAAGGA TAGTTTCAAT
TCCTATCCGC TCGGGCGCGC CGCCCTTGCG GGTGCAACGG CTGCGATCGA GGACGAGGCC
TGGTTCGAGA AGACGCGCGC CAAGATCCTC GCCACGCGGG CGGCATTGAC GAAGGGGCTC
GAAGCGCGCG GTTTCGAGGT CCTGCCGTCT CAGGCGAATT TCGTCTTCGC CCGGCATCAG
AACCACGCGG GCCAGACGCT GGCGGCGAAA CTCCGCGAAC GCGCGGTCAT CGTCCGCCGT
TTTGCCAAGC CCCGGATTGA GGATTTCCTG CGCATCACCA TCGGCACCGA TGATGAATGC
GCCAAACTGG TCGCGGCGCT CGACGAAATT CTGTGA
 
Protein sequence
MSRFWSPIVA ALKPYVPGEQ PRMANLVKLN TNESPYGPSE KALEAIRAAA DMDLRLYPDP 
VALGLREAIA RRYGMSVGEV FVGNGSDEVL AHTFAALLKH DRPLLYPDIS YSFYPTYAGL
FGIEAVEIPL NADFRIEIAG YRRPAGAIIL PNPNAPTGIG LPLSDIERLV SEHSDQPVVV
DEAYIDFGGE SAIALVPKYE NLLVVQTFSK SRALAGLRVG FAIGQRPLIE ALERVKDSFN
SYPLGRAALA GATAAIEDEA WFEKTRAKIL ATRAALTKGL EARGFEVLPS QANFVFARHQ
NHAGQTLAAK LRERAVIVRR FAKPRIEDFL RITIGTDDEC AKLVAALDEI L