Gene Smed_0665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0665 
Symbol 
ID5321501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp715869 
End bp716990 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content61% 
IMG OID640789601 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001326356 
Protein GI150395889 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAT TCTCGCGCCT CACGCCTCTC ATCCAGTCTT TGCCGGCCAC CGTCCCCTTC 
GTCGGCCCGG AAGCCCTCGA GCGTCAGCGT GGCCGCAAGA TAACGGCGCG AATCGGTGCC
AATGAGAGCG GATTCGGCCC GGCTCAATCC GTGTTGCACG CCATTCGGCA AGCGGCTGAT
GAGACCTGGA AATATTCCGA TCCCGAGAAT CACGACCTGA AGCAGGCGCT CGCCGACCAT
CTCGGCATCC CTTCCGAGAA TATCGCCGTG GGCGAAGGCA TAGACGGCCT TCTAGGCCAG
ATCGTACGAC TCGTCGTGGA AGCGGGCATG CCGGTCATAA CCTCTCTTGG GGGCTATCCG
ACGTTCAATT ATCATGTCGC AGGCCACGGC GGGCGGCTCG TGTCGGTGCC CTATGCCGAC
GATCAGGAAG ATCTCGAAGG ACTGCTCGCT GCGGCGGAGC GCGAGAATGC TCCGCTTGTG
TATCTGGCCA ATCCCGACAA TCCGATGGGA AGCTGGTGGC CGGCCGAGCG CGTGATCGCC
TTTGCGAACG CCCTTCCGGA AACGACGCTC CTGGTGCTGG ACGAAGCCTA TTGCGAGACG
GCCCCGCCGG ACGCTCTCCC CTCGATCGAG AGCCTTATCG ATAAGCCGAA CGTCATTCGG
ACGCGTACCT TCTCCAAAGC TTACGGCCTG GCCGGAGCTC GTATCGGTTA CACGCTGTCG
ACGCCCGGCA CCGCCCAGGC TTTCGACAAG ATACGCAACC ATTTCGGGAT GAGCCGTATC
GGGGTGGCGG CAGCGATCGC CGCTTTGGCC GACCAGAATT ACTTAAAGGA AGTCAAGCTC
AGAATCGCGA ATTCACGCGA CCGGATCGGC CGGATCGCCG GCGAAAACGG GCTCCTCGCA
CTTCCCTCGG CCACGAATTT CGTAACTGTC GATTGTGGAA AAGATGCAGC CTATGCGCGG
GGAATTGTCG ATCGGCTGAT GAGCGATCAC GGGATCTTCA TCCGGATGCC GGGGGTCGCG
CCGCTTAACC GCTGCATTCG CATCAGCACC GCGCCCGATG CTGAAATGGA TTGTCTGGCG
GTCGCGCTTC CGCAGGTGAT CAGGAAACTG GCTTCCGGTT GA
 
Protein sequence
MSEFSRLTPL IQSLPATVPF VGPEALERQR GRKITARIGA NESGFGPAQS VLHAIRQAAD 
ETWKYSDPEN HDLKQALADH LGIPSENIAV GEGIDGLLGQ IVRLVVEAGM PVITSLGGYP
TFNYHVAGHG GRLVSVPYAD DQEDLEGLLA AAERENAPLV YLANPDNPMG SWWPAERVIA
FANALPETTL LVLDEAYCET APPDALPSIE SLIDKPNVIR TRTFSKAYGL AGARIGYTLS
TPGTAQAFDK IRNHFGMSRI GVAAAIAALA DQNYLKEVKL RIANSRDRIG RIAGENGLLA
LPSATNFVTV DCGKDAAYAR GIVDRLMSDH GIFIRMPGVA PLNRCIRIST APDAEMDCLA
VALPQVIRKL ASG