Gene Dshi_2946 details

Gene Information

Locus tag: Dshi_2946
Symbol: hisC
ID: 5710797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 3105385
End bp: 3106470
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 68%
IMG OID: 641268872
Product: histidinol-phosphate aminotransferase
Protein accession: YP_001534280
Protein GI: 159045486
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
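
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3105385
end_bp = 3106470

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1086 361"
```

Both results agree with the Gene Length and Protein Length fields.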


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0271169
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA TCACCCCCCA GCCCGGCATC ATGGACATCG CGCTCTACGA GGGTGGGGCG 
TCGAAAGTGG ACGGTCTCGA CACCGTCATC AAGCTCAGCT CGAACGAGAA TCCGCTGGGC
CCCAGCCCCG CCGCGATCGC CGCCTACAAG GCGGCTGCGG GGGAGTTGCA CCGCTATCCC
TCCACCGATC ACGCGGGCCT GCGCGGCGCG ATCGCCGAGG TCTATGGCCT TGATCCCGAA
CGGATAATCT GCGGCGCCGG GTCGGACGAG ATCATCGCGT TCCTGTGCCA GGCCTATGTC
GGGCCCGGCG ACGAGGTGAT CCATACCGAA CACGGCTTTG CCATGTACCG CATCAGCACC
CTGGCCGCCG GCGGCACCCC CGTCGAAGTG CCGGAACGGG AGCGCGTGAC GGATGTGGAC
GCGATCCTCG CCGGGGTGAC CGACCGCACC CGGCTGGTGT TCATCGCCAA CCCCAACAAC
CCCACCGGCA CCATGATCGG CGGCAACGCC TTGGCCCGTC TCGCGGACGG GTTGCCGGAG
GGGTGCTTGC TGGTTCTGGA CGGGGCTTAC GCGGAATATG TGCCGGATTA CGACGCCGGA
AAGGCGCTGG TCGAGAGCCG CGAGAATGTG GTGATGACCC GAACGTTTTC AAAGATTTAC
GGGCTGGGTG CCCTGCGCGT CGGCTGGGGC TACGGGCCGC GCCACGTCAT TGATGTGCTC
AACCGCGTGC GGGGTCCGTT CAACCTGTCG ACCGGGGCGC TGGCGGCGGC GGAGGCGGCC
GTGCGGGACC GCGCCTATAC CGAGACCTGC CGCGCCGAGA ACGCCAAATG GCGCGGCTGG
CTGGCCAGCG AACTGGCCGC TCTCGGCATC CCCTCGGATA CCTCCTCGGC CAATTTCGTG
CTGGCCCGTT TCGCCAGCCC GGAGGAGGCA GGCGCCTGCG ACGACTTTCT CAAGGCGCGC
GGGATCATCG TCCGGCGCGT TTCGGGCTAC AAGCTGCCCG CCGCCCTGCG CATGACCGTG
GGCGACGCGG AAGGCTGCCG CGCACTCGTG GACGCCGTCG CCGCCTTCAA GGCGCAGGCG
GCATGA
 
Protein sequence
MTQITPQPGI MDIALYEGGA SKVDGLDTVI KLSSNENPLG PSPAAIAAYK AAAGELHRYP 
STDHAGLRGA IAEVYGLDPE RIICGAGSDE IIAFLCQAYV GPGDEVIHTE HGFAMYRIST
LAAGGTPVEV PERERVTDVD AILAGVTDRT RLVFIANPNN PTGTMIGGNA LARLADGLPE
GCLLVLDGAY AEYVPDYDAG KALVESRENV VMTRTFSKIY GLGALRVGWG YGPRHVIDVL
NRVRGPFNLS TGALAAAEAA VRDRAYTETC RAENAKWRGW LASELAALGI PSDTSSANFV
LARFASPEEA GACDDFLKAR GIIVRRVSGY KLPAALRMTV GDAEGCRALV DAVAAFKAQA
A
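
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a sketch rather than the database's own method: it assumes the sequence is pasted as shown (space-separated blocks of ten bases), and it uses the standard codon assignments, which translation table 11 shares with table 1. Table 11 differs in the alternative start codons it allows, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence from the record above.
first_row = "ATGACCCAGA TCACCCCCCA GCCCGGCATC ATGGACATCG CGCTCTACGA GGGTGGGGCG"
print(translate(first_row))  # prints "MTQITPQPGIMDIALYEGGA"
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be the way to compare against the complete protein sequence and the GC content field.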