Gene Daci_5521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_5521 
Symbol 
ID5751141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp6139463 
End bp6140587 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content68% 
IMG OID641300654 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_001566535 
Protein GI160900953 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.284682 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCTT CCATTCCGCA GGTTCAGGCC CTGGCCCTGG AGCGCATCCG CGCCGATGTC 
CGCTCCATGC ATGCCTACCA TGTGCAGCCC TCGCAAGGCC TGCTCAAGAT GGACACGATG
GAGAACCCCT TCCGCCTGCC CCCTCAATTG CAGGCCGCGC TGGGCGAGCG CCTGGGCCGG
CTTGAGATCA ACCGCTACCC TGGCGAGCGC CAGGAAGTGC TCAAGGACAT GCTGGCCCGC
TACGCGCAGG CGCCGGCCGG CAGTGCCGTG CTGCTGGGCA ATGGCTCGGA CGAGATCATC
ACGCTGCTGG CCCTGGCCTG TGCCCAGCCC CACGCCGAGG GCCGCGCCAC CATGCTGGCG
CCCATGCCGG GCTTCGTCAT GTACCCCATG AGCGCCAGGC TGCAGGGCCT GGACTTTGTG
GGCGTCAACC TGACGGCCGA CTTCGAGCTG GACGTGCCCG CCATGCGCGC CGCCATTGCC
GAGCACCGCC CCGCCATCAC CTATATCGCC TATCCCAACA ATCCCACGGC CACGCTCTGG
GCCGAGGCCG ATGTGCAGGC CGTGATCGAC GCGGTCGCCG CCATCGGCGG CCTGGTCGTC
ATGGACGAGG CCTACCAGCC CTTCGCCCGC CGCAGTTGGG CACAGAACAT GCGCGCCGAC
CCGGCCCGCA ACGCCCATGT GCTGCTGATG CGCACGCTCA GCAAGTTCGG CCTGGCCGGT
GCGCGCCTGG GCTACCTGAT CGGCCCGGCC GCCATCGTCG GCGAGATCGA CAAGGTGCGC
CCGCCCTACA ACATCAGCGT GCTCAACTGC GAGACGGCCA TCTTCGCGCT GGAGCACGAG
GCGCTGTATG CGCAGCAGGC CGTGGCCATC CGCGCCGAGC GCCAGCCGCT GATCGATGCG
CTGGCCACGC TGCCGGGTGT GGAAAAGATC TGGCCATCCG AGGCCAACAT GGTCTTGCTG
CGCGTGCGCG ATGCCGCGCG CGCCCAGGCC GAGATGAAGG CCCGGGGCGT GCTGGTGAAG
AATGTCTCTG CCATGCACCC GCTGCTGGTC AACTGCCTGC GACTGACCGT GGGCACCCAC
GAAGAAAACG CCCAGATGCT GGCGGCCCTG AAGGAATCCC TATGA
 
Protein sequence
MTASIPQVQA LALERIRADV RSMHAYHVQP SQGLLKMDTM ENPFRLPPQL QAALGERLGR 
LEINRYPGER QEVLKDMLAR YAQAPAGSAV LLGNGSDEII TLLALACAQP HAEGRATMLA
PMPGFVMYPM SARLQGLDFV GVNLTADFEL DVPAMRAAIA EHRPAITYIA YPNNPTATLW
AEADVQAVID AVAAIGGLVV MDEAYQPFAR RSWAQNMRAD PARNAHVLLM RTLSKFGLAG
ARLGYLIGPA AIVGEIDKVR PPYNISVLNC ETAIFALEHE ALYAQQAVAI RAERQPLIDA
LATLPGVEKI WPSEANMVLL RVRDAARAQA EMKARGVLVK NVSAMHPLLV NCLRLTVGTH
EENAQMLAAL KESL