Gene Daci_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaci_0149 
Symbol 
ID5745685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDelftia acidovorans SPH-1 
KingdomBacteria 
Replicon accessionNC_010002 
Strand
Start bp162479 
End bp164050 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content71% 
IMG OID641295211 
Producthistidine ammonia-lyase 
Protein accessionYP_001561180 
Protein GI160895598 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.468789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCA TCACGACTCC CGGCCTCACC TCCCGTCCCG GCACCGCGCT GACGCTGCAG 
CCCGGCCAGG TCAGCCTGGC CCAGCTGCGG GCCATCCAGC AGGGCGGCGT GCGCCTGTCC
ATGGCGGCGT CCGCCTACGA GCGCATGCGC GCCGCCCAGG CCCATGTGCA GCACATCGTG
GATGAGGACC AGGTCGTCTA TGGCATCAAC ACCGGCTTCG GCAAGCTGGC CTCCACCAAG
ATCGCCCATG ACCGCCTGGC AGAGCTGCAG CGCAACCTGG TGCTGTCGCA CAGCGTGGGC
ACGGGCGATC CGCTGCCTGA CGCCGTGGTG CGCCTGGTGC TGGCCACCAA GGCCGTGAGC
CTGGCGCGCG GCCACTCGGG CGTGCGCCCC GAGCTGGTGG ACGCGCTGCT GGCCCTGGCC
AACGCCGACG TGCTGCCCGT GATTCCCGCC AAGGGCTCGG TGGGCGCCTC GGGCGACCTG
GCGCCGCTGT CGCACCTGGC CTGCGTGCTG ATCGGCGAAG GCCAGGCCAA GATCGATGGA
CAGGTGGTGT CCGGCACCGA GGCCATGCGC CACCTGGGCC TGGAGCCCTT CGTGCTCGGC
CCCAAGGAAG GGCTGGCCCT GCTCAACGGC ACCCAGGTGT CCACGGCCCT GGCCCTGGTG
GGCCTGTTCC AGGGCGAGAG CGTGTTCGCG GCCGGCCTGG TGGCGGGCTG CCTGACGCTG
GAGGCCATCA AGGGCTCGGT CAAGCCGTTC GACGCGCGCA TCCACGAGGC GCGCGGCCAG
CTTGGCCAGA TCGCCGTGGC CGCCGCCGTG CGCGAGCTGC TGGACGGCAG CGCCATCGAC
ACCTCCCACC CCCATTGCGG CCGCGTGCAG GACCCGTACT CCATCCGCTG CGTGCCCCAG
GTCATGGGCG CCTGCCTGGA CAACCTCAGC CATGCGGCCC GCGTGCTGGT GATCGAGGCC
AACGCCGCCT CGGACAACCC GCTGGTCTTC GACAACGGCG ACGTGATCTC GGGCGGCAAC
TTCCACGCCG AGCCCGTGGC CTTCGCGGCC GACATCATCG CCCTGGCCCT GGCCGAGATC
GGCGCCATCT CCGAGCGCCG CATGGCCCTG CTGCTGGACA CCGGCCTGTC GGGCCTGCCG
GCCTTCCTGA TTGCCGACAG CGGCGTGAAC TCGGGCTTCA TGATCGCCCA GGTCACGGCC
GCCGCCCTGG CTGCCGAGAA CCAGTGCCTG GCCCATCCCA GCAGCGTCAC CAGCCTGCCC
ACCTCGGCCA ACCAGGAAGA CCATGTCTCC ATGGCCACCT ACGGCGCGCG GCGCCTGGGC
GAGATGGCGC GCAACACCGC CGTCATCGTG GGCGTGGAAG CCATGGCCGC CGCCCAGGGC
ATGGACTTCG ACCGCAGCCT GAACAGCTCC GAGTTGATCG AGGCGCAGTA CGCGCTGATC
CGCAGCCAGG TTCCGCACCT GGACCGTGAC CGCTACCTGG CACCCGACAT CGAAACCATG
CGCCAGTGGG CGCTTGCCAC GGACTGGCCC CAGGCCATCG TCCGCCACCT GCCCAGCCTC
CAGGCTGCCT GA
 
Protein sequence
MNTITTPGLT SRPGTALTLQ PGQVSLAQLR AIQQGGVRLS MAASAYERMR AAQAHVQHIV 
DEDQVVYGIN TGFGKLASTK IAHDRLAELQ RNLVLSHSVG TGDPLPDAVV RLVLATKAVS
LARGHSGVRP ELVDALLALA NADVLPVIPA KGSVGASGDL APLSHLACVL IGEGQAKIDG
QVVSGTEAMR HLGLEPFVLG PKEGLALLNG TQVSTALALV GLFQGESVFA AGLVAGCLTL
EAIKGSVKPF DARIHEARGQ LGQIAVAAAV RELLDGSAID TSHPHCGRVQ DPYSIRCVPQ
VMGACLDNLS HAARVLVIEA NAASDNPLVF DNGDVISGGN FHAEPVAFAA DIIALALAEI
GAISERRMAL LLDTGLSGLP AFLIADSGVN SGFMIAQVTA AALAAENQCL AHPSSVTSLP
TSANQEDHVS MATYGARRLG EMARNTAVIV GVEAMAAAQG MDFDRSLNSS ELIEAQYALI
RSQVPHLDRD RYLAPDIETM RQWALATDWP QAIVRHLPSL QAA