Gene Hoch_3903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3903 
Symbol 
ID8546299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5384691 
End bp5386253 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content72% 
IMG OID646388575 
Producthistidine ammonia-lyase 
Protein accessionYP_003268295 
Protein GI262197086 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.23664 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.18215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCG TGAGTGAACA TCTCGACCGC GCCCCTGTTC TCCTTGGCGA GTCACCGCTG 
GTGCTCGAAG ACATCGTCCG CGTGGCCCGC GACGGCGCCG CCGCGCAGCC CGGGCCCAGC
GCCCTCGAGG CCATGGCGAA ATCGCGCGCC GTGGTCGACA GCATCCTGAC TGGCGGCGAC
GACGCGCCTC TGGTCTACGG CGTCAACACC GGCTTTGGCG CGCTGGCCGA GGTCCGCATC
TCGTCGCGCC AGATCGCCGA GTTGCAGCGC AATCTGGTGC GTTCGCACGC GGTCGGCGTG
AGCACGCCGC TGCCGCGCGA GGCCGTGCGC GCCATGATGA TGTTGCGCGC CCAGGTGCTG
GCGCGCGGCC ACAGCGGCTC GCGTCCGATG ATCTGCGAGC GTCTGTGCGA GCTGCTGGCG
CGCGGCGTCC ACCCCGAGAT CCCCAGCCGC GGCTCGGTGG GCGCCTCGGG CGATCTCGCG
CCGCTGGCGC ATCTGGCGCT CACGCTCATC GGCGAGGGCC ACGCCGAGTA CCAGGGCGAG
CGACTGCCGG CGGCCGAGGC CTTGCGCCGG GCCGGCCTAA CGCCGGTCGA GCTGGCCGCC
AAGGAGGGCA TCACGCTGCT CAACGGCACC CAGCACATGA CCGCGCTGGG CGCGCTGAGC
GTGTTCGATG GCGAGCACAC CTGTCGCGTC GCCGACCTCG CCGGCGCCAT GTCGCTCGAG
GCGCTGCAGG GCACGGCGCG GGCCTTCGAC GCGCGCGTGG CCGGCGCGCG CCCGCACCCC
GGGCAGATGG CGGTGGCCGA GTCGCTGTGC GAGCTGCTGG CCGAGAGCGA GATCGCCGAC
TCCCACCGCG ACTGCGGCAA GGTCCAGGAT CCCTACTCGC TGCGCTGCAT GCCGCAGGTC
CACGGCGCCA CCCGCGACGT GCTCGCGTAC GCGCGCGCGG TGCTCGAGCG CGAGGCCAAC
GCGTGTACCG ACAATCCGCT GGTGTTCCTC GACGAGTCGC TCGCGCACGG CGGCGTGCTG
ATCTCGGGCG GCAACTTCCA CGGCCAGCCT GTGGCCCTGG CGCTCGACGC CGCGACCATG
GCGGTGGCCG AGCTGGCCAA CATCAGCGAG CGGCGCATCG AGCAGCTCGT CAACCCGGCG
CTCTCCAGCG GGCTGCCGCC CTTCCTGGCG CCCTCGAGCG GCCTCAACTC GGGCTACATG
ATCGCCCAGG TGAGCGCGGC GTCCCTGGTG TCCGAAAACA AAGTCCTGGC GCACCCGGCC
TCGGTCGATT CCATCCCCTC TTCGGCCGGA CGCGAGGACC ACGTGTCCAT GGGCGCGCTG
TCGGCGCTCA AGCTGCGCGA TGTCCACGAC CACGTGCGCA CGGTGCTCGC CATCGAGGTG
CTGTGCGCCA CGCAGGGCAT CGATCTGCGC GCGCCGCACA AGCCCAGTGT CAAGCTCCGG
GCCGCGCACG CCTGCGTCCG CGCGCGGGTT CCCTTCATGG AGCGCGACCG GCCCATCTAT
GAAGATGTCC AGGTGGTGCG CGCGCTCATC GACAGCGGCG AGCTGCTGGC CGCGGTGGCC
TGA
 
Protein sequence
MAAVSEHLDR APVLLGESPL VLEDIVRVAR DGAAAQPGPS ALEAMAKSRA VVDSILTGGD 
DAPLVYGVNT GFGALAEVRI SSRQIAELQR NLVRSHAVGV STPLPREAVR AMMMLRAQVL
ARGHSGSRPM ICERLCELLA RGVHPEIPSR GSVGASGDLA PLAHLALTLI GEGHAEYQGE
RLPAAEALRR AGLTPVELAA KEGITLLNGT QHMTALGALS VFDGEHTCRV ADLAGAMSLE
ALQGTARAFD ARVAGARPHP GQMAVAESLC ELLAESEIAD SHRDCGKVQD PYSLRCMPQV
HGATRDVLAY ARAVLEREAN ACTDNPLVFL DESLAHGGVL ISGGNFHGQP VALALDAATM
AVAELANISE RRIEQLVNPA LSSGLPPFLA PSSGLNSGYM IAQVSAASLV SENKVLAHPA
SVDSIPSSAG REDHVSMGAL SALKLRDVHD HVRTVLAIEV LCATQGIDLR APHKPSVKLR
AAHACVRARV PFMERDRPIY EDVQVVRALI DSGELLAAVA