Gene Information |
Locus tag | Hoch_3903 |
Symbol | |
ID | 8546299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5384691 |
End bp | 5386253 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646388575 |
Product | histidine ammonia-lyase |
Protein accession | YP_003268295 |
Protein GI | 262197086 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
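The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon not counted in the protein; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5384691
end_bp = 5386253

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1563
print(remainder)       # 0
print(protein_length)  # 520
```

Under those assumptions the result matches the listed Gene Length (1563 bp) and Protein Length (520 aa).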
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.23664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.18215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCCG TGAGTGAACA TCTCGACCGC GCCCCTGTTC TCCTTGGCGA GTCACCGCTG GTGCTCGAAG ACATCGTCCG CGTGGCCCGC GACGGCGCCG CCGCGCAGCC CGGGCCCAGC GCCCTCGAGG CCATGGCGAA ATCGCGCGCC GTGGTCGACA GCATCCTGAC TGGCGGCGAC GACGCGCCTC TGGTCTACGG CGTCAACACC GGCTTTGGCG CGCTGGCCGA GGTCCGCATC TCGTCGCGCC AGATCGCCGA GTTGCAGCGC AATCTGGTGC GTTCGCACGC GGTCGGCGTG AGCACGCCGC TGCCGCGCGA GGCCGTGCGC GCCATGATGA TGTTGCGCGC CCAGGTGCTG GCGCGCGGCC ACAGCGGCTC GCGTCCGATG ATCTGCGAGC GTCTGTGCGA GCTGCTGGCG CGCGGCGTCC ACCCCGAGAT CCCCAGCCGC GGCTCGGTGG GCGCCTCGGG CGATCTCGCG CCGCTGGCGC ATCTGGCGCT CACGCTCATC GGCGAGGGCC ACGCCGAGTA CCAGGGCGAG CGACTGCCGG CGGCCGAGGC CTTGCGCCGG GCCGGCCTAA CGCCGGTCGA GCTGGCCGCC AAGGAGGGCA TCACGCTGCT CAACGGCACC CAGCACATGA CCGCGCTGGG CGCGCTGAGC GTGTTCGATG GCGAGCACAC CTGTCGCGTC GCCGACCTCG CCGGCGCCAT GTCGCTCGAG GCGCTGCAGG GCACGGCGCG GGCCTTCGAC GCGCGCGTGG CCGGCGCGCG CCCGCACCCC GGGCAGATGG CGGTGGCCGA GTCGCTGTGC GAGCTGCTGG CCGAGAGCGA GATCGCCGAC TCCCACCGCG ACTGCGGCAA GGTCCAGGAT CCCTACTCGC TGCGCTGCAT GCCGCAGGTC CACGGCGCCA CCCGCGACGT GCTCGCGTAC GCGCGCGCGG TGCTCGAGCG CGAGGCCAAC GCGTGTACCG ACAATCCGCT GGTGTTCCTC GACGAGTCGC TCGCGCACGG CGGCGTGCTG ATCTCGGGCG GCAACTTCCA CGGCCAGCCT GTGGCCCTGG CGCTCGACGC CGCGACCATG GCGGTGGCCG AGCTGGCCAA CATCAGCGAG CGGCGCATCG AGCAGCTCGT CAACCCGGCG CTCTCCAGCG GGCTGCCGCC CTTCCTGGCG CCCTCGAGCG GCCTCAACTC GGGCTACATG ATCGCCCAGG TGAGCGCGGC GTCCCTGGTG TCCGAAAACA AAGTCCTGGC GCACCCGGCC TCGGTCGATT CCATCCCCTC TTCGGCCGGA CGCGAGGACC ACGTGTCCAT GGGCGCGCTG TCGGCGCTCA AGCTGCGCGA TGTCCACGAC CACGTGCGCA CGGTGCTCGC CATCGAGGTG CTGTGCGCCA CGCAGGGCAT CGATCTGCGC GCGCCGCACA AGCCCAGTGT CAAGCTCCGG GCCGCGCACG CCTGCGTCCG CGCGCGGGTT CCCTTCATGG AGCGCGACCG GCCCATCTAT GAAGATGTCC AGGTGGTGCG CGCGCTCATC GACAGCGGCG AGCTGCTGGC CGCGGTGGCC TGA
|
Protein sequence | MAAVSEHLDR APVLLGESPL VLEDIVRVAR DGAAAQPGPS ALEAMAKSRA VVDSILTGGD DAPLVYGVNT GFGALAEVRI SSRQIAELQR NLVRSHAVGV STPLPREAVR AMMMLRAQVL ARGHSGSRPM ICERLCELLA RGVHPEIPSR GSVGASGDLA PLAHLALTLI GEGHAEYQGE RLPAAEALRR AGLTPVELAA KEGITLLNGT QHMTALGALS VFDGEHTCRV ADLAGAMSLE ALQGTARAFD ARVAGARPHP GQMAVAESLC ELLAESEIAD SHRDCGKVQD PYSLRCMPQV HGATRDVLAY ARAVLEREAN ACTDNPLVFL DESLAHGGVL ISGGNFHGQP VALALDAATM AVAELANISE RRIEQLVNPA LSSGLPPFLA PSSGLNSGYM IAQVSAASLV SENKVLAHPA SVDSIPSSAG REDHVSMGAL SALKLRDVHD HVRTVLAIEV LCATQGIDLR APHKPSVKLR AAHACVRARV PFMERDRPIY EDVQVVRALI DSGELLAAVA
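Both sequences above are displayed in space-separated blocks of 10 residues, so the blocks need to be joined before any length or composition check. The following is a minimal sketch of that step together with a simple GC fraction calculation; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces between 10-residue display blocks and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence, copied from the record above.
first_blocks = "ATGGCAGCCG TGAGTGAACA TCTCGACCGC"
seq = join_blocks(first_blocks)

print(len(seq))                    # 30
print(f"{gc_fraction(seq):.0%}")   # 60%
```

Passing the full Gene sequence field through the same two helpers would give the length and GC fraction to compare against the Gene Length and GC content fields.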