Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_26290 |
Symbol | hutH |
ID | 7761537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2684516 |
End bp | 2686045 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643805507 |
Product | histidine ammonia-lyase |
Protein accession | YP_002799780 |
Protein GI | 226944707 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.663417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTCCG CCCTGCTTCT CCGCCCCGGC CAGCTCAGCC TGGACGACCT GCGCGCCATC TACTGGAGTC AGGTCGGAAT CGACCTCGAT CCGGCCTGCC GGACCGGGGT CGAGGCCAGC GCCCGCAGCG TCGCGGCGAT CCTCGCCAGC CAGCGCACCG TGTACGGCAT CAACACCGGC TTCGGCTTGC TGGCGCGCAC CTCGATCCCG GCCGAGTCGC TGACCGCCCT GCAGCGCAAC CTGGTGCTCT CCCACTGCAC CGGTACCGGA GCGCTGCTGG ACGACGCCAG CGTCGGGCTG ATCCTGGCAC TGAAGATCGC ATCCCTGGCG CGCGGTCACT CGGGCGTCGG CTGGGCGCTG ATCGAGGCGT TGCTGCGGCT CTACCGGGCC CGGGTCTATC CGTGCATTCC CTCCCAGGGC TCGGTGGGCG CCTCCGGCGA CCTGGCGCCG CTGGCCCATC TGGCGGCGAC ACTGCTCGGG ATCGGCCAGG TGCGCCACCG GGGCCACCTG CTGGAGGCTG GCGAAGGACT GGCCCTCGCC GGTCTCGAAC CCCTGACGCT CGGGCCCAAG GAAGGCCTGG CGCTGCTCAA CGGAACCCAG GTTTCCACCG CCCTGGCGCT GCGCGGGCTG TTCGCCGCGG AACGGCTGTT CGGCGCCGCG GTGGTCGCCG GCAGCCTGAG CACCGAGGCG TTGAAGGGTT CCTTCGTGCC CTTCGATACC CGCATCCAGG CGGTGCGCGG CCAGCCGGGA CAGATCGCGG TGGCCGCGCT CTACCGCGAA CTGCTCCACG ACAGCGCCAT CAACCGCTCC CATGCCCGCT GCGCGCGGGT TCAGGACCCC TACTCGCTGC GCTGCCAGCC GCAGGTGATG GGCGCCTGCC TGGATCACCT GCGTTTCGCC GCCGGGGTGT TTTTGCGCGA GGCCAACGCG GTGTCGGACA ACCCGCTGGT GTTCGCCGAC GATGCCGAGG TGCTGTCCGG CGGCAACTTC CACGCCGAAC CGGTGGCCAT GGCCGCCGAC GCGCTGGCCC TGGCCATCGC CGAGATCGGA GCCCTTTCCG AGCGGCGCAT CGCCCTGCTG ATCGACCCGG CGCTGTCCGG CCTGCCGGCC TTCCTGGTCA AGGAAGGCGG GCTGAATTCC GGCTTCATGA TCGCCCAGGT CACGGCGGCC TCGCTGGCCT CGGAGAACAA GACCCTGGCC CATCCGGCCT CGGTGGACAG CCTGCCGACC TCGGCCGGCC AGGAGGATCA CGTCTCCATG GCCACCTTCG CCGCCCGCCG CCTGCAGGAC ATGGCCGGCA ACGCCGCGGG CGTGGTGGGC ATCGAGCTGT TGGCCGCCGC CCAGGGCGTG GACTTCCACG CGCCGCTGTC CAGTTCGCCG CAACTGAACG AGGTCATGGC ATTGATCCGC AGCCGGGTGG CGCACTACGA AGAGGACCGC TATTTCGCCC CGGACATCGC CACCGCGCGA GCCTGGGTCG AGGGCGGCGC CTTCGAGCGC TGGGTGTCCT GCGCCCGCCT CCATCTCTGA
|
Protein sequence | MPSALLLRPG QLSLDDLRAI YWSQVGIDLD PACRTGVEAS ARSVAAILAS QRTVYGINTG FGLLARTSIP AESLTALQRN LVLSHCTGTG ALLDDASVGL ILALKIASLA RGHSGVGWAL IEALLRLYRA RVYPCIPSQG SVGASGDLAP LAHLAATLLG IGQVRHRGHL LEAGEGLALA GLEPLTLGPK EGLALLNGTQ VSTALALRGL FAAERLFGAA VVAGSLSTEA LKGSFVPFDT RIQAVRGQPG QIAVAALYRE LLHDSAINRS HARCARVQDP YSLRCQPQVM GACLDHLRFA AGVFLREANA VSDNPLVFAD DAEVLSGGNF HAEPVAMAAD ALALAIAEIG ALSERRIALL IDPALSGLPA FLVKEGGLNS GFMIAQVTAA SLASENKTLA HPASVDSLPT SAGQEDHVSM ATFAARRLQD MAGNAAGVVG IELLAAAQGV DFHAPLSSSP QLNEVMALIR SRVAHYEEDR YFAPDIATAR AWVEGGAFER WVSCARLHL
|
| |