Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4629 |
Symbol | |
ID | 5736476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5914984 |
End bp | 5916570 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281793 |
Product | histidine ammonia-lyase |
Protein accession | YP_001547388 |
Protein GI | 159901141 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAATGTT TAGTGCTCAA TGGCGAGCAG TTAACAGTTG ATGGTTTGGT GGCTGCCGCT CGTAATCCGG CAATTAAGGT CGAATTAGCG CCCGAAGCAA TTGAACGAAT GCACTATTCT CGCGCTGCCG TCGAGCGATT TGTGGCCGAA GGTCGCGTGG TCTATGGCAT TACCACGGGC TTTGGTCATT TTCAAAATCG TACAATCGAT CGCGACCATG TGCGCGAGTT GCAACGCAAT ATTATTATGA GCCACGCCAC TGGCACAGGC ACGCCGCTGC GCCGCGACCA AGTACGCGCC ATGTTGATCG TGCGAGTCAA TACCTTGGCT AAAGGCTTTT CAGGGATTCG CCCGCTGGTT GCACAAGCCT TGCTTGATCT GCTCAACGCC GATATTTTGC CAATTATTCC TTGTCAAGGC TCGCTTGGAG CTAGCGGCGA TTTGGCTCCC CTCGCCCATG CCTGTTTGAT TTTGCTGGGC TTGGGCGAGG CGGTTGCTCC AGGTCAATCG CCAGTCCATG GCCAACGCAT GAGTGGAGCC GAAGTTTTAG CCCACTTGCA GCAAGAACCT TTGGTTTTAG AGGCTAAAGA AGGCTTAGCA TTAACTAATG GCACGGCATT ATTGAGTGGC TTAGCCGCCT TGGCAATCTA CGATGCCGAG CAACTTTGCC GCAGTGCCGA GACTATCGCC GCCTTGTCGA TGGAAGCTTT GGCGGCTTTG CCAGCAGCCT TCGATCAGCG GTTGCATGCA ATTCGTCCGC ATCCACGTCA GCTTGATAGT GCGCGGAGCA TTCGTCAATT GTTGCAAGGC AGTAGCTTTG TTTACCCCAG CCAAGCCGCT GATCCGACTA TTTATGGGCC GCATAAAGTC CAAGATGCCT ACTCGTTGCG CTGTGTGCCT CAAGTCCATG GGGCAATTCG CGATGCAGCC TGTTATGGGC GTTGGGCTAC CGAGATTGAA CTCAACAGCG CTACCGATAA CCCCTTGATT GTTCCTGTTG ATCCTGCGCA ACCCCATGGC GAATATGAGG CGATTTCGGG CGGTAACTTT CATGGTGAGC CTCTCGCATT AGCCATGGAT TTTCTGAAAG TGGCGTTGAG CGAATTGGGC AACATCAGCG AGCGCCGCAC TGCTCGCTTG GTTGATGCAG GTTTGAATGG CAATTTACTC GCCCCGTTTT TAACCGAGCA AGGCGGCCTG CACTCAGGCA TGATGTTGAT TCAATATACG GCTGTGGCTT TGGCGAGCGA AAATAAAGTG CTGGTACACC CCGCTGCTGC TGATACGATT CCTACCTCGG GTAATCAAGA AGATCATGTC AGTATGGGGC CGACTGCTGC CCGTCAGGCT GCCGAGATGC TCGATAATGT GGTGGGTATT TTGGCCTGTG AAGCCTTATG CGCGGCCCAA GCGATCGATT TACGTTGGCG CAAACACGAG CATTTACAGC TGGGCCAAGG AACTGCGCCC GCCCATCAAG TAATTCGCCA GGTTGTGCCA TTTCTAGCTG AAGATACCGT GATGTACCCG CATATCGAAG GCCTGAAACA GGTGATTCAG GCTGGTAAAT TGGCCTTAGC CGAATGA
|
Protein sequence | MECLVLNGEQ LTVDGLVAAA RNPAIKVELA PEAIERMHYS RAAVERFVAE GRVVYGITTG FGHFQNRTID RDHVRELQRN IIMSHATGTG TPLRRDQVRA MLIVRVNTLA KGFSGIRPLV AQALLDLLNA DILPIIPCQG SLGASGDLAP LAHACLILLG LGEAVAPGQS PVHGQRMSGA EVLAHLQQEP LVLEAKEGLA LTNGTALLSG LAALAIYDAE QLCRSAETIA ALSMEALAAL PAAFDQRLHA IRPHPRQLDS ARSIRQLLQG SSFVYPSQAA DPTIYGPHKV QDAYSLRCVP QVHGAIRDAA CYGRWATEIE LNSATDNPLI VPVDPAQPHG EYEAISGGNF HGEPLALAMD FLKVALSELG NISERRTARL VDAGLNGNLL APFLTEQGGL HSGMMLIQYT AVALASENKV LVHPAAADTI PTSGNQEDHV SMGPTAARQA AEMLDNVVGI LACEALCAAQ AIDLRWRKHE HLQLGQGTAP AHQVIRQVVP FLAEDTVMYP HIEGLKQVIQ AGKLALAE
|
| |