Gene Information |
Locus tag | Haur_4535 |
Symbol | |
ID | 5736386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5804372 |
End bp | 5805466 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281697 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_001547294 |
Protein GI | 159901047 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
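
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 5804372
END_BP = 5805466
GENE_LENGTH_BP = 1095
PROTEIN_LENGTH_AA = 364


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare derived values against the table's own fields.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the table is internally consistent: the coordinate span equals the listed gene length, and the gene length corresponds to 364 codons plus a stop codon.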
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.458798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATG TAACTCAGTT TGTGCGGCCT GATATTGCGG CGCTCGAAGC CTATACTCCG ATTCAACCAT TCGATGTGCT TAGCGAACAA TTGGGCATTC CAATTGAGCA CATTATTAAA CTTGATGCCA ATGAAAATCC TTACGGGCCA GCGCCCAGCG TTGCCGAAGC ACTGCGCGAT TGCCCGTTTT ATGCAATTTA TCCCGACCCT GATCAAACCC GTTTGCGGCG AGCAATTAGC CCATTTATTG GCCAGCCAAT TGAACGAATT TTATGTGGCA ATGGCTCAGA TGAAATTATC GATCTGTTGA TGCGAGTGCT GGTGCAGCCC AACGATGTGG TCATTGCCTG CCCACCAACC TTTGGCATGT ATAGCTTCAA TACTGGCGTG GTTGGCGGGC GGTTTGTGGC AGTGCCACGC GATGACAACT TCGATGTCGA TATCGAGGCG CTGGCCGATG CAGTGCTAGA GCATCAGGCC AAAATGGTGT TTTTGCCCTC GCCCAACAAT CCAACTGGCA ACCTTTTAGC CCGTGAACAC GTCCTACGGC TATTGAAATT GCCAACAATG CTCGTGTTGG ACGAAGCCTA CGCCGAGTTT AGTTCGGGGA GTGTCGCCGA TTTGGTTGGG CACTATCCCA ATTTGGTCGT GTTACGCACC TTTTCCAAAT GGGCTGGTTT AGCGGGGTTG CGCTTGGGCT ATGGGCTAAT TCCCGAGTGG TTGATTACCC ACCTGTGGAA AATCAAGCAG CCCTACAATA CCAATGTCGC AGCTGAGGTC GCGGTGTTGC AATCGTTGGC CGAAACCGAG GCGCTTAATG CCCGCGCCAA AATTATGGTT GCTGAGCGAG AACGTCTATT TGCTGCCTTG CAAACAATCG ACGGCCTTAC GCCATTTCCA TCGCAGGCAA ATTTTATTCT GTGTCGGGTT GAGCGTGGCG ATGCCGCCCA ACTCAAAGCC GACCTTGCCA AACGGGGAAT TTTGCTGCGC TACTATCGCA ACCCAGAGCT TGCCAATTGT ATTCGGGTGA GTATTGGCCT ACCAGAACAT CATGATGCCT TGCTTGCAGC GTTAGCCGAT TTGGGTTATC GCTAA
|
Protein sequence | MADVTQFVRP DIAALEAYTP IQPFDVLSEQ LGIPIEHIIK LDANENPYGP APSVAEALRD CPFYAIYPDP DQTRLRRAIS PFIGQPIERI LCGNGSDEII DLLMRVLVQP NDVVIACPPT FGMYSFNTGV VGGRFVAVPR DDNFDVDIEA LADAVLEHQA KMVFLPSPNN PTGNLLAREH VLRLLKLPTM LVLDEAYAEF SSGSVADLVG HYPNLVVLRT FSKWAGLAGL RLGYGLIPEW LITHLWKIKQ PYNTNVAAEV AVLQSLAETE ALNARAKIMV AERERLFAAL QTIDGLTPFP SQANFILCRV ERGDAAQLKA DLAKRGILLR YYRNPELANC IRVSIGLPEH HDALLAALAD LGYR
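
The relationship between the two sequences above can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand rather than using a library, and it assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG), so no reverse complement is taken even though the gene lies on the minus strand. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, so the standard assignments are used. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Drop the spaces used to group the sequence into blocks of ten.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-nt blocks of the gene sequence listed above.
    print(translate("ATGGCTGATG TAACTCAGTT TGTGCGGCCT"))
```

Applied to the first 30 nucleotides, the sketch yields the first ten residues of the listed protein sequence; passing the full gene sequence should work the same way.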