Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0927 |
Symbol | |
ID | 4268214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1052309 |
End bp | 1053448 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125679 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_741771 |
Protein GI | 114320088 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGA GCGAAGTCGA CTGGGCCCGA CGGGCGGTCC CCGGTGTCCA GGCCCTGGCC CCCTATGAGC CGGGCAAGCC CATCGCGGAA CTGCGCCGCG AATACGGCGT GACCGACATC ATCAAGCTGG CCTCCAACGA GAGCCCGCTG GGGCCCTCGC CGCAGGCCCT GACCGCCGCC CGCGAGGCCG CCGCCGAGGT GCACCGCTAT CCGGACGGCA ACGCCTTCGA GCTGAAGGCA CGGCTGGCTG CGCGGCATGG CGTCGGGGCG GAGCGCATCA CCCTGGGTAA TGGCTCCAAC GACGTCCTCG CGCTGATCGC CCAGGCCTTC CTGGGCCCGG AGCGCGAGGC GGTGTTCTCC CGCCACGCCT TCGCCGTCTA CCCCATTGTC ACCCAGGCGG CCGGGGCGGT GGCCCGGGTG GCGCCCGCGC ACGGCGCCGA CAGCGACCAA CCCTACGGGC ACGATCTGGC CGCCATGCAG CGGCTGATCA GCGGGCGCAC CCGGGTGGTC TTCATCGCCA ACCCCAACAA CCCAACCGGT ACCTGGGTCG GGGAGGATGC GCTGCGCGCC TTCATCGAGC AGGTCCCGGG CGACGTGCTG GTGGTGGTGG ACGAGGCCTA TTTCGAGTAC GCCCGGGATC TCGCCGGCCT GCCCGACGCC AGCCGCTGGC TGGATGAGTT CCCCAACCTG GTGGTCACCC GGACCTTCTC CAAGTGCTAC GGGCTGGCCG GGTTGCGGGT GGGCTACGCG CTCAGCAGCC CGCCGGTGGC GGAGCTGCTC AACCGCGTCC GCCAGCCCTT CAACTGCAAT GCCGTGGCCC AGGCGGCGGC CGGGGCAGCG CTGGACGACG AGGCCCATCT GGCCCGCGCC ATCGCCCTCA ATACCGAGCA ACTGCGGCTC ATGGAGGCGG AGCTGCGCCA GTTGGGGCTC ACCGTCCTGC CCTCCGCCGG CAACTTCCTC TGTTTCGATG TCGGCGGCGG CGCTGCCTCG GTCAACGAGG GGTTGCTGCG GGCCGGGGTC ATCGTGCGCC CGGTGGGCGG TTACGAACTG CCCGGTTTCC TGCGGGTATC GGTCGGCCTG CCCGAGGAGA ACCGGCGCTT CCTCGACACC CTGGAGCGGC TGATCAGCGC CCCGGCATGA
|
Protein sequence | MSMSEVDWAR RAVPGVQALA PYEPGKPIAE LRREYGVTDI IKLASNESPL GPSPQALTAA REAAAEVHRY PDGNAFELKA RLAARHGVGA ERITLGNGSN DVLALIAQAF LGPEREAVFS RHAFAVYPIV TQAAGAVARV APAHGADSDQ PYGHDLAAMQ RLISGRTRVV FIANPNNPTG TWVGEDALRA FIEQVPGDVL VVVDEAYFEY ARDLAGLPDA SRWLDEFPNL VVTRTFSKCY GLAGLRVGYA LSSPPVAELL NRVRQPFNCN AVAQAAAGAA LDDEAHLARA IALNTEQLRL MEAELRQLGL TVLPSAGNFL CFDVGGGAAS VNEGLLRAGV IVRPVGGYEL PGFLRVSVGL PEENRRFLDT LERLISAPA
|
| |