Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_0746 |
Symbol | |
ID | 7318093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 800714 |
End bp | 801805 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643615626 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_002512825 |
Protein GI | 220933926 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.275971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGACG CCGACACCCT GGTCGGCCAG TGGGTCCGGC CCGGCATCCG GGCCCTGGCC GCCTATCACG TCCCCGACGC CACCGGCTTC GTTAAACTGG ATGCCATGGA AAACCCCTAC CACTGGCCGC CGGAACTGGT GGATGCATGG CTCGCGCAGC TGCGTGACGT GGAACTGAAC CGCTACCCGG ATCCCCGTTC GCCGCAGCTG ATCGAGACCC TGCGCCGGGT CGCCGGCATC CCCGCCGACC AGTCCGTGAT CCTCGGCAAC GGCTCCGACG AGTTGATCCA GGTGATCATC ATGGCGCTCG CCGCGCCCGG GCGGGTGGTG ATGTCCGTGG AGCCGAGCTT CGTCATGTAC CGCATGATCG CCGGCTACGC GGACATGGCC TACGTGGGCG TGCCCCTGGG CGAGGACTTC GCCCTGGATG TGGATGGCTT CATCCGGGCC ATGCAGGAGC ACCAGCCCGC GGTGGTGTTC CTGGCCTATC CCAACAACCC CACGGGCAAC CGCTTTCCCC GCGAGGACGT GGAGGCCGTG ATCCAGGCCG CCCCCGGCCT GGTGGTGGTG GACGAGGCCT ATGCGCCGTT TGCCGATGAC AGCTTCCTCA AGGACCTGGG CCGTTACCCG AATCTCGTGG TCATGCGCAC GGTCTCCAAG CAGGGCCTGG CGGGCCTGCG CCTGGGTTAC CTGAGCGGCC CGGCTGAGTG GCTCTCGGAA TTCGACAAGC TGCGCCTGCC CTACAACATC AACGTGCTCA CCCAGGCCAG TGCCACCTTC GCCCTGGAAC ATCACCAGGT GCTCGAGGAC CAGGCCCGGG CCATCCGCCA CGACCGGGGC GTACTCCTGG ATGCCCTGGC CGCGCTGCCG GGGCTGAAGG TCTATCCCAG CGAGGCCAAC TTCATTTTGT TCCGCGTGCC GCAGGGGCAG GGCAACCGGG TGTTCCAGGG ATTGAAGGAC GCCGGCGTGC TCATCAAGAA CCTCTCCCCC CAGGGCGGCG TGCTCGCCGA CTGCCTGCGG GTGACCGTAG GCCGGCCGGA GGAAAACGCC CGCTTCATGG AGGCGTTGGC AGGTGTGCTG GGGTCTGGCT GA
|
Protein sequence | MKDADTLVGQ WVRPGIRALA AYHVPDATGF VKLDAMENPY HWPPELVDAW LAQLRDVELN RYPDPRSPQL IETLRRVAGI PADQSVILGN GSDELIQVII MALAAPGRVV MSVEPSFVMY RMIAGYADMA YVGVPLGEDF ALDVDGFIRA MQEHQPAVVF LAYPNNPTGN RFPREDVEAV IQAAPGLVVV DEAYAPFADD SFLKDLGRYP NLVVMRTVSK QGLAGLRLGY LSGPAEWLSE FDKLRLPYNI NVLTQASATF ALEHHQVLED QARAIRHDRG VLLDALAALP GLKVYPSEAN FILFRVPQGQ GNRVFQGLKD AGVLIKNLSP QGGVLADCLR VTVGRPEENA RFMEALAGVL GSG
|
| |