Gene Information |
Locus tag | Msed_1942 |
Symbol | |
ID | 5103329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1886825 |
End bp | 1888000 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507830 |
Product | histidinol phosphate aminotransferase |
Protein accession | YP_001192006 |
Protein GI | 146304690 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
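The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1886825, 1888000)
print(length)                  # 1176, matching "Gene Length"
print(protein_length(length))  # 391, matching "Protein Length"
```

Under those assumptions both values agree with the record, which is what one would expect for an unspliced, non-pseudo CDS.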
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.95238 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACAAG AAGGTGCTTT CCTGAAGCTA ACTCCTCGAG AAACATGTGA TGGAAAAAGC TTAAAACAGG TTAAAAACTA TTCCTCCCGA GGGATTCTTA TTGCACCAAC CGATGTAAAG GACATTATAT ACCCATGGTT AAAGGAGGCA AAGGAGTACG ATTTCTCGGA TATCAAGGAT GGGATCAGGC TACACCTGAA TGAGTCCCCC TATCCGCCTC CCGATTTCGT CCTAGATGCG GTAAAGAAGT ATTTAATTCA AGGGAACAGG TATCAGCATC CAGATCTAAC CAAGAGGTTT AAGGAGCTCG CTGCCGAGTA TAATAAGGTT GAACCTGGGG AGATCTTTCC AACACCTGGT GGAGACGGAG CCATAAGGTC AGTGTTCCTT AATCTCTCCA TGCCTGGGGA CAGGGTCGTG TTGAACTATC CCAGCTACAG CATGTACTCG GTCTACTCTT CCTTTAGGGG GTTGAACCAG GTTAGGGTTC CCCTTAGGGA GGAGGGAGAG TGGTGGAAGG AGGACTGGGA GAAGTTAGTT ACTGAGGCCA GAGACGCTAG ATTGGTGGCA ATAGACGACC CCAATAATCC AACTGGCTCC CCCATGATCA TGGGGGACGA GCAAAGATTG AGGGAACTGG TGGAGTCGAC CAAGGGGATA GTCCTTCTTG ACGAGGCATA TTACGAGTTT TCGGGATACA CTGCCTCAAG GCTGGTGTCT AAGTATCCGA ACCTCATGAT TGTGAGGACC ATGAGTAAGG CCTTCTCTCT TGCCTCCTTT AGGGTGGGTT ATCTCATAGC TAACAGGGAT GTGGTAAAGG CCCTCGAGAA GGGATCCACG CCCTTTGACG TTGCTCTTCC TTCTCTCATA GCTGGCATAA CCGCATTGGA AAATCCAGGT TACGCACACA GGATTGCTCA GGAGATCTCG GAGAACAGGG AAGGATTATA CCAGGGATTA ATTTCCCTCG GCGTGAAGGC TTACAGGTCA ATTACCAACT TCCTCTTGTT TAAACATTCA GCGGAGCTGG TCGAGCCCTT GATGAGGAAA GGGATAGCCA TAAGGAACCC AGTAAAGGGA TTTTATAGGG TATCAGTTGG GACAAAAGAG CAGTGTAATT TGTTCCTAAA TAAACTGGGT GAAGTACTTG AAAATAGCGA TACCAAACAA AGGTAG
|
Protein sequence | MRQEGAFLKL TPRETCDGKS LKQVKNYSSR GILIAPTDVK DIIYPWLKEA KEYDFSDIKD GIRLHLNESP YPPPDFVLDA VKKYLIQGNR YQHPDLTKRF KELAAEYNKV EPGEIFPTPG GDGAIRSVFL NLSMPGDRVV LNYPSYSMYS VYSSFRGLNQ VRVPLREEGE WWKEDWEKLV TEARDARLVA IDDPNNPTGS PMIMGDEQRL RELVESTKGI VLLDEAYYEF SGYTASRLVS KYPNLMIVRT MSKAFSLASF RVGYLIANRD VVKALEKGST PFDVALPSLI AGITALENPG YAHRIAQEIS ENREGLYQGL ISLGVKAYRS ITNFLLFKHS AELVEPLMRK GIAIRNPVKG FYRVSVGTKE QCNLFLNKLG EVLENSDTKQ R
|
| |
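The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes NCBI translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard code (the tables differ only in permitted start codons). The `translate` helper is hypothetical; it ignores the whitespace that separates the 10-base blocks and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the block-separating spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGACAAG AAGGTGCTTT CCTGAAGCTA"))  # MRQEGAFLKL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole 391-residue protein, provided the record is internally consistent.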