Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1059 |
Symbol | |
ID | 4484841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1169183 |
End bp | 1170403 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639729834 |
Product | histidinol-phosphate aminotransferase |
Protein accession | YP_872818 |
Protein GI | 117928267 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0743444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.133143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC CTGACGTCAC CACCGGGGGC GCTGCTGGGG CCGGCGGTAC GGCCCGGCCC GCGCGGGCCG CGGCGGGTTT CACGCCGCCG CTGCGGCTCG ATCTGGTCGG TGAGAGACCG TACGGCGCTC CGCAACTCGC CGTTCCCGTG CGGTTGAACA CCAACGAGAA TCCCTACCCG CCGCCGCCGC AGGTGGCCGC CGCAATGGCC GACGAGATTC GCCGCATTGC CGCCGGCCTC AACCGGTACC CGGACCGTGA AGCCGCTGTG CTCCGCGCCG ATCTTGCTGA ATATCTCGCG GCGACCGAGC ACGTCAGCCT GGACGTCGCC CGGATCTGGG CCGCCAACGG GTCCAACGAA ATCTTCCACC AGCTGCTGCT CGCGTTCGGT GGACCGGGCC GGCGTGCGCT GAGTTTCGCC CCGACGTACT CGATGTATCC GCAGTACTGC CGCGACACCT TCACCCGCTA TGCCACCGAA CCCCGCGGAG CGGATTTCAG CGTGGACACC GAGGCGGCGT GCGCGGCGGT CCGCCGGCAC CGGCCGACCG TGACATTCCT TGCCGCGCCG AACAATCCAA CGGGCACGGC GGTCCCCCTC GACACGGTGG CACACCTCGC GTCCGCGGCC GCGGAGAGCG GGGGATTGCT CGTCGTCGAC GAGGCGTACG CCGAATTCCG CCGCCCGGGA ACGACCAGCG CGTTGACCCT GCTGGACGAT TTTCCGAACC TCGTCGTGAC CCGGACGATG AGCAAGGCAT TTGCCGCCGC CGGCGTCCGG CTGGGGTATC TCGCCGCTCA CCCGGCGGTT GTCGACGCGT TGCGGATTGT CCGGCTGCCG TACCATTTGT CGACGGTTAC CCAAGCGGTG GCACGGGTTG CGCTTGCGCA CGCGGCGGAA TTGCTCGCTC AGGTCGCCGA GATTCGCGCG GAACGGGACG CCGCCGTCGC CTGGCTGCGG GAATCCGGCT TCACGGCGGC GGAATCGGAC GCGAATTTCG TGCTGTTCGG GATTTTCGCC GATGCGCACC GGGTCTGGCA GGAATTGGTC GACGCCGGCG TGCTCATCCG GGAGACCGGG CCGGACGGTT GGCTGCGGGT GAGCATCGGG ACACCGGAGG AAATGCGCGC TTTCCGCACC GCCTTGGCCG CCGCACCGGC TGCGGGCCGG CGGGTGCGGC GTACCGACGG TGCGCCGCAT CAGGAGGGAG GGGTGACGTG A
|
Protein sequence | MTTPDVTTGG AAGAGGTARP ARAAAGFTPP LRLDLVGERP YGAPQLAVPV RLNTNENPYP PPPQVAAAMA DEIRRIAAGL NRYPDREAAV LRADLAEYLA ATEHVSLDVA RIWAANGSNE IFHQLLLAFG GPGRRALSFA PTYSMYPQYC RDTFTRYATE PRGADFSVDT EAACAAVRRH RPTVTFLAAP NNPTGTAVPL DTVAHLASAA AESGGLLVVD EAYAEFRRPG TTSALTLLDD FPNLVVTRTM SKAFAAAGVR LGYLAAHPAV VDALRIVRLP YHLSTVTQAV ARVALAHAAE LLAQVAEIRA ERDAAVAWLR ESGFTAAESD ANFVLFGIFA DAHRVWQELV DAGVLIRETG PDGWLRVSIG TPEEMRAFRT ALAAAPAAGR RVRRTDGAPH QEGGVT
|
| |