Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2112 |
Symbol | hisD |
ID | 4710042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2317180 |
End bp | 2318499 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856586 |
Product | histidinol dehydrogenase |
Protein accession | YP_001003678 |
Protein GI | 121998891 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGACA TCAAGCGGCT GAGCACCACA CAACCCGATT TCTGGGACAG CCTGGACGCC CTGACGAGCT GGGGGAGCCA CCCGGGCGCC TCCATTGAGC AGCGCGTCGC CGAGGTGGTC GAGGGGGTTC GCACCGGCGG TGATGCGGCC CTGCTGGATT ACACGCGGCG TTTCGACGGC CGCGAGGCCG ATTCGGTGGC CGCGCTGGAG ATCCCGACCG AGCAGGTGGT CGCGGCCCAC GAACAGATCG CCCCGGAGAT GCGCGAGGCG CTGGAGCAGG CGGCCGCGCG GATCCGCGAC TACGCGCAGC GTCAGCGCCT GGAGTCCTGG GAGTACCAGG ACGCCGACGG TAACCGCCTC GGGCAGCAGG TCACCGCCCT CGACCGGGTC GGGGTCTACG TCCCCGGCGG CAAGGCCGCC TATCCCTCAT CGGTGCTGAT GAACGCCGTG CCGGCCCGGG TGGCCGGCGT GCAGGAGATC ATCATGACCG TCCCGGCCCC CGGGGGGGAG CTCTCACCAC TGGTGCTGGC CGCTGCCCAT GTGGCGGGCG TGGATCGCAT CTTCACCTTG GGCGGGGCGC AGGCGGTGGC GGCCCTGGCC TACGGGACCG AGACCGTACC CGCCGTGGAC AAGATCGTCG GCCCCGGTAA CGCCTATGTT GCCGAGGCCA AGCGCCGCGT CTACGGGGTG GTGGGCATCG ACATGATCGC CGGCCCCTCC GAGGTGCTGG TCATCAGCGA CGGCCAGGCC GATCCCGAGT GGATCGCCAT GGACCTGTTC TCCCAGGCCG AGCACGACGA GGAGGCGCAG GCCCTGCTGG TCTGCCCGGA CTTCGTCTTC CTCGATCAGG TCCAGGCGGC CATGGAGCGC CTGCTGCCGG ATATGGAGCG TTCGGAGATC ATCCGCACCT CGCTGGCGGA GCGGGGGGCG TTGATCTGTG TCCGCGATCT GGAAGAGGCG CAGCAGGTGG CCAACTATGT CGCTCCGGAA CACCTGGAGC TCTCCGTGGC CGAGCCCGAT CGACTGGCCG AGGGCATCCG TCACGCCGGG GCGATCTTCC TCGGCCATTA CAGCGCCGAG TCACTGGGTG ATTACTGTGC CGGCCCCAAC CACACGCTGC CGACGTCGCG GACCGCGCGG TTCGCATCGC CGTTGGGTGT CTACGACTTC CAGAAGCGTT CCACCACGCT GGCCTGCTCA CCCGCCGGCG CCGCAGCACT GGCCGGGACC GCCGCGGTGA TGGCACGGGG CGAGGGACTG ACCGCCCATG CCCGCTCAGC GGAGTACCGC GGCCAGGGGG AGGAGCGCAG CGGTGACTGA
|
Protein sequence | MVDIKRLSTT QPDFWDSLDA LTSWGSHPGA SIEQRVAEVV EGVRTGGDAA LLDYTRRFDG READSVAALE IPTEQVVAAH EQIAPEMREA LEQAAARIRD YAQRQRLESW EYQDADGNRL GQQVTALDRV GVYVPGGKAA YPSSVLMNAV PARVAGVQEI IMTVPAPGGE LSPLVLAAAH VAGVDRIFTL GGAQAVAALA YGTETVPAVD KIVGPGNAYV AEAKRRVYGV VGIDMIAGPS EVLVISDGQA DPEWIAMDLF SQAEHDEEAQ ALLVCPDFVF LDQVQAAMER LLPDMERSEI IRTSLAERGA LICVRDLEEA QQVANYVAPE HLELSVAEPD RLAEGIRHAG AIFLGHYSAE SLGDYCAGPN HTLPTSRTAR FASPLGVYDF QKRSTTLACS PAGAAALAGT AAVMARGEGL TAHARSAEYR GQGEERSGD
|
| |