Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2206 |
Symbol | hisD |
ID | 7270291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2348675 |
End bp | 2349925 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643570820 |
Product | histidinol dehydrogenase |
Protein accession | YP_002467225 |
Protein GI | 219852793 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.209889 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGAAGG CGGTGGAGAT CAATGACTGG ACTATGAAGC GGAGGGCGAC CCTCGCGCAG GTTCAGGATG CCGTGCAGGG GATCATCGGC GAGGTCCGGG AGTCTGGAGA CCGTGCCCTG ATCGATCTCG CCGCCCGGTT CGATCGGGTC AAACTGGACG GAATTCGGGT CAGTGAGGAG GAGCAGCAGG AGGCCTACGA CCTCGTCGAC GAGCAGGTGG TCGAGAGCCT TGTCGAGGCG GCTGCCAGGA TCACGGTCTT CCATGAACTG CAGCGCCCGA AAGACCTCTG GCTCTCTGAG GTGGAGCCGG GCATCACCCT TGGGGTCAAG ACCACGCCGC TCTCACGGGT TGGAGCCTAT GTCCCCGGCG GGCGGGCCTC GTACCCGTCG ACGGCGTTGA TGTGCACTAT TCCCGCGAAG GTCGCCGGGG TCGGTGAGAT CTGCTGCTGC TCGCCGCCGC CGATCCACCC GTTAACCCTG GTTGCCCTTG ATATCGCCGG TGTCTCAGAG ATCTATCGGG CAGGCGGGGC ACAGGCGATC GCCGCGATGG CACTCGGCAC CGAAACGATA AAACCGGTTC AGAAGATCGT CGGGCCGGGA AACGTCTATG TGACGGCGGC TAAGATGCTC CTCCGGGAGT ACGCCGAGAT TGACTTCCCG GCCGGTCCGA GCGAGGTCGC GATCCTGGCC GATGAGACTG CGACCCCCTC CTTCGTCGCT GCCGATATCC TCGCCCAGGC CGAGCATGAC CCAAACGCTG CCTGTCTTCT GATCACCACA GACCCGACCC TGGCCCGGGA GGTCGGTGAG GAGGTCGGTC GGCAACTACT GATGGCCCCA CGGAAGGCGA TCATCGAGCA GTCCCTCAAC AACGCTGGTT ACCTGATCGC CAGCGATCTG GATGTAGCAA TCGAGGCCGT CAACACGGTC GCCCCCGAGC ACCTCTCGAT CCAGGTCGCC GACCCCCTCT CCGCCCTTGG CTCGATTCGA AATGCGGGCT CGATCTTCAT CGGCCCGTAC GCCCCGGTGG CCTGTGGGGA TTACGCGTCC GGTACCAACC ATGTACTGCC GACGGCAGGT TATGCAGCCC GTTTTTCAGG GCTCGATGTG AATCACTTCT GCAAGACATC GACCGTACAG ATGATCAGCA GGCGCGGGCT TGAGACGATT GGAGACGTGG TCGAGACGAT CGCCGAGGCT GAAGGACTCT CTGCCCACGC CGAGTCAGTT CGGGTGCGGC GCAGATCCTG A
|
Protein sequence | MWKAVEINDW TMKRRATLAQ VQDAVQGIIG EVRESGDRAL IDLAARFDRV KLDGIRVSEE EQQEAYDLVD EQVVESLVEA AARITVFHEL QRPKDLWLSE VEPGITLGVK TTPLSRVGAY VPGGRASYPS TALMCTIPAK VAGVGEICCC SPPPIHPLTL VALDIAGVSE IYRAGGAQAI AAMALGTETI KPVQKIVGPG NVYVTAAKML LREYAEIDFP AGPSEVAILA DETATPSFVA ADILAQAEHD PNAACLLITT DPTLAREVGE EVGRQLLMAP RKAIIEQSLN NAGYLIASDL DVAIEAVNTV APEHLSIQVA DPLSALGSIR NAGSIFIGPY APVACGDYAS GTNHVLPTAG YAARFSGLDV NHFCKTSTVQ MISRRGLETI GDVVETIAEA EGLSAHAESV RVRRRS
|
| |