Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2942 |
Symbol | hisD |
ID | 4245284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4573381 |
End bp | 4574745 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638107982 |
Product | histidinol dehydrogenase |
Protein accession | YP_722579 |
Protein GI | 113476518 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.795445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGAA TAATTACTGA GTGGGTTGAG GCACAAGCTG AATTAAAACG GATTACAGAG CGTACCCACG ATGATATGGC TATTCATAAA GAAACAACGG TGCGAGAAAT TTTGCGCAAC GTTAGAGAAA AAGGTGATGA GGCTCTATTA AACTATACTT CTGAGTTTGA CCATATCACC TTGACTTCAG AAGAGTTAAA GATAAGTGGT TCTGAGTTAG ATGCAGCCTA TCAGCAAGTC AATAAGGACT TATTAAGTTC CATTCGTTTG GCTGCTAAAC AGATAAAGGC TTTTCACCGA CAACAAATTC CTAAGTCCTG GGTTCAGTTT GGTGATGATG AAGTGGTCTT GGGCAAACGT TACACTCCTG TGGATAGAGC TGGACTTTAT GTTCCTGGTG GTCAGGCTGC TTATCCTAGT ACAGTTCTGA TGAACGCTAT TCCAGCTCAA GTGGCTAAAG TCCCCAAGAT TGTTATGGTG ACACCACCCT CTCAAGGAAA AAAAATTAAT CCTGCGGTAC TGGTGGCTGC TCAAGAGGCA GGTATTCAGG AAATTTATCG AATTGGAGGT GCTCAGGCAA TAGCGGCTCT GGCTTATGGT ACAGAGACTA TACCCAAAGT GGATGTGATT ACTGGGCCTG GGAATATATA TGTAACTTTG GCCAAAAAAA TTGTTTATGG TACAGTAGGT ATTGATTCTT TGGCTGGGCC TTCAGAGGTG CTGATTATTG CAGATTCTGA GGCTAATCCA GTTTATGTGG CAGCAGATAT GTTGGCCCAA GCAGAACATG ATTCTTTGGC TGCTGCTATT TTATTGACTA CAGATTTAGA ACTAGCACGT CAGGTGGTGG TAGAGGTGGA ACGACAACTT GAAGATCATC CACGTCGCAC TTTGACGGAA AAAGCGATCG CCCATTATGG TTTGGTGGTT GTTGTTTCAT CTTTTGAAGA TGCTGTTGAA CTTTCTAATC AATTTGCTCC AGAACACTTG GAGTTGGAAG TTTCAGATCC TTGGGAACTA CTGGAAAATA TTCGCCATGC TGGTGCTATT TTCCTTGGTT ATTCAACTCC GGAAGCTGTG GGAGATTATT TGGCTGGGCC GAATCACACT TTACCTACTT CTGGTGCTGC TCGTTATGCT TCAGCATTGG GAGTAGAAAC TTTTCTGAAG CATTCTAGTT TGGTTCAATA TTCTCCTACT GCTTTGCAAA AGGTTGCTTC TGATATTGAT TTATTGGCCA CAGCTGAGGG TTTGCATTCT CATGCTAACT CAGTTAGATT GAGGATGAAA GCTCATGAAC ATGAACAACC TGAGTTAGTA AAACAAGAAA CAATTAGTCA GGAGCACGAC GGTAAAAATT TATAA
|
Protein sequence | MLRIITEWVE AQAELKRITE RTHDDMAIHK ETTVREILRN VREKGDEALL NYTSEFDHIT LTSEELKISG SELDAAYQQV NKDLLSSIRL AAKQIKAFHR QQIPKSWVQF GDDEVVLGKR YTPVDRAGLY VPGGQAAYPS TVLMNAIPAQ VAKVPKIVMV TPPSQGKKIN PAVLVAAQEA GIQEIYRIGG AQAIAALAYG TETIPKVDVI TGPGNIYVTL AKKIVYGTVG IDSLAGPSEV LIIADSEANP VYVAADMLAQ AEHDSLAAAI LLTTDLELAR QVVVEVERQL EDHPRRTLTE KAIAHYGLVV VVSSFEDAVE LSNQFAPEHL ELEVSDPWEL LENIRHAGAI FLGYSTPEAV GDYLAGPNHT LPTSGAARYA SALGVETFLK HSSLVQYSPT ALQKVASDID LLATAEGLHS HANSVRLRMK AHEHEQPELV KQETISQEHD GKNL
|
| |