Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_16921 |
Symbol | hisD |
ID | 4718422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1434952 |
End bp | 1436238 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640079418 |
Product | histidinol dehydrogenase |
Protein accession | YP_001010082 |
Protein GI | 123969224 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCA TAAATAATAA AGAAGATGCT ATCCAAGAAT TAAAAAGGAT TTCCACTCGA ACTAACTCAG AAAACAACCA CAAAATAAAT GCAATAGTCG AAGAAATTCT TCAAGAAGTA AAAAATTATG GAGATACAGC AGTAAAAAAA TATACCAAAA AATTTGATGG TTTCAACCCT GAACCTATGC AAGTAAGTTC GAACCAAATA AAAAATGCAT GGAATGAAAT TGATAACAAT TTGAAGCGCT CTCTTGAGGT AGCTCATAAA AGAATACAAA AATTCCATGA AAAAGAAATT CCTAAATCTT TTACAATAAA AGGTGAATAT GGTGATACAG TCCAAAGAAG ATGGCGACCA GTTAAAAATG CAGGTATTTA TATTCCTGGA GGCAGAGCTG CTTATCCAAG CACTGTATTA ATGAATGCAA TACCTGCGAA AGTAGCAGGA GTTAAGGAAA CTATAATGGT ATCTCCTGGG AATAAAGAAG GAGAAATAAA CAAAACTGTT TTAGCAGCAG CTCACTTATC AGGAATCAAT AAAGTATTTA GAATTGGAGG AGCTCAAGCA ATTGGTGCTT TAGCCTTTGG CACAAATCAA ATCAATAAAG TTGATGTTAT TTCAGGTCCA GGGAATATTT ATGTTACAAC TGCGAAAAAA CTTATTTATG GTTCTACAGG GATTGATTCT TTAGCTGGTC CAAGTGAAAT ATTAATCATT GCAGATGAAA CAGCTCAAAG CACTTATATA GCGTCTGATC TACTAGCCCA AGCAGAACAT GATCCTTTAG CTTCATCAAT CCTTCTAACT ACATCAAAAA ATCAGGCACA AGAAGTTTTA GAAGAACTTT ATAAAAAAAT AGATGATCAT CCAAGAAAAG AAATTTGCAT GCAATCAATT AAAAATTGGG GGTTAATTGT GATTTGTGAG AATTATGAAT CATGTGTTGA ACTAAGTAAT AACTTTGCCC CTGAACACCT AGAAATTCTT ACTATAAATT CAAAAAAAAT TCTTGCAGAT ATAGATAATG CAGGAGCGAT ATTTTTGGGG AAATGGACAC CAGAAGCTGT TGGAGATTAT CTTGCTGGAC CAAATCATAC TTTACCCACA TCAGGAAATT CTAGATTCAG CGGTTCTTTG GGGGTTGAAA CTTTTATGAA AAATACTTCA ATAATAGAAT TTAATGAAGA AAGTTTAAAA ATCAATAGCC CGGATATTAT AAATCTTGCT AATAGTGAGG GCTTACACAG CCACGCTAAC TCAGTACAAA TAAGATTTGA AGATTAG
|
Protein sequence | MKIINNKEDA IQELKRISTR TNSENNHKIN AIVEEILQEV KNYGDTAVKK YTKKFDGFNP EPMQVSSNQI KNAWNEIDNN LKRSLEVAHK RIQKFHEKEI PKSFTIKGEY GDTVQRRWRP VKNAGIYIPG GRAAYPSTVL MNAIPAKVAG VKETIMVSPG NKEGEINKTV LAAAHLSGIN KVFRIGGAQA IGALAFGTNQ INKVDVISGP GNIYVTTAKK LIYGSTGIDS LAGPSEILII ADETAQSTYI ASDLLAQAEH DPLASSILLT TSKNQAQEVL EELYKKIDDH PRKEICMQSI KNWGLIVICE NYESCVELSN NFAPEHLEIL TINSKKILAD IDNAGAIFLG KWTPEAVGDY LAGPNHTLPT SGNSRFSGSL GVETFMKNTS IIEFNEESLK INSPDIINLA NSEGLHSHAN SVQIRFED
|
| |