Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_16791 |
Symbol | hisD |
ID | 4912697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1410728 |
End bp | 1412014 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640161276 |
Product | histidinol dehydrogenase |
Protein accession | YP_001091903 |
Protein GI | 126697017 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0141] Histidinol dehydrogenase |
TIGRFAM ID | [TIGR00069] histidinol dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.557567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAA TAAATAATAA AACAGAAGCT ATTGAAGAAT TGAAGAGGAT TTCTACCCGA ACTAATTCAG AAAATAATAA TAAGATAAAT AAAATTGTTG AAGAAATTCT TCAAGAGGTA AAAATTTCTG GAGATAACGC AGTTGAAAAA TATACCAAAA AATTTGATGG TTTCGATCCT AACCCTATGC AAGTGAATGC GGATCAAATA AAGAATGCAT GGGATGAAAT TGATAACAAT TTGAAACGCT CACTTGAGGA GGCTCACAAA AGAATTCAAA AATTCCATGA AAAAGAAATT CCTCAATCTT TCACGATAAA AGGTGAATAT GGTGATATTG TCCAAAGAAG ATGGAAACCA GTTAAAAATG CAGGTATTTA TATTCCTGGA GGAAGAGCTG CTTATCCAAG TACTGTATTA ATGAATGCAA TTCCTGCAAA AGTAGCAGGA GTAGCTGAAA TTATAATGGT ATCTCCTGGA AATAAAGAAG GGGAAATAAA CAAAACTGTT TTAGCTGCAG CTCACTTATC AGGTATCAAT AAAGTATTTA GAATTGGAGG AGCTCAAGCG ATTGGTGCAT TAGCATTTGG CACAAATCAA ATCAACAAAG TTGATGTTAT ATCAGGTCCA GGGAATATAT ATGTTACAAC TGCAAAAAAA CTAATTTATG GCTCAACAGG AATTGATTCA TTAGCTGGTC CAAGTGAAAT ATTAATCATT GCAGATGAAA CAGCTCAAAG CACTCATATA GCATCTGATC TACTAGCGCA AGCAGAACAT GATCCTTTGG CTTCATCAAT ACTACTAACT ACATCAAAGA ACCAGGCAAA AGAAGTTTTA GAAGAACTTT ATAAAAAAAT TGAGGACCAT CCAAGAAAAG AAATTTGCAT GCAATCAATC AACAATTGGG GTTCAATTGT GATTTGCGAG AATTCTGAAC AATGTGTTGA ACTAAGCAAT AACTTTGCCC CTGAACACCT AGAAATTCTT ACTGTAGATC CAAAAAAAAT TCTTGCAGGT ATAGAGAATG CAGGAGCAAT TTTTTTAGGA AAATGGACTC CAGAAGCTGT TGGAGATTAT CTTGCTGGAC CAAATCATAC TTTACCCACA TCAGGTAATT CTAGATTTAG CGGTTCTTTA GGGGTTGAAA CTTTTATGAA AAATTCTTCA ATAATAGAAT TTAATGAAGA AAGTTTAAAA GTTAATAGCC TTGATATTAT TAATCTAGCT AAAAGTGAGG GCTTGCATAG TCACGCTAAC TCAGTACAAA TAAGATTTGA AGATTAG
|
Protein sequence | MKIINNKTEA IEELKRISTR TNSENNNKIN KIVEEILQEV KISGDNAVEK YTKKFDGFDP NPMQVNADQI KNAWDEIDNN LKRSLEEAHK RIQKFHEKEI PQSFTIKGEY GDIVQRRWKP VKNAGIYIPG GRAAYPSTVL MNAIPAKVAG VAEIIMVSPG NKEGEINKTV LAAAHLSGIN KVFRIGGAQA IGALAFGTNQ INKVDVISGP GNIYVTTAKK LIYGSTGIDS LAGPSEILII ADETAQSTHI ASDLLAQAEH DPLASSILLT TSKNQAKEVL EELYKKIEDH PRKEICMQSI NNWGSIVICE NSEQCVELSN NFAPEHLEIL TVDPKKILAG IENAGAIFLG KWTPEAVGDY LAGPNHTLPT SGNSRFSGSL GVETFMKNSS IIEFNEESLK VNSLDIINLA KSEGLHSHAN SVQIRFED
|
| |