Gene NATL1_18891 details


Gene Information

Locus tag: NATL1_18891
Symbol: hisD
ID: 4779203
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1551192
End bp: 1552541
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 37%
IMG OID: 640085178
Product: histidinol dehydrogenase
Protein accession: YP_001015709
Protein GI: 124026594
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
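
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1551192, 1552541)
print(length, protein_length(length))  # prints "1350 449"
```

Under these assumptions the result matches the Gene Length and Protein Length values listed above.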


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.978409
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA TAATTAATAA AGACGAGGTT CAAGAAACCT CTTCAAAAAA ATTGACCATA 
AAAACAGCTA ATAGCATTGA TCAGGCACAG TTTGAGCTAA GGAGAATTAC TGAGAGAACA
TCTGGAACCG TTCAAGATGA AGCTATAAAG GTAGTTGACG ATATTCTTAA AAACGTAAGG
GAAAGAGGGG ATGAAGCACT TACAGAATAC ACTTCTCGCT TTGATGGATT TCTAACGGAA
AAATTTCAAG TTTCATCAGA TTTAATACTG AAAGCTTGGG AAGAGACTCC TAGGGAACTT
CAAGATTCGC TTTTATTGGC AAAAAAAAGA ATTGAAAAAT TTCATAGTCT TCAGGTACCA
AAAAATATTA CTTATACAGG ACCCAATGGT GAAACACTTG GAAGAAGATG GAGCCCTGTT
GAAAAAGCAG GCATTTATGT TCCTGGCGGA AGAGCCGCCT ATCCCAGCAC TGTGTTAATG
AATGCTATTC CTGCTTATGT TGCAGGAGTC AATCAAATTA TTATGGTTTC TCCTGCTAAC
TCTCAAGGAG AGATAAACCA AACCGTTTTA GCTGCAGCAC ATATTACAGG TATCAACAAA
ATCTTTCGTC TTGGAGGCGC TCAAGCTATT TGCGCACTTG CTAGTGGAAC TGAATCAATT
CCAAAAGTAG ATGTAATTAC TGGACCAGGA AATATTTATG TAACGTTGGC AAAGAAAAAA
GTTTATGGAA AGGTAGGAAT TGATTCTTTG GCTGGTCCAA GCGAGATCCT AATAATCGCT
GATCAATCAG CAAAATTAGA ACATGTTGCA TCTGATATGT TAGCTCAGTC AGAACATGAT
CCTTTAGCCT CAGCGATACT AATCACTACA AATACAAAAT TAGCTGAAAA GTTACCCGCA
GAAATTAACC GTCAATTAAT TAATCATCCA AGATTAAAAA TATGTCAAGA ATCAATTTCC
AACTGGGGTT TAATAGTCCT TTGTGATGAT TTAGAAACTT GTGCGCAATT GAGCGATACT
TTTGCCCCAG AACATCTTGA ATTACTTGTA GAGGACCCAA AAAAATTATC AGAAAGCATA
AACAATGCTG GGGCAATATT TATGGGCCCA TGGAGCCCAG AGGCTATTGG AGATTATCTT
GGCGGGCCTA ATCACACTCT TCCCACTTCT GGAACTGCAA GATTTGCTGG CGCTCTTGGA
GTTGAAACTT TTATGAAAAA TACCTCACTT ATAGATTTTT CAAAAGAAGC ATTTAATGAA
AATAAAAATG CAGTTGTACA ATTAGCCAAT AGCGAGGGAC TACATAGTCA TGCAGAATCA
ATAAGAATTA GAGACTCTAA ATCTTTTTAA
 
Protein sequence
MTQIINKDEV QETSSKKLTI KTANSIDQAQ FELRRITERT SGTVQDEAIK VVDDILKNVR 
ERGDEALTEY TSRFDGFLTE KFQVSSDLIL KAWEETPREL QDSLLLAKKR IEKFHSLQVP
KNITYTGPNG ETLGRRWSPV EKAGIYVPGG RAAYPSTVLM NAIPAYVAGV NQIIMVSPAN
SQGEINQTVL AAAHITGINK IFRLGGAQAI CALASGTESI PKVDVITGPG NIYVTLAKKK
VYGKVGIDSL AGPSEILIIA DQSAKLEHVA SDMLAQSEHD PLASAILITT NTKLAEKLPA
EINRQLINHP RLKICQESIS NWGLIVLCDD LETCAQLSDT FAPEHLELLV EDPKKLSESI
NNAGAIFMGP WSPEAIGDYL GGPNHTLPTS GTARFAGALG VETFMKNTSL IDFSKEAFNE
NKNAVVQLAN SEGLHSHAES IRIRDSKSF
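
The sequences above are printed in space-separated blocks of 10 characters, so the whitespace has to be removed before any downstream use. A minimal sketch of that cleanup, plus a simple GC fraction calculation, is shown below. The helper names are hypothetical, `excerpt` is only the first line of the gene sequence, and how the database derived its 37% GC content figure (for example, how it rounds) is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence only; paste the full block to check the record.
excerpt = "ATGACGCAAA TAATTAATAA AGACGAGGTT CAAGAAACCT CTTCAAAAAA ATTGACCATA"
seq = clean_sequence(excerpt)
print(len(seq), round(gc_content(seq) * 100))
```

Running the same two functions on the full gene sequence gives a length and a GC percentage that can be compared with the Gene Length and GC content fields.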