Gene PMN2A_1019 details

Gene Information

Locus tag: PMN2A_1019
Symbol: hisD
ID: 3606405
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL2A
Kingdom: Bacteria
Replicon accession: NC_007335
Strand:
Start bp: 1514582
End bp: 1515931
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 37%
IMG OID: 637687888
Product: histidinol dehydrogenase
Protein accession: YP_292212
Protein GI: 72382857
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
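
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the last
    codon is a stop codon, which does not encode a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1514582, 1515931)
print(length_bp)                  # 1350
print(protein_length(length_bp))  # 449
```

Both results agree with the Gene Length (1350 bp) and Protein Length (449 aa) fields listed in the record.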


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCAAA TAATTAATAA AGACGAGGTT CAAGAAACCT CTTCAAAAAA ATTGACCATA 
AAAACAGCTA ATAGCATTGA TCAGGCACAG TTTGAGCTAA GGAAAATTAC TGAGAGAACA
TCTGGAACCG TTCAAGATGA AGCTATAAAG GTAGTTGACG ATATTCTTAA AAACGTAAGG
GAAAGAGGGG ATGAGGCACT TACAGAATAC ACTTCTCGCT TTGATGGATT TCTAACGGAA
AAATTTCAAG TTTCATCAGA TTTAATACTG AAAGCTTGGG AAGAGACTCC TAGGGAACTT
CAAGATTCAC TTTTATTGGC AAAAAAAAGA ATTGAAAAAT TTCATAGTCT TCAAGTACCA
AAAAATATTA CTTATACAGG ACCCAATGGT GAAACACTTG GAAGAAGATG GAGCCCTGTT
GAAAAAGCAG GCATTTATGT TCCTGGCGGA AGAGCCGCCT ATCCCAGCAC CGTGTTAATG
AATGCTATTC CTGCTTATGT TGCAGGAGTT AATCAAATTA TTATGGTTTC TCCTGCTAAC
TCTCAAGGAG AGATAAACCA AACCGTTTTA GCTGCAGCAC ATATTACAGG TATCAACAAA
ATCTTTCGTC TTGGAGGCGC TCAAGCTATT TGCGCACTTG CTAGTGGAAC TGAATCAATT
CCAAAAGTAG ATGTAATTAC TGGTCCAGGA AATATTTATG TAACGTTGGC AAAGAAAAAA
GTTTATGGAA AGGTAGGAAT TGATTCTTTG GCTGGTCCAA GCGAGATCCT AATAATCGCT
GATCAATCAG CAAAATTAGA ACATGTTGCA TCTGATATGT TAGCTCAGTC AGAACATGAT
CCTTTAGCCT CAGCGATACT AATCACTACA AATACAAAAT TAGCTGAAAA GTTACCCGCA
GAAATTAACC GTCAATTAAT TAATCATCCA AGATTAAAAA TATGTCAAGA ATCAATTTCC
AACTGGGGTT TAATAGTCCT TTGTGATGAT TTAGAAACTT GTGCGCAATT AAGCGATACT
TTTGCCCCAG AACATCTTGA ATTACTTGTA GAGGACCCAA AAAAATTATC AGAAAGCATA
AACAATGCTG GGGCAATATT TATGGGCCCA TGGAGCCCAG AGGCTATTGG AGATTATCTT
GGCGGGCCTA ATCACACTCT TCCCACTTCT GGAACTGCAA GATTTGCTGG CGCTCTTGGG
GTTGAAACTT TTATGAAAAA TACCTCACTT ATAGATTTTT CAAAAGAAGC ATTTAATGAA
AATAAAAATG CAGTTGTACA GTTAGCCAAT AGCGAGGGAC TACATAGTCA TGCAGAATCA
ATACGAATTA GAGACTCTAA ATCTTTTTAA
 
Protein sequence
MTQIINKDEV QETSSKKLTI KTANSIDQAQ FELRKITERT SGTVQDEAIK VVDDILKNVR 
ERGDEALTEY TSRFDGFLTE KFQVSSDLIL KAWEETPREL QDSLLLAKKR IEKFHSLQVP
KNITYTGPNG ETLGRRWSPV EKAGIYVPGG RAAYPSTVLM NAIPAYVAGV NQIIMVSPAN
SQGEINQTVL AAAHITGINK IFRLGGAQAI CALASGTESI PKVDVITGPG NIYVTLAKKK
VYGKVGIDSL AGPSEILIIA DQSAKLEHVA SDMLAQSEHD PLASAILITT NTKLAEKLPA
EINRQLINHP RLKICQESIS NWGLIVLCDD LETCAQLSDT FAPEHLELLV EDPKKLSESI
NNAGAIFMGP WSPEAIGDYL GGPNHTLPTS GTARFAGALG VETFMKNTSL IDFSKEAFNE
NKNAVVQLAN SEGLHSHAES IRIRDSKSF
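
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal Python sketch of how a reader might rejoin the blocks and check them against the Gene Length and GC content fields. The helper names (`clean_sequence`, `gc_percent`, `looks_like_cds`) are hypothetical, and the start-codon set is an assumption limited to the most common bacterial start codons rather than the full list permitted by translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Assumption: only the most common bacterial start codons are checked here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_cds(seq: str) -> bool:
    """True if the sequence is a whole number of codons with a start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Usage: paste the full "Gene sequence" block from the record into `raw`.
raw = """
ATGACGCAAA TAATTAATAA
"""  # truncated here for brevity; only the first 20 bases are shown
seq = clean_sequence(raw)
print(len(seq), round(gc_percent(seq), 1), looks_like_cds(seq))
```

With the complete gene sequence pasted in, `len(seq)` can be compared with the Gene Length field and `gc_percent(seq)` with the GC content field.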