Gene Ava_4204 details

Gene Information

Locus tag: Ava_4204
Symbol: hisD
ID: 3680948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5265756
End bp: 5267057
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 49%
IMG OID: 637719551
Product: histidinol dehydrogenase
Protein accession: YP_324698
Protein GI: 75910402
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0141] Histidinol dehydrogenase
TIGRFAM ID: [TIGR00069] histidinol dehydrogenase
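
The coordinate and length fields in this record are linked by simple arithmetic: the inclusive span from Start bp to End bp gives the gene length, and an unspliced CDS of that length encodes one residue per codon minus the terminal stop codon. The following is a minimal Python sketch of that consistency check. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 5265756, 5267057
gene_length_bp = 1302
protein_length_aa = 433

print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold for the values listed here: the span is 1302 bp, which corresponds to 434 codons, or 433 residues once the stop codon is excluded.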


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.536856
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCGAA TCATTACTCA GCAGGCAGAT GTTAAAGCAG AACTGCAAAG AATCTGCGAT 
CGCACTCACG ACGAACAGGT GCTTCACAAG GAAGCAACTG TGCGGGAAGT GTTGCAAGCA
GTGAAACGCC AAGGCGACAA AGCTGTTTTG CATTACACAG ATGAATTTGA CAATCAAATT
CTCAAAGCTG AAGAGTTACG CGTTACAGGT TCAGAACTGG ACGCAGCTTA CCAACAGGTA
TCCAAGGAAC TGCTGGAGGC GATTCAGCTA GCTAGCCGCC AAATTGAAGC TTTTCATCGT
CAGCGAGTCC CCAAAAGCTG GGTACACTTT GGCGATGATG ATATTGTACT GGGCAAACGC
TACACTCCTG TAGACCGTGC GGGTTTGTAT GTTCCTGGTG GTCGTGCTGC TTACGTCAGT
ACAGTGCTGA TGAACGCAAT TCCGGCGAAG GTGGCTGGTG TACCGCGTAT AGTAATGGCG
ACACCACCAG GCGCACAGAA AGCGATTAAT CCCGCAGTGT TAGTAGCAGC TCAAGAAGTG
GGAGTACAAG AAATTTATCG GGTAGGTGGG GCGCAAGCGA TCGCTGCTTT AGCCTATGGT
ACAGAGACAA TCCCCAAGGT GGATGTAATT ACTGGCCCTG GTAACATCTA TGTCACTTTG
GCGAAAAAAC TGGTTTACGG CACTGTGGGG ATCGATTCCT TAGCCGGGCC TAGTGAAGTG
CTGATTATTG CCGATGAAGG AGCAAATCCC GTCCATGTAG CCACTGATAT GCTGGCACAG
GCGGAACACG ATCCAATGGC GGCGGCAATT TTGTTCACCA CAGACCCAGC TCTAGCGAAG
AATGTGCAAG TAGCAGTGGA AAGACAATTG GTAGATCATC CACGGCGGAT AGATACCGAA
AAAGCGATCG CTCATTACGG TTTAATCGTG TTGGTAGAAT CCCTAGATGC AGCCGCAGAA
CTCTCCAATG AATTTGCACC AGAACACCTA GAGTTAGAAG TTAAAGATCC TTGGGCTGTA
TTACCCAACA TTCGCCATGC TGGTGCTATC TTCCTCGGTT ATTCCACACC AGAAGCAGTA
GGGGACTATC TAGCCGGCCC CAACCATACT TTACCTACAT CTGGTGCTGC CCGTTATGCC
TCTGCCTTAA GTGTAGAAAC TTTCCTCAAA CATTCCAGCA TCATTCAGTA TTCCCAAACT
GCACTCAATA AGGTAGCTGG AGCCATTGAC GCTTTAGCCA CAGCCGAGGG CTTACCCTCT
CACGCTGACT CAGTGAAGCG GCGAATTCAG CAAGATGAAT GA
 
Protein sequence
MLRIITQQAD VKAELQRICD RTHDEQVLHK EATVREVLQA VKRQGDKAVL HYTDEFDNQI 
LKAEELRVTG SELDAAYQQV SKELLEAIQL ASRQIEAFHR QRVPKSWVHF GDDDIVLGKR
YTPVDRAGLY VPGGRAAYVS TVLMNAIPAK VAGVPRIVMA TPPGAQKAIN PAVLVAAQEV
GVQEIYRVGG AQAIAALAYG TETIPKVDVI TGPGNIYVTL AKKLVYGTVG IDSLAGPSEV
LIIADEGANP VHVATDMLAQ AEHDPMAAAI LFTTDPALAK NVQVAVERQL VDHPRRIDTE
KAIAHYGLIV LVESLDAAAE LSNEFAPEHL ELEVKDPWAV LPNIRHAGAI FLGYSTPEAV
GDYLAGPNHT LPTSGAARYA SALSVETFLK HSSIIQYSQT ALNKVAGAID ALATAEGLPS
HADSVKRRIQ QDE
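
The sequences above are listed in blocks of ten characters separated by spaces, so they need to be joined before any downstream use. The following is a minimal Python sketch of how the blocked gene sequence could be cleaned and sanity-checked. The helper names `clean_sequence`, `gc_fraction`, and `looks_like_complete_cds` are hypothetical, and the start-codon test is a simplification: it assumes an ATG start, whereas translation table 11 also permits alternative initiation codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG
    (simplifying assumption) and it ends with a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGCTGCGAA TCATTACTCA GCAGGCAGAT GTTAAAGCAG AACTGCAAAG AATCTGCGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same helpers to the full gene sequence would allow the Gene Length and GC content fields of the record to be compared against the listing itself.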