Gene Bcep18194_A3523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A3523 
SymbolhisD 
ID3748700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp389555 
End bp390871 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content69% 
IMG OID637761797 
Producthistidinol dehydrogenase 
Protein accessionYP_367769 
Protein GI78065000 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.305808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCA CCATCCGCAA GCTCGATTCG ACGAGCGCAG GCTTCGGCGC CGAGCTGCGC 
GCGCTGCTCG CATTCGAGGC GAGCGAAGAC GCGGCGATCG AGCAATCGGT CGCGCAGATC
CTCGCCGACG TGAAGTCGCG CGGCGACGCC GCGGTGCTCG AATACACGAA CCGCTTCGAC
CGGCTGAGCG CGAGCAGCAT TGCCGCGCTG GAGCTGCCGC AGGACGCGCT GCAGACGGCG
CTCGACAGCC TCGCGCCGAA GGCGCGCGCG GCGCTGGAGG CGGCGGCCGC GCGCGTGCGC
GCGTACCACG AGAAGCAGAA GATCGAGTGC GGCACGCATA GCTGGCAGTA CACGGAAAGC
GACGGCACGG TGCTCGGCCA GAAGGTCACG CCGCTCGACC GCGTCGGCCT GTACGTGCCG
GGCGGCAAGG CCGCGTATCC GTCGTCGGTG CTGATGAACG CGATTCCCGC GCGTGTCGCG
GGCGTCGGCG AGATCGTGAT GGTCGTGCCG ACGCCGGACG GCGTGAAGAA CGATCTCGTG
CTCGCCGCAG CGCTGCTCGG CGGCGTCGAT CGCGTGTTCA CGATCGGCGG CGCGCAGGCG
GTCGGTGCGC TCGCCTACGG CACGGCGACG GTGCCGGCCG TCGACAAGAT CTGCGGCCCC
GGCAACGCGT ACGTCGCGTC GGCGAAGCGC CGTGTGTTCG GCACGGTCGG CATCGACATG
ATCGCCGGCC CGTCGGAAAT TCTCGTGCTG TGCGACGGCA CGACCGATCC GAACTGGGTC
GCGATGGACC TGTTCTCGCA GGCCGAGCAC GACGAACTCG CGCAATCGAT CCTGCTGTGC
CCGGACGGTG CGTTCCTCGA GCGCGTCGAA AAGGCGATCG ACGAGCTGCT GCCGTCGATG
CCGCGCCAGG ACGTGATCCG CGCGTCGCTC GAAGGCCGCG GCGCGCTGAT CAAGGTGCGC
GACATGGCCG AAGCCTGCCG GATCGCGAAC GACATCGCGC CCGAGCACCT GGAAATTTCC
GCGCTGGAGC CGCAGCAATG GGGCCAGCAG ATCCGCCATG CGGGCGCGAT CTTCCTCGGC
CGCTACACGA GCGAGAGCCT CGGCGACTAC TGCGCGGGCC CGAACCACGT GCTGCCGACG
TCGCGCACCG CGCGTTTCTC GTCGCCGCTC GGCGTGTACG ACTTCATCAA GCGCTCGAGC
CTGATCGAGG TCAGTGCGGA AGGTGCACAG ACGCTCGGCG AGATCGCATC CGAGCTCGCG
TACGGCGAGG GGCTGCAGGC GCACGCGAAG AGCGCCGAGT TCCGGATGAA GCACTGA
 
Protein sequence
MSITIRKLDS TSAGFGAELR ALLAFEASED AAIEQSVAQI LADVKSRGDA AVLEYTNRFD 
RLSASSIAAL ELPQDALQTA LDSLAPKARA ALEAAAARVR AYHEKQKIEC GTHSWQYTES
DGTVLGQKVT PLDRVGLYVP GGKAAYPSSV LMNAIPARVA GVGEIVMVVP TPDGVKNDLV
LAAALLGGVD RVFTIGGAQA VGALAYGTAT VPAVDKICGP GNAYVASAKR RVFGTVGIDM
IAGPSEILVL CDGTTDPNWV AMDLFSQAEH DELAQSILLC PDGAFLERVE KAIDELLPSM
PRQDVIRASL EGRGALIKVR DMAEACRIAN DIAPEHLEIS ALEPQQWGQQ IRHAGAIFLG
RYTSESLGDY CAGPNHVLPT SRTARFSSPL GVYDFIKRSS LIEVSAEGAQ TLGEIASELA
YGEGLQAHAK SAEFRMKH