Gene VC0395_A0705 details

Gene Information

Locus tag: VC0395_A0705
Symbol: hisB
ID: 5136810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 729173
End bp: 730246
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 50%
IMG OID: 640532163
Product: imidazole glycerol-phosphate dehydratase/histidinol phosphatase
Protein accession: YP_001216655
Protein GI: 147673310
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase
        [COG0241] Histidinol phosphatase and related phosphatases
TIGRFAM ID: [TIGR01261] histidinol-phosphatase
            [TIGR01656] histidinol-phosphate phosphatase family domain
            [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
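
The length fields in this record are consistent with the coordinates. Assuming Start bp and End bp are 1-based and inclusive, the gene length is End bp - Start bp + 1, and the protein length is the number of codons minus the terminal stop codon. A minimal sketch of that check, with values copied from the record and function names that are purely illustrative:

```python
# Values copied from the record above; the function names are hypothetical.
START_BP = 729173
END_BP = 730246


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1074 357
```

Both results match the Gene Length (1074 bp) and Protein Length (357 aa) fields.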


Plasmid Coverage information

Num covering plasmid clones: 52
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCAAAC AACAAAAAAT TCTTTTTATC GACCGCGACG GCACCTTAAT TGTTGAGCCG 
CCAGTGGATT TTCAAGTTGA CCGTTTGGAC AAACTCAAAC TTGAGCCTTA TGTGATCCCA
AGCCTGCTCA AGCTACAAGA GGCGGGTTAT CGTTTGGTGA TGGTGACCAA TCAGGATGGT
TTAGGCACTT CAAGCTACCC GCAAGCGGAT TTCGATGCGC CGCATAACAT GATGATGGAC
ATTTTTGAAT CGCAAGGCGT GAAGTTTAAT GATGTGCTGA TTTGCCCGCA CTTTGAGCGT
GATAACTGTT CGTGCCGCAA ACCGAAACTT GGCTTGGTCA AAGAGTATCT GCAAGGTGGC
AAAGTCGATT TCAAATCGTC AGCCGTGATT GGCGATCGCA TGACCGATCT GCAACTGGCT
GAAAATATGG CGATTCGCGG TATTCAATAC CACCCACAAA CCATGGGGTG GCTGGATATC
GTTAAAGATC TCACCACCAA GCCACGTGTT GCCCAAGTGG TGCGTAAAAC CAAAGAGACC
GATATTCAGG TTCTAGTCAA TCTTGACCAA ACTGGCGGTA ATCAGATTGA AACTGGATTG
GGCTTTTTTG ATCACATGCT AGATCAAATC GCGACCCACG GCGGTTTCCA ACTGCAACTG
AAAGTGGTGG GCGATCTGCA CATTGATGAT CACCACACGG TGGAAGATAC CGCCTTGGCA
CTAGGGCAAG CACTGCGTGA AGCACTGGGT GACAAACGTG GTATTGGACG CTTTGGTTTC
ACATTGCCCA TGGATGAGTG TTTGGCACAG TGCGCGTTGG ATCTCTCGGG CCGCCCGTAT
CTGAAATTTG ATGCGAGCTT TAGCCGCCCG CAAGTGGGCG ATCTGTCGAC TGAGATGGTG
TATCACTTCT TTCGCTCATT GACCGATACT TTGGCGTGTA CTCTGCACCT CTCTTCCAGT
GGCGATAACG ATCACCACAT CATTGAGAGC CTGTTTAAAG CGTTTGGCCG CACGCTGCGT
CAAGCCATCA CGGTGCAAGG TAACGATCTG CCGAGCAGTA AAGGGGTGCT CTGA
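
The sequence above is printed in blocks of 10 bases, 60 per line, so the spacing has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is used as an excerpt.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record as a short excerpt.
excerpt = "GTGAGCAAAC AACAAAAAAT TCTTTTTATC GACCGCGACG GCACCTTAAT TGTTGAGCCG"
seq = clean_sequence(excerpt)
print(len(seq), round(gc_fraction(seq), 2))
```

Running the same two functions over all 18 sequence lines gives the whole-gene value to compare against the reported 50%.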
 
Protein sequence
MSKQQKILFI DRDGTLIVEP PVDFQVDRLD KLKLEPYVIP SLLKLQEAGY RLVMVTNQDG 
LGTSSYPQAD FDAPHNMMMD IFESQGVKFN DVLICPHFER DNCSCRKPKL GLVKEYLQGG
KVDFKSSAVI GDRMTDLQLA ENMAIRGIQY HPQTMGWLDI VKDLTTKPRV AQVVRKTKET
DIQVLVNLDQ TGGNQIETGL GFFDHMLDQI ATHGGFQLQL KVVGDLHIDD HHTVEDTALA
LGQALREALG DKRGIGRFGF TLPMDECLAQ CALDLSGRPY LKFDASFSRP QVGDLSTEMV
YHFFRSLTDT LACTLHLSSS GDNDHHIIES LFKAFGRTLR QAITVQGNDL PSSKGVL
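
The protein sequence can be reproduced from the gene sequence with translation table 11. That table assigns the same amino acids as the standard genetic code but permits alternative start codons such as GTG, which is read as M when it initiates the protein. The following is a minimal sketch rather than a full implementation: `translate` and its `has_start` parameter are hypothetical names, and only the first 30 bases and the last line of the gene sequence are used.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (bases ordered T, C, A, G;
# first codon position varies slowest). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons allowed by translation table 11; as an initiator each reads as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str, has_start: bool = True) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Set has_start=False for a fragment that does not begin at the start codon.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence: begins with the GTG start codon.
print(translate("GTGAGCAAAC AACAAAAAAT TCTTTTTATC"))  # MSKQQKILFI

# Last line of the gene sequence: in frame, ends with the TGA stop codon.
last_line = "CAAGCCATCA CGGTGCAAGG TAACGATCTG CCGAGCAGTA AAGGGGTGCT CTGA"
print(translate(last_line, has_start=False))  # QAITVQGNDLPSSKGVL
```

The two outputs match the first ten and last seventeen residues of the protein sequence listed above.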