Gene lpp1199 details


Gene Information

Locus tag: lpp1199
Symbol: hisB
ID: 3118653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Legionella pneumophila str. Paris
Kingdom: Bacteria
Replicon accession: NC_006368
Strand:
Start bp: 1337801
End bp: 1338859
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 41%
IMG OID: 637579892
Product: imidazole glycerol-phosphate dehydratase/histidinol phosphatase
Protein accession: YP_123523
Protein GI: 54297154
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0131] Imidazoleglycerol-phosphate dehydratase
        [COG0241] Histidinol phosphatase and related phosphatases
TIGRFAM ID: [TIGR01261] histidinol-phosphatase
            [TIGR01656] histidinol-phosphate phosphatase family domain
            [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA
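
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1337801
end_bp = 1338859
gene_length_bp = 1059
protein_length_aa = 352


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)       # True
print(coding_codons(gene_length_bp) == protein_length_aa)    # True
```

Under these assumptions, 1059 bp corresponds to 353 codons, that is, 352 amino acids plus the stop codon.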


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid Coverage: see fields below
Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA TTTTATTTAT TGACCGCGAT GGCACTTTAG TTGAAGAACC CTTTGATTTT 
CAAGTAGATT CTCTGGATAA AATTAAATTA ACTTCTGGAG TAATACCAGC CCTATTGCAA
TTACAAAAAG CTGGCTTTAC CTTCATTATG GTATCCAATC AAAATGGCAT AGGAACCGTA
GCTTTTCCAG AAGAGGATTT TGCTGTTTGC CATGAATTTA TCCTGGATCT GTTTTCCTCT
CAAGGCATTC TTTTTGATGA GATTTTTATA TGCCCTCACA CCCCTGAAGA CAATTGTATT
TGTAGAAAAC CTAAAACTGG CTTGCTCGAA CCTTACTTAA AAGAAACAGC ATTTGCCAAA
CATTATTCAT GGGTAATTGG CGACCGGGAT ACTGATAAAG AGTTTGCTGA TAATCTGGGC
GTTAATTTCT TGCCCATTTC GAAAACACAT ACTTGGGAAA TGGTGGTTTC TGCAATTATC
AACGATGCTC GCAAAGCGTC TGTGCAAAGA AAAACAAAAG AGACGGCAAT AGATTTGTCA
GTTCAACTGG ATAGCGATCA AACCAGTGTT ATAGACACCC CTATCCCCTT CTTTACCCAC
ATGTTGGAGC AAGTGGCAAA ACACGGAGGC TTTGATTTGC GATTACAAGC TTCTGGAGAT
TTGGAAGTAG ATGAGCACCA CTTGATTGAG GATACAGCCA TTGCATTGGG GGAAGCAATT
AGAACAGCGC TTGGTGACAA ATGGGGAATT AATCGTTACG GTTATACTCT TCCTATGGAT
GAATCATTAG CTTGCGTTGC AATCGATATT AGCGGCAGAA GCTTTTGTGA CTTCAAAGGT
CAGTTTACTC GTGAATTTGT TGGTGGTATG GCAACAGAAA TGGTGCCTCA CTTTTTCCAA
TCGCTATCGA GCGCGTTGGG GGCAACTATC CATATCGAGG TAACAGGCAC CAATCATCAC
CACATGATCG AAGCATGCTT CAAAGTTTTA GGCAGAGCCT TGCGACAGGC ATGCTCAAGA
ACGAATAATT ATCTACCTTC AACCAAGGGT GTATTATGA
 
Protein sequence
MKKILFIDRD GTLVEEPFDF QVDSLDKIKL TSGVIPALLQ LQKAGFTFIM VSNQNGIGTV 
AFPEEDFAVC HEFILDLFSS QGILFDEIFI CPHTPEDNCI CRKPKTGLLE PYLKETAFAK
HYSWVIGDRD TDKEFADNLG VNFLPISKTH TWEMVVSAII NDARKASVQR KTKETAIDLS
VQLDSDQTSV IDTPIPFFTH MLEQVAKHGG FDLRLQASGD LEVDEHHLIE DTAIALGEAI
RTALGDKWGI NRYGYTLPMD ESLACVAIDI SGRSFCDFKG QFTREFVGGM ATEMVPHFFQ
SLSSALGATI HIEVTGTNHH HMIEACFKVL GRALRQACSR TNNYLPSTKG VL
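
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction helper; the function names are hypothetical and the example uses only the first line of the gene sequence, so it does not reproduce the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed above.
first_line = "ATGAAAAAGA TTTTATTTAT TGACCGCGAT GGCACTTTAG TTGAAGAACC CTTTGATTTT"
seq = clean_sequence(first_line)

print(len(seq))   # 60
print(seq[:3])    # ATG, the start codon
print(round(gc_fraction(seq), 2))
```

Passing all lines of the gene sequence to `clean_sequence` as one string would give the complete coding sequence, on which `gc_fraction` can then be evaluated.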