Gene lpl2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpl2004 
Symbol 
ID3113969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Lens 
KingdomBacteria 
Replicon accessionNC_006369 
Strand
Start bp2246535 
End bp2247872 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content41% 
IMG OID637583775 
Producthypothetical protein 
Protein accessionYP_127340 
Protein GI54294925 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAAT GGTCGCCAAC TACATGGCAG AAATATTCGT ATTTGCAGGC AGCTTCATAC 
GCTGATGAAG AGCAATTAAA TAAAGTGGTA GAACAATTAA GTTTGCTGCC CCCATTAGTT
ACCAGCGGTG AAATCAAACA TTTAAAAAAT GAAATTGCCC AGGCTGGCCG GGGTAATGCT
TTTATTCTGC AGGGTGGTGA TTGTGCTGAA TCTTTTAATG ACTGTCGTTC AGAAGTCATA
AGTAACAAAT TAAAAATCAT ACTTCAAATG AGCCTAATCT TATTATATGG CTTGCGTAAA
CCAATTATTC GTATCGGAAG AATAGCTGGG CAATATGCAA AACCACGCTC CTCAGATTAC
GAAACCATAA ATGGAGTTAC TTTACCCAGT TATCGTGGAG ATATCGTTAA TTCACCTGAA
TTTACTGCCA CTGCCAGGGA GCCTAATCCT AAATTATTAC TTCAGGCATA CAGTTGTTCT
GCCATGACTT TAAATTTTAT CCGAGCGCTG CTAGATGGCG GTTTTGCTGA TCTGCATCAC
CCTCAACGCT GGGATTTAGG TTTTGTTGAG CACTCCCCTC AGAAGAACGA ATACCAACAT
ATCGTAGACT CTATTGAAGA CGCCCTGGAT TTTTTGAATT CCATAGATGG AATACGTTCG
AGTAGCATAA GTAAAGTTGA TTTTTACACC TCTCATGAAG CGTTGCATTT ACATTATGAG
CAAGCATTAA CAAGACAGTT GAAAGATGGA AAATGGTACA ATCTTTCAAC TCATTTACCC
TGGATTGGAA TGCGTACGGC ACAAACGGAC AGTGCACATC TTGAGTTTCT ACGAGGGGTA
CAAAATCCAA TAGGCATTAA GATAGGCCCA GCAGCTACAC CTGAATGGTT ATCGGAGGTA
TTAAGCATAG CCAATCCGCA AAAAGAAGAA GGACGAGTTT TACTTTATAC TCGCCTGGGA
GCAAAACTTA TCGACCGGTT GTTACCTCCA CTGATTGACA CAGTAAGGAA AAGCAAAGTT
CCAGTCACGT GGTCATGTGA CCCTATGCAT GGCAATACCG AAACAACAGA AGACGGTACT
AAAACACGTC ACTTCGATAA CATTTTATCG GAATTAAAAC AAGCTTTGGA AATTCATCGC
AGCATGGGTA GCTACCTCGG AGGTGTCCAT TTTGAGCTAA CTGGTGACAA TGTAACAGAG
TGTATCGGGG GCGCTCGTGG ATTAGCTCCT CATGACCTCA AAACTGCCTA TCACAGCCTG
GTTGATCCAA GATTAAACTA TGAACAATCT CTGGAAATGG CTATTCAGCT AAGCCATCAA
TTCAGAAATG AATCTTAA
 
Protein sequence
MQEWSPTTWQ KYSYLQAASY ADEEQLNKVV EQLSLLPPLV TSGEIKHLKN EIAQAGRGNA 
FILQGGDCAE SFNDCRSEVI SNKLKIILQM SLILLYGLRK PIIRIGRIAG QYAKPRSSDY
ETINGVTLPS YRGDIVNSPE FTATAREPNP KLLLQAYSCS AMTLNFIRAL LDGGFADLHH
PQRWDLGFVE HSPQKNEYQH IVDSIEDALD FLNSIDGIRS SSISKVDFYT SHEALHLHYE
QALTRQLKDG KWYNLSTHLP WIGMRTAQTD SAHLEFLRGV QNPIGIKIGP AATPEWLSEV
LSIANPQKEE GRVLLYTRLG AKLIDRLLPP LIDTVRKSKV PVTWSCDPMH GNTETTEDGT
KTRHFDNILS ELKQALEIHR SMGSYLGGVH FELTGDNVTE CIGGARGLAP HDLKTAYHSL
VDPRLNYEQS LEMAIQLSHQ FRNES