Gene lpp1248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpp1248 
SymbolhmgA 
ID3117133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Paris 
KingdomBacteria 
Replicon accessionNC_006368 
Strand
Start bp1385487 
End bp1386737 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content40% 
IMG OID637579942 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_123572 
Protein GI54297203 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATTTGC AAGGATTTGG TAATTATCAC CACAGCGAGG CTGTTAAAGG AGCATTACCC 
ACAAATCAAA ACTCACCGCA GCACTGTAGC TTAGGACTTT ACGCAGAGCA ATTGAGTGGA
ACCTCGTTCA CCCGTCCCCG GCATAATAAT CTTCGAAGTT GGCTATATAG AATACTTCCT
ACTGTTACCC AGGGAACGTA TTACCCCTAT GAGTTTAATG TTATGCAACC TTTTGTTGAT
GAGTTGTCAC CCAATGCCAT GCGTTGGTCA CCTCTTTATA ACAGCTCTCA AATTAAATGT
GATTTTGTTG AAGGACTATT TCATATTGCC GGTAGCCCGT TAGTTAATAC CTATACTTAT
TATTGCAACC ACTCCATGAG CGATAAATAT TTCGCCAATA ATGATGGTGA GTTATTATTT
GTTCCCTATG CAGGCGAGAT TCATCTGCAT ACTGAATTTG GCAAATTAAT ACTATCTTCC
GGATCGATCG CAGTGATACC TCGTGGCGTT AAATTTAAAG TGGAAGTAAT CAGCAAGGAG
GCAAAAGGTT ATCTTTGTGA AAATAGCGGA AATCCCTTAA CCTTACCTCA GTTAGGCCCC
ATTGGAGCCA ATGGTTTGGC AAACCCAAGA CATTTTCAAT ATCCAGTAGC CGCATTTGAA
AACTCTGGTG GCGAGCATAC TATAATCTGT AAAAACCAGA AAAAATTATG GTTTACTGTA
TGCAACCACT CTCCTTTAAA TGTCGTCGCC TGGCATGGCA ATTATGCACC ATATTGTTAT
GATCTCAGTT TGTTCAATAC AATTAACACA GTCAGTTTTG ATCACCCTGA TCCTTCCATA
TTCACTGTAT TAACTTCAGA AAGCGAAATA CCCGGTGTTT CTAACTTGGA CTTTGTTATT
TTCCCACCTC GCTGGATGGT TGCCGAACAT ACTTTTAGAC CGCCCTATTT TCATAGAAAC
TACATGAATG AACTGATGGG ACTTGTCTAT GGTGAATATG ATGCCAAGAA GGAAGGATTC
ATACCGGGTG GTATCAGCAT CCATAATTGC ATGACTCCAC ACGGACCTGA TTATGAATCT
TACGAAATTG CAGCGTCGCA GGATCTAAAA CCAAATTATA TCAACTCCCT CGCCTTTATG
TTTGAAACCA AAGACTACTG GCAAGTAACT GAGCAAGCTT ATCGGCATCC CAGCAGACAA
ATGGATTACC TTAATTGTTG GCAAGGCTTT AAAATAGAGT TTAGTCAATA A
 
Protein sequence
MYLQGFGNYH HSEAVKGALP TNQNSPQHCS LGLYAEQLSG TSFTRPRHNN LRSWLYRILP 
TVTQGTYYPY EFNVMQPFVD ELSPNAMRWS PLYNSSQIKC DFVEGLFHIA GSPLVNTYTY
YCNHSMSDKY FANNDGELLF VPYAGEIHLH TEFGKLILSS GSIAVIPRGV KFKVEVISKE
AKGYLCENSG NPLTLPQLGP IGANGLANPR HFQYPVAAFE NSGGEHTIIC KNQKKLWFTV
CNHSPLNVVA WHGNYAPYCY DLSLFNTINT VSFDHPDPSI FTVLTSESEI PGVSNLDFVI
FPPRWMVAEH TFRPPYFHRN YMNELMGLVY GEYDAKKEGF IPGGISIHNC MTPHGPDYES
YEIAASQDLK PNYINSLAFM FETKDYWQVT EQAYRHPSRQ MDYLNCWQGF KIEFSQ