Gene lpl1248 details

Gene Information

Locus tag: lpl1248
Symbol: hmgA
ID: 3114748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Legionella pneumophila str. Lens
Kingdom: Bacteria
Replicon accession: NC_006369
Strand:
Start bp: 1391232
End bp: 1392482
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 40%
IMG OID: 637583021
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_126599
Protein GI: 54294184
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
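
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the final codon is a stop codon that does not contribute a residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1391232
end_bp = 1392482

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon
# and therefore adds no amino acid to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1251
print(protein_length_aa)  # 416
```

Both values agree with the Gene Length and Protein Length fields of the record.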


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTATTTGC AAGGATTTGG TAATTATCAC CACAGCGAGG CTGTTAAAGG AGCATTACCC 
CCAAATCAAA ACTCACCTCA ACACTGTAGC TTAGGACTTT ACGCAGAGCA ATTGAGTGGA
ACCTCGTTCA CCCGTCCCCG ACATAATAAT CTTCGAAGTT GGCTATATAG AATACTTCCT
ACTGTTACCC AGGGCACGTA TTACCCCTAT GAGTTTAATA TTATGCAACC TTTAGTTGAT
GAGTTGTCAC CCAATGCCAT GCGTTGGTCA CCTCTTTATA ACAGCTCTCA AATTAAATGT
GATTTTGTTG AAGGACTATT TCATATTGCC GGTAGCCCGT TAGTTAATGC CTATACTTAT
TATTGCAACC ACTCCATGAG CGATAAATAT TTCGCCAATA ATGATGGTGA GTTATTATTT
GTTCCCTATA CAGGCGAGAT TCATCTGCAT ACTGAATTTG GCAAATTAAT GCTCTCTTCT
GGATCGATCG CAGTGATACC TCGTGGCGTT AAATTTAAAG TGGAAGTAAT CAGCAAGGAG
GCAAAAGGTT ATCTTTGTGA AAATAGCGGA AATCCCTTAA CCTTACCTCA GTTAGGCCCC
ATTGGAGCCA ATGGTTTGGC AAACCCAAGA CATTTTCAAT ATCCAGTAGC CGCATTTGAA
AACTCTGTAG GCGAGCATAC TATAATCTGT AAAAACCAGA AAAAATTATG GTTTACTGTA
TGCAACCACT CTCCTTTAAA TGTCGTCGCC TGGCATGGCA ATTATGCACC ATATTGTTAT
GATCTCAGTT TGTTCAATAC AATTAACACA GTCAGTTTTG ATCATCCTGA TCCTTCCATA
TTCACTGTAT TAACTTCAGA AAGCGAAATA CCCGGTGTTT CTAACTTGGA CTTTGTTATT
TTCCCACCTC GCTGGATGGT TGCCGAACAT ACTTTTAGAC CGCCTTATTT TCATAGAAAC
TACATGAATG AACTGATGGG ACTTGTCTAT GGTGAATATG ACGCCAAGAA GGAAGGATTC
ATACCGGGCG GTATCAGCAT CCATAATTGC ATGACTCCAC ACGGACCTGA TTATGAATCT
TATGAAATTG CAGCGTCGCA GGATCTAAAA CCAAATTATA TCAACTCCCT GGCCTTTATG
TTTGAAACCA AAGACTACTG GCAAGTAACT GAGCAAGCTT ATCGACATCC CAGCAGACAA
ATAGATTATC TTAATTGTTG GCAAGGCTTT AAAATAGAGT TTAGTCAATA A
 
Protein sequence
MYLQGFGNYH HSEAVKGALP PNQNSPQHCS LGLYAEQLSG TSFTRPRHNN LRSWLYRILP 
TVTQGTYYPY EFNIMQPLVD ELSPNAMRWS PLYNSSQIKC DFVEGLFHIA GSPLVNAYTY
YCNHSMSDKY FANNDGELLF VPYTGEIHLH TEFGKLMLSS GSIAVIPRGV KFKVEVISKE
AKGYLCENSG NPLTLPQLGP IGANGLANPR HFQYPVAAFE NSVGEHTIIC KNQKKLWFTV
CNHSPLNVVA WHGNYAPYCY DLSLFNTINT VSFDHPDPSI FTVLTSESEI PGVSNLDFVI
FPPRWMVAEH TFRPPYFHRN YMNELMGLVY GEYDAKKEGF IPGGISIHNC MTPHGPDYES
YEIAASQDLK PNYINSLAFM FETKDYWQVT EQAYRHPSRQ IDYLNCWQGF KIEFSQ
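
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where GTG is one of several alternative initiation codons read as methionine. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical helpers introduced for illustration, and only the first 60 bases of the gene are translated.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initiation codon as 'M'.

    Whitespace is ignored, translation stops at the first stop codon,
    and any trailing partial codon is dropped.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, copied from the record.
first_line = "GTGTATTTGC AAGGATTTGG TAATTATCAC CACAGCGAGG CTGTTAAAGG AGCATTACCC"
print(translate_cds(first_line))  # MYLQGFGNYHHSEAVKGALP
```

The output matches the first 20 residues of the protein sequence listed above. The same function can be applied to the full gene sequence once its lines are joined.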