Gene lpp1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpp1020 
Symbollig 
ID3117317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Paris 
KingdomBacteria 
Replicon accessionNC_006368 
Strand
Start bp1136515 
End bp1138536 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content41% 
IMG OID637579715 
ProductDNA ligase 
Protein accessionYP_123348 
Protein GI54296979 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGATC AAGGAATTAA GGAATCGATA GAAACGCTTA AAGAGCAAAT AAGAAAATAC 
GATTATCACT ATTATGTTTT AGATGAACCT TTGGTTCCTG ACGCGGAATA TGATCGATGT
TTCAAGGCAT TGCAACAGTA TGAAGAGCAA TATCCGCAAT TTTTATCGCC AGATTCCCCT
ACACAGAGAG TGAGCGGTAC TCCTTCAGAT GCTTTTATGC CGGTAGCCCA TAAGCAACCC
ATGTTGTCTT TATCCAATGT GTTTACTATC GATGAATTAA AAGCATTCAT TAAACGAGCA
ATTGAGAAAC TGGATGAACC AAATCAACAA CTGGTATTTG CTTGCGAACC AAAGCTTGAT
GGGTTGGCTG TTAACATGAC TTATGAGGGC GGGATCTTGA CTCATGCCGC AACTCGTGGC
GATGGTGCTG TAGGAGAAAA CATCACGGCA AATATTAAGA CTATTGCTTC AGTTCCATTA
AGGCTAAGGG TTAGTAACCC TCCAAAATTG ATCGAAGTGC GGGGTGAAGT CTATATCCCC
AAAGCCGATT TTGAAGCTTA CAACGCAAGG GCTAGAGAAC TCGGTGAAAA AACTTTTGCT
AATCCGCGAA ATGCTGCTGC AGGCAGTTTA AGACAATTAA ATCCTGAAAT TTCTGCCAGT
CGTCCACTTG CTATTTATTG TTATAGTATA GGGGCTTGCG AGGATTATAA GTTACCTAAC
AGTCATTTGG AGCAATTGAA TTTATTAAAA GAGTTTGGAT TTAGAGTGTC TCCAGAAACG
AGGAGGGCGA TTGGAGTAGA AGGCTGTTTA GATTATTACC AGTATATGTT AGCGAAACGG
AATCAATTGC CATTTGAAAT CGATGGGGTT GTTTATAAGA TTGACAGTAT CTCCTTGCAA
CAGCAATTAG GTTATGTTTC TCGTGCCCCA AGATTTGCTT GTGCCCATAA ATTTCCCGCT
ACAGAAGAAA TGACTCGTCT GATAGCCGTG GATTTCCAGG TAGGTAGAAC GGGTGCTGTG
ACGCCGGTTG CACGTTTGGA GCCAGTTAGT GTTGGTGGTG TTACAGTAAG TAACGCGACT
TTGCATAATT TTGATGAAAT TACACGAAAA GACATTCGTA TTGGTGATAC GGTTATTATT
CGTCGTGCCG GTGATGTGAT CCCTGAAGTA GTTTCTGTGA TTTTGGAAAA GCGTCCCATT
AATGCCAGAA AGATTGAGCT TCCTAAAAAT TGCCCTGTTT GTGGTTCTGA AGTCGTAAGG
GAAGCGGATG AAGCAATTGC TCGGTGTATC GGCGGTTTAT ATTGTAAAGC ACAATTAAAA
AGGATGATGT GGCATTTTGC TTCTCGAAAA GCGATGTATA TTGAAGGACT TGGTAGTGTT
TTAATTGATC AGTTAGTTGA TGAGGGTATT GTCCATCATT TGGCGGATCT TTATGAACTC
GATTTGCAGA CTTTAGCTAA CCTGCCAAGG ATGGGGGAGA AATCTGCAAA AAACTTATTA
TCCGCTTTGG AAAAAAGTAA AAAAACGACT TTCAATCGCT TTCTTTATGC TTTGGGGATC
AGAGAAATCG GTGAAGCTGG CGCAAGGGTT TTAGCGGAGC ACTACTGTGA TGTAGAGAGC
TTGAAATCAG CAACGATTGA GGAATTAATG ACTCTGAATG ACATAGGTCC AGTAGCGGCT
TCTCATGTAG TCCATTTCTT TGCTCAAGCG CATAATCTTG AAGTGATTGA CCGTCTTCTC
GAGTTGGGTA TTCATTGGCC TAAGCCCGAA AAAATACAGG TTAATCAGCA AAATCCATTT
TTTGGTAAAA CAGTAGTTTT AACTGGAACT CTGAGTGCCA TGGGGAGGGA AGAGGCAAAG
GCAAAATTAT TAGCCTTAGG TGCAAAAGTG AGTGGAAGTG TGTCTTCCAA AACGGATTAT
GTAATAGCAG GAAGTGAAGC CGGTTCAAAG CTGATTAAAG CGACAGAACT GGGAGTAGCG
ATTATAGAGG AAGACGAGTT TTTAAAATGG GTTAATTCAT GA
 
Protein sequence
MNDQGIKESI ETLKEQIRKY DYHYYVLDEP LVPDAEYDRC FKALQQYEEQ YPQFLSPDSP 
TQRVSGTPSD AFMPVAHKQP MLSLSNVFTI DELKAFIKRA IEKLDEPNQQ LVFACEPKLD
GLAVNMTYEG GILTHAATRG DGAVGENITA NIKTIASVPL RLRVSNPPKL IEVRGEVYIP
KADFEAYNAR ARELGEKTFA NPRNAAAGSL RQLNPEISAS RPLAIYCYSI GACEDYKLPN
SHLEQLNLLK EFGFRVSPET RRAIGVEGCL DYYQYMLAKR NQLPFEIDGV VYKIDSISLQ
QQLGYVSRAP RFACAHKFPA TEEMTRLIAV DFQVGRTGAV TPVARLEPVS VGGVTVSNAT
LHNFDEITRK DIRIGDTVII RRAGDVIPEV VSVILEKRPI NARKIELPKN CPVCGSEVVR
EADEAIARCI GGLYCKAQLK RMMWHFASRK AMYIEGLGSV LIDQLVDEGI VHHLADLYEL
DLQTLANLPR MGEKSAKNLL SALEKSKKTT FNRFLYALGI REIGEAGARV LAEHYCDVES
LKSATIEELM TLNDIGPVAA SHVVHFFAQA HNLEVIDRLL ELGIHWPKPE KIQVNQQNPF
FGKTVVLTGT LSAMGREEAK AKLLALGAKV SGSVSSKTDY VIAGSEAGSK LIKATELGVA
IIEEDEFLKW VNS