Gene lpp1643 details


Gene Information

Locus tag: lpp1643
Symbol:
ID: 3118276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Legionella pneumophila str. Paris
Kingdom: Bacteria
Replicon accession: NC_006368
Strand:
Start bp: 1842391
End bp: 1844508
Gene Length: 2118 bp
Protein Length: 705 aa
Translation table: 11
GC content: 35%
IMG OID: 637580334
Product: hypothetical protein
Protein accession: YP_123961
Protein GI: 54297592
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
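
The length fields in this record can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1842391, 1844508)
print(length, protein_length_aa(length))  # 2118 705
```

Both values agree with the Gene Length and Protein Length fields of the record.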


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATATC ATTTTTTAGC AGAAGGAAAT GGGATTATTG CATGTTATGA CATGTACCCA 
ACTCAGTTTC ATTCTATTAA GGACATGATT AATTATTTGC CTGTCTTAAA GCAGATGGGG
TTTAATGCCT TATGGATTAA TCCCATGCAA ATGCCCGGTG ACATTAGCGG CTTTTTTAAA
ACGGATAAAA ATAATGGTGT AAAAACAGGA AATGAGGTCA CTAGAAGCTT ATATGCCATG
AGTCACCCGT TATTATTTAA TCCTCAATTC AGTTTGGACT CCCCTGAAGA CCCCATGGAG
ACAACCCAAC GATTAAACTC AGAAGCTTTA CAGTTATTCA CTCAAAATGC AAGGAACCTC
GGAATAGTTC CAATGTTTGA TCTGGTTTTA AATCATGTCG CCATTGATTC TCCCTTGTGT
CAGGATAAGC CTCATTGGTT CAAAGGAGTT CATGAAGACT TCAAAGATGT TCGCGGCTTC
AATTATGATG ATGAAAGCAT AAGAAAAGAA ATAATAACCG AATTTTGGCA ACCTTATATC
AAACGTTACA TGATCGAGTA CGGTTTTGAT GGTGTTCGAG TCGATGCTGT AGGTTATGTA
CATCCTGCAG TAAGAAGTGA AGTATATGCC TATATTCACT CTCTTGCTGC TGAATATGGT
AAACCAAATC CGGTTATCCT TGATGAAGCT TTATTTAGTA AACGACCACT TGCGGATGAA
GTAAATTATT TGAAATTACC CGGCATTGGC CCAACTCACA TCACTACAGA AGCTTATAAT
GTTGAATTTG ATCTTGCCAG CGCTGATCTG CCTTATGAAA TAACATTGGA AGAGCAGTTA
AAGGCGAGTG TAGTCTTCCA ACAAAAAGAT GGAACATGGC GTGAAAACAC GAAGGGGGGC
TGCATTAATT TTTGTGGGAA TCATGACTAT CGTTCATTGG CGATGACTAT TTTATTTCAA
ATGGCCAAAA AAAGACTGAA ATCTGACGCT TTTTATAATG ACTTAATATC TTCCTATCAA
GACTTGTACT TTAAGTTTAA CGCCGAGCAA GAAAAACCAC ATGAACTCAA GGAGCCTTTG
AAAACAACCT TACTTTACTC TTATGTTGAT CAAATCAAAA AGGAGTTGGA AGATAATAAA
GATGAAACAG CACAAGAATT TAAAACTCTG GTATTAGAAA AACTGGCTTT ATCTGCTCTT
ACAGCCAGTG GCGGATGGTT TTTATTATCT GGTGATGAAA CATGCGATAT CACTGCAAAA
ACAGTATTTC AGAGAAAAGG TGCTGTTGAT AAATCTTATT ACCCACAACG TGAACATCGT
ATTTTTAGTG AAAAACCAGT TATTGCTCAC AAAATTTTAG AAAAAATGGC AGAAGAGAAT
TTCTTCATAG AAAACTCGGA AAATAAGTGG ATGCAGGAAC TTTACCGAAG TTTATCAAGT
TTTCCTGAAA TGCAAAAACG TCTTTTAGTA TCTCATATTG ATAATATTAA ACACCAAATA
AATGCCGGCA TAGATCATGT ACAAGAAAAA TTTTCCAGGC TACTTGCCAG TGAAAGTTTA
AATATAGGTT TCACATCTCA AGATTTTTTA GAGAATCCAA GAACCCACGA AAATGGATGG
CTTGGATTAC AGGATAATTT TCTATTTATC AAACAATTAA ATAGATTATT AAAAACACTA
CCTGCACCAC TACCCGGATT CTCCAGTGAG TTGGTTCGAC TCCCTGAAAA ACCGCATCTC
ATTATTATCG TAAGAAAAAA TGGTAATGAA TTAAATGCTC CAATTGATGT TGTTGTTGTT
AACCTTGAAC AGGAAAAACG ACAAACATTA ACTAAAGAAG ATTTTCAATT AATTGCCAAA
AACTATTGTA ATCAATATTT TTCTAAAAAT AGACCGAGTG TTTGCGATAG TTTACGCGAA
AAATTATATT CCCATGTTTT AAAATCTGCT GTAAATAATC GAGTCCATAC AGACACCAGT
ATTAAACTGA ATGGAATCCA TTTATTTGAG TTTAAAGAAA CTTCGTTAAA CACTTTTTTT
ACAATTGAAA AAGAAAAAAA CTATTCTCAT GGAAATGAAG ACCATACTGC TTTGACTGGA
AATACAATCT TGCATTAG
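
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of how a GC percentage such as the GC content field could be computed from a block-formatted sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the database's own rounding rule is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGGGATATC ATTTTTTAGC AGAAGGAAAT GGGATTATTG CATGTTATGA CATGTACCCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives the value for the whole gene.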
 
Protein sequence
MGYHFLAEGN GIIACYDMYP TQFHSIKDMI NYLPVLKQMG FNALWINPMQ MPGDISGFFK 
TDKNNGVKTG NEVTRSLYAM SHPLLFNPQF SLDSPEDPME TTQRLNSEAL QLFTQNARNL
GIVPMFDLVL NHVAIDSPLC QDKPHWFKGV HEDFKDVRGF NYDDESIRKE IITEFWQPYI
KRYMIEYGFD GVRVDAVGYV HPAVRSEVYA YIHSLAAEYG KPNPVILDEA LFSKRPLADE
VNYLKLPGIG PTHITTEAYN VEFDLASADL PYEITLEEQL KASVVFQQKD GTWRENTKGG
CINFCGNHDY RSLAMTILFQ MAKKRLKSDA FYNDLISSYQ DLYFKFNAEQ EKPHELKEPL
KTTLLYSYVD QIKKELEDNK DETAQEFKTL VLEKLALSAL TASGGWFLLS GDETCDITAK
TVFQRKGAVD KSYYPQREHR IFSEKPVIAH KILEKMAEEN FFIENSENKW MQELYRSLSS
FPEMQKRLLV SHIDNIKHQI NAGIDHVQEK FSRLLASESL NIGFTSQDFL ENPRTHENGW
LGLQDNFLFI KQLNRLLKTL PAPLPGFSSE LVRLPEKPHL IIIVRKNGNE LNAPIDVVVV
NLEQEKRQTL TKEDFQLIAK NYCNQYFSKN RPSVCDSLRE KLYSHVLKSA VNNRVHTDTS
IKLNGIHLFE FKETSLNTFF TIEKEKNYSH GNEDHTALTG NTILH
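
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation in plain Python follows. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The `translate` function is illustrative, and ambiguous bases such as N would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGGATATC ATTTTTTAGC AGAAGGAAAT"))  # MGYHFLAEGN
```

The printed residues match the first block of the protein sequence. Translating the final 18 bases, `AATACAATCT TGCATTAG`, gives `NTILH`, which matches its last block.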