Gene lpp0239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus taglpp0239 
Symbol 
ID3118370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLegionella pneumophila str. Paris 
KingdomBacteria 
Replicon accessionNC_006368 
Strand
Start bp262505 
End bp263653 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content43% 
IMG OID637578931 
Producthypothetical protein 
Protein accessionYP_122582 
Protein GI54296213 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA GCCTGATTTT TTTAGCAATA GGTATGTTTA CAGTAGGTTG CAATACCTTT 
CTGATTGCTG GTTTGCTTCC TCAAATAGGC GAAACGCTCG GGCAGCCGGT TGCAGTAACT
GGGCAAGGAG TGAGTCTATT CAGTTTGACT TATCTTCTCT CAGCGCCACT TTTTTCTCTG
ATTTTTGTTA ATCAGCCGGT AAAGCGTATG GTTCAGCTTG CGCTTACTGT CTTCATGTTT
GGCAATTTAA TAACGCTACT TTCTGAAAAT ATCGTGCTGT TTTTAATTGG AAGATCTCTG
GCGGGAGCAG GAACCGGGAT TTTTACGCCG TTATGTATCA GCATTGCCGT TCATTTTGCC
AGCCCATCTG CCAAAGGACG AATTTTAAGT TTTATCTGGA GTGCTAACAG TGCGGGTGTA
GTGTTTGGCG TTCCTGCCGG ACTTTACTTA TCCTCCTTGT TTCATTGGCA GTTATCGATT
GCCAGTCTTA TTATTTTAAG TTTGCTTGCA TTGATTGGTT TTTCAATGCA AAACATTGAT
ATAAAACTAC CCAAACCTTC GCCATTTGGA GGCAGGCTTC GTCTTCTGGT CGAGCCAAAA
ACGCTATCGG TAATTGGAAT TACTTGCTTT ACCGCCTTGG CAAGTTTGGG ACTATACTCG
TATGTCACCC TGATTCAATC AGGATCCCCT AATTCGCTCA GTATGACGCT ATTGAGTTGG
GGACTGGGAG GATTTATAGG AAGCTCACTG ATTGGGGTGT TTATCGATCG AACGGGTAAA
CCACGGGTTA TTATGGCCTT AATTTTGGTT GGCCTCATGT TTGCTCTGAT TGCCATACCA
TTCACCAGGA ATCTGCCTTA CCTGGGATTA ATCCCTTTTT TTATGTGGGG TGCTTGCGGA
TGGGCTATAG TGACTCCCCA GCAACACATT TTATATGAAT TACATGAAAA TCAGGGAATT
ATCCTTGCCG CCATCAATTC ATCGGCCTTG GGCTTGGGGT CAGCTTTGGG AACGTTGCTT
GGCGGCTTAT TGATTTCCTC TGGATTCAGG GGAATCTATC TTCCTTTTTC TGCTGCCACT
TTATTGTTTT TCGTATTGAT AATTCAGCTG ATAGTAATTA ACACTTCACA TAAGGTAAAT
AACATATGA
 
Protein sequence
MKKSLIFLAI GMFTVGCNTF LIAGLLPQIG ETLGQPVAVT GQGVSLFSLT YLLSAPLFSL 
IFVNQPVKRM VQLALTVFMF GNLITLLSEN IVLFLIGRSL AGAGTGIFTP LCISIAVHFA
SPSAKGRILS FIWSANSAGV VFGVPAGLYL SSLFHWQLSI ASLIILSLLA LIGFSMQNID
IKLPKPSPFG GRLRLLVEPK TLSVIGITCF TALASLGLYS YVTLIQSGSP NSLSMTLLSW
GLGGFIGSSL IGVFIDRTGK PRVIMALILV GLMFALIAIP FTRNLPYLGL IPFFMWGACG
WAIVTPQQHI LYELHENQGI ILAAINSSAL GLGSALGTLL GGLLISSGFR GIYLPFSAAT
LLFFVLIIQL IVINTSHKVN NI