Gene Lcho_2049 details

Gene Information

Locus tag: Lcho_2049
Symbol:
ID: 6163495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 2227292
End bp: 2228863
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 72%
IMG OID: 641664818
Product: RND family efflux transporter MFP subunit
Protein accession: YP_001791081
Protein GI: 171058732
COG category: [M] Cell wall/membrane/envelope biogenesis
              [S] Function unknown
COG ID: [COG0845] Membrane-fusion protein
        [COG5569] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
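
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the page does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 2227292
end_bp = 2228863
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match gene length
print(gene_length_bp % 3 == 0)                   # whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length matches
```

Under these assumptions all three checks agree with the listed values: 1572 bp is 524 codons, or 523 residues plus one stop codon.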


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.00264035
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACACCC GACTCAAATC TCTCGTCGCA GCCCTCATGC TGCTCGGCCT CGGCGGCGCG 
GCCGGCTGGG GCATCAGCCA GTGGCGCACC GGTGCATCGG CACAGACCGC CGCGTCCGAC
ACGGCGGCAC CGACCGCCGA GCGCAAGGTG CTGTATTGGT ACGACCCGAT GGTGCCCACG
CAGAAGTTCG ACCAGCCCGG CAAGTCGCCG TTCATGGACA TGCAGCTGGT GCCGCGTTAC
GCCGACGAGG CCGACGCCGG TGCCGACAGC GGCGCCGCAC TCGCCGTCTC GACCCAGGCC
CGGCAGGCGC TCGGCCTGCG GCTGGCGACG GTCGAAAAGC GCCCGATCGG CGCCGCCATC
GAGGTGGTCG GCACGGTGCA GCTCAACGAG CGCGACATCA GCATCGTGCA GGCACGCACC
GCCGGTTTCG TCGAACGGGT CTACGCCCGC GCACCCGGCG ACGTGATTGC AGCGGGTGCG
CCGCTGGTCG ATCTGCTGAA CCCCGAGTGG CTCGGCGCGC AGCACGAATA CCTGGCCGTC
AAGGCCACCG GCGACGCCGC GATGACGCAG GCGGCGCGCG CCCGGCTGGT GCTGCTGGGC
ATGCCGGCGG CGCTGATCGA GCAGGTCGAG CGCAGCGGCC AGCCGGCGGC GGTGCAGACC
CTCGTCGCGC CGGGCGGCGG TGTCATCAGC GAGCTGATGG TGCGCCAGGG CATGACGGTG
TCGGCCGGCA TGACGCTGGC GCGCATCAAC GGCCTGGCCA CGGTCTGGCT GGAGGCGGCC
TTGCCCGAGG CGCAGGCGGC GACGATCCGC ACCGGCCAGG CGGTCGAGGT GCGTTTCCCG
GCGCTGCCGG GCGAGGTGGT GCGCGGCAAG GTCGCGGCGG TGCTGCCCGA GGCAGACCGC
GAAACACGCA CGCTGCGCTT GCGCATCGAG CTGCCGAATC CGAAGCAGCG CCTGCGCGCC
GGCCTGTTCG CGCAGGTCAG CCTGCGCGCC GCACAAGGCG AGGCGCTGAT GGTGCCGGCC
GAGGCGGTGA TCCGCACCGG CCGGCGCGCG CTGGTCTACC TGTCGGAGCA GCCGGGGCAC
TTCCGCCCGG TCGAGGTCGA GATCGGCGAG CAGTTCGATG AATACATCGT CGTGCGCAGC
GGCCTCGCCG CCGGGCAGCA GGTGGTGGCG TCAGGGCAGT TCCTGGTCGA CTCGGAAGCC
AGCTTGCAGG GCGTGATGGC GCGCAGCGCG CCACCTGCGG CCAGCGCCGC CGTCACGCCG
GTGGCAGAGG CCCCGGTGTA CCGCACGACC GGCGTCGTCG TCGAGACCGA CTCGGGCTCG
ATCACGCTCG AACACGCGCC GGTGCCGGCG CTGAAATGGC CGGGCATGAC GATGCCGTTC
AAGGTCACCG ACCCGAAGCT GCTGAAGGGG CTCAAGCCCG GCCAGGCGGT ACGTTTCGGC
TTCGACCAGC ACGGCGACGA CTACCGCCTG ACCGAGATCG CACCGGCCAC GGCCGCATCC
GGCAGCGTCG ACGCCGACCC GCACGCCGGC CATCGCGCCC CGGCCGTTGC CGCCTCGGGA
GCACAACGAT GA
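
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. A minimal sketch of that clean-up, together with a GC fraction calculation, follows. It assumes the GC content field is the share of G and C bases over the whole gene sequence, which this page does not state; `clean_sequence` and `gc_fraction` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here as a small example.
first_line = "ATGAACACCC GACTCAAATC TCTCGTCGCA GCCCTCATGC TGCTCGGCCT CGGCGGCGCG"
seq = clean_sequence(first_line)

print(len(seq))                       # number of bases on that line
print(round(100 * gc_fraction(seq)))  # GC percentage for that line only
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give a figure to compare with the GC content field.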
 
Protein sequence
MNTRLKSLVA ALMLLGLGGA AGWGISQWRT GASAQTAASD TAAPTAERKV LYWYDPMVPT 
QKFDQPGKSP FMDMQLVPRY ADEADAGADS GAALAVSTQA RQALGLRLAT VEKRPIGAAI
EVVGTVQLNE RDISIVQART AGFVERVYAR APGDVIAAGA PLVDLLNPEW LGAQHEYLAV
KATGDAAMTQ AARARLVLLG MPAALIEQVE RSGQPAAVQT LVAPGGGVIS ELMVRQGMTV
SAGMTLARIN GLATVWLEAA LPEAQAATIR TGQAVEVRFP ALPGEVVRGK VAAVLPEADR
ETRTLRLRIE LPNPKQRLRA GLFAQVSLRA AQGEALMVPA EAVIRTGRRA LVYLSEQPGH
FRPVEVEIGE QFDEYIVVRS GLAAGQQVVA SGQFLVDSEA SLQGVMARSA PPAASAAVTP
VAEAPVYRTT GVVVETDSGS ITLEHAPVPA LKWPGMTMPF KVTDPKLLKG LKPGQAVRFG
FDQHGDDYRL TEIAPATAAS GSVDADPHAG HRAPAVAASG AQR
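
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is a stdlib-only illustration, not the tool used to generate this page. It builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids as the standard code and differs in which start codons are allowed, which does not matter here because the gene starts with ATG. The `translate` function and the constant names are hypothetical.

```python
from itertools import product

# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA up to the first stop codon.

    Whitespace is ignored. Only A, C, G and T are supported; any other
    symbol raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last two blocks of the gene sequence.
print(translate("ATGAACACCC GACTCAAATC TCTCGTCGCA"))  # MNTRLKSLVA
print(translate("GCACAACGAT GA"))                     # AQR
```

The two outputs match the first ten and last three residues of the protein sequence listed above.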