Gene Acid345_1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1068 
Symbol 
ID4068717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1340088 
End bp1343267 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content57% 
IMG OID637983076 
Producthydrophobe/amphiphile efflux-1 HAE1 
Protein accessionYP_590145 
Protein GI94968097 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.186507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGACT TCTTTATCAA GCGCCCCATT TTCGCGACCG TATGTGCGTT GCTGATTATC 
CTGGCGGGCG CGGTTTGCAT CCCGACGTTG CCGATCTCGC TGTATCCCGA GCTTGCTCCC
CCACAGGTTA CTGTGACGAG CAATTACGTC GGCGCGAACG CTCAAGTGGT GGAGTCTGCG
GTCACCACTC CGCTTGAACA GCAGATCAAC GGTGTGGAGG GCATGCACTA CATCTCGTCC
ACCAGTTCCA ACGACGGCAC CAGCAGCATC AACGTCACGT TTCGTACCGG GTACGACTTA
AACATCGCCG CCGTCGACGT GCAGAACCGC GTCGCGGCTG CGCAAGGACG GCTCCCGCAG
GAAATCAAAA ACACTGGCGT TACCATCACG AAGGCGAATC CGAACTTCGT TTTTGCCGCC
GGTTTTTACT CGCCCGACAA CACTCTTTCC AACCAATACA TTTCGAACTA TCTCGACGTT
TACGTGAAAG ATGCGCTCAA GCGAATCCAG GGCGTGGGCG ATGTAGTGAT CTTCGGCGAA
CGCAAGTACG CCATGCGCAT CTGGCTCGAT CCCACGAAGC TCGCCGCGCG CCAACTCACG
GCTGCTGACG TCGTAGCGGC GCTCCAGGAG CAGAACGTCG AAATTCCAGC TGGACAACTC
GGGCGTCCGC CGGCAGATCC CAAGCAGAAC TTCCAGGTCA CGCTACGCGC TGTTGGACGG
CTCTCTGACC CGCGAGAGTT CGAAGACATC ATCCTCAAGA GCACCAAGAA CGGAATCGTT
CAACTCAAGG ACGTTGGCCA TGCGGAAGTC GGCGCCGAGA ATTACGATAC CAACCTGCTC
TACAGCGGAC ACGAGGCGAT CGGTATCGGT GTACAGCAGC TATCGAACGC GAATGCGCTT
GAAGTCGACA AAGCGGCCAA AGCCGAACTG GCTGAGCTCT CCAAGTCTTT TCCTCCCGGC
ATCAAGTACG TGGTTGCCTT TGACACGACG ACCGTGGTTG GCGATTCCGT AAAAGAGGTC
ATCACCACAC TCGAAGAAGC AGTGTTGATC GTCATCATCG TGATCTTCCT CTTCCTTCTC
GATTGGCGCG CAACGATCAT TCCGGCGGTC ACGATCCCGG TTTCCTTGAT TGGCACATTT
GCCTTCATCA AGATGTTCGG GTTCTCGATC AACTCGCTCA CGTTGTTCGG CATTACTTTG
GCGACGGGGT TGGTGGTCGA CGACGCTATC GTCGTCATCG AAAACGCCCA GCGGCACATC
AACCAAGACC ATACCGATCC ACATCGCGCA ACCTCGGTCG CTATGGCGGA GGTATCCAGC
GCCGTAGTTG CAACATCGCT GGTTCTGATC TCAGTCTTTG TCCCGGTATC GTTTTTCCCC
GGCACCACCG GTATTCTCTA CCGCCAGTTC TCGCTGACTA TCGCTTTCTC AATCGCGATA
TCCCTATTCA ACGCGCTCAC GCTCTCGCCG GCATTGGCGG CCATCTTGTT GCGCAGCGAA
GAGAAGAAGT ACGGCATGCT CGATTGGACG CGGAGCAAAA CCATCTCGGG TGGCTATCGC
AAAATCGCGC ACGGGATTGA CGACGCCATC CACGGCCTGG GTGCGTGGTA TGGAAAAGTT
ATTGGAACAG TCTTGCGCTT GCGCTACGTG ATGCTGGCAC TCTTTGTGGC TGGTCTCGTG
GCCACCGGAT ACATGTACGT CCACGTGCCT ACGGGTTTCG TTCCGCAGGA AGACCAGAAC
TATTTCATTA CCGTCGTGCA GGCTCCGCAG GGTGCCTCGC TCGCGTACAC GACGTCGATC
GCCAAGCAGG CAGAGCAGAT CCTGCGCGCC GATCCAGATG TCTTTGGCAC CTTCGCGGTA
CCTGGTTTCT CTCTGACCGG AGGTAGTTCG TCCAACTACG GTCTGATCTT TGCGCCTCTG
AAGCCGATCG ACGAGCGCAA GGGCAAGGGC CATGCTGCAA GCGACATCGT GGCGAGGATA
GGACCAAAGC TTTTCTCGGT TCCGGGCGCG ATTATCGTTC CCTTCGAACC TCCCGCCATC
CAAGGTATCG GTAGCTTTGG CGGATTCCAG TTCCAATTGC AGGACCTCGG CCGCAACACG
CTGCAGGATC TCGACGTTGT CGCGCACAAG ATCATCGGGG CTAGTCGTCA GCGGCAGGAC
CTGGCGGGAT TGTTCACGAG CTATACGGCG AACGATCCCC AGGAACTCGT TGAGATCGAT
CGCCAGAAGG CAAAGGCCAT GGGCGTGCCG ATCAGCCAGA TCACCCAGGC TCTGGGCGTG
TATATGGGGT CGGAGTATGT GAACGACTTC GATTTCAACA ATCGGTCCTA CCGCGTTTAC
GTCCAGGCCG ATCAACCGTT CCGCATGACG GCACGCGACA TTCGTCAATA TTATGTGCGC
TCTGACACCA ACGGTCTCGT ACCTCTTGAA AACATCGTCA CGCTGAAAGA AACCTCCGGC
CCGCAGGTCA TCAATCACTT CAACCTGTTC CGGTCCGCCG AAATCGATGG CGCTCCGGCT
CCGGGCTTTA GCTCCAGCCA CGGACTGGAA GCAATGCAGG AACTCGCGCA TCAGAACATG
ATCCAGGGCA TGTCATTCCA ATGGAGCGGC CTTGCGCTGG AAGAAGTCGA AGCTGGCGGC
AAAGCGATCC TCATCTTCGC GCTCGGCATT CTCGTGGTGT ATCTCACGCT TTCCGCGCAA
TACGAGAGCT TCGCGTTGCC ATTCATCATT CTGCTGGCGG TTCCGACTGC TGTGCTGGGG
GCGTTGTCGT TTATCTCTTT ACGCGGATTG GTGAACGATG TCTACGTACA AATCGGGCTG
GTGATGCTCA TTGGCCTTTC GGCGAAAAAT TCAATTTTGA TCGTCGAGTT TGCCGAGCAG
CTCATCGAGC AAGGTCGCTC GATCACCGAA GCCGCGATCG AGGCCGGCGA ATTGCGCCTG
CGCCCAATTC TGATGACGTC GTTCGCGTTC ATCCTCGGAG TTCTCCCGCT CTACTTCGCC
ACCGGTGCCG GGAAAATCGG TCGTCACTCG GTGGGCACAG CGATCGTGGG CGGCATGCTC
TTCTCGACAG TACTGAACTT GATCTTCATC CCAGTGCTGT ACGTCATCGT GAAGACACTC
TTAGGAGCAC GCACGTCCGG AGTAGCTCTC GAAGCAGAGG AAGTGGAAGA GGCGGCTTAG
 
Protein sequence
MVDFFIKRPI FATVCALLII LAGAVCIPTL PISLYPELAP PQVTVTSNYV GANAQVVESA 
VTTPLEQQIN GVEGMHYISS TSSNDGTSSI NVTFRTGYDL NIAAVDVQNR VAAAQGRLPQ
EIKNTGVTIT KANPNFVFAA GFYSPDNTLS NQYISNYLDV YVKDALKRIQ GVGDVVIFGE
RKYAMRIWLD PTKLAARQLT AADVVAALQE QNVEIPAGQL GRPPADPKQN FQVTLRAVGR
LSDPREFEDI ILKSTKNGIV QLKDVGHAEV GAENYDTNLL YSGHEAIGIG VQQLSNANAL
EVDKAAKAEL AELSKSFPPG IKYVVAFDTT TVVGDSVKEV ITTLEEAVLI VIIVIFLFLL
DWRATIIPAV TIPVSLIGTF AFIKMFGFSI NSLTLFGITL ATGLVVDDAI VVIENAQRHI
NQDHTDPHRA TSVAMAEVSS AVVATSLVLI SVFVPVSFFP GTTGILYRQF SLTIAFSIAI
SLFNALTLSP ALAAILLRSE EKKYGMLDWT RSKTISGGYR KIAHGIDDAI HGLGAWYGKV
IGTVLRLRYV MLALFVAGLV ATGYMYVHVP TGFVPQEDQN YFITVVQAPQ GASLAYTTSI
AKQAEQILRA DPDVFGTFAV PGFSLTGGSS SNYGLIFAPL KPIDERKGKG HAASDIVARI
GPKLFSVPGA IIVPFEPPAI QGIGSFGGFQ FQLQDLGRNT LQDLDVVAHK IIGASRQRQD
LAGLFTSYTA NDPQELVEID RQKAKAMGVP ISQITQALGV YMGSEYVNDF DFNNRSYRVY
VQADQPFRMT ARDIRQYYVR SDTNGLVPLE NIVTLKETSG PQVINHFNLF RSAEIDGAPA
PGFSSSHGLE AMQELAHQNM IQGMSFQWSG LALEEVEAGG KAILIFALGI LVVYLTLSAQ
YESFALPFII LLAVPTAVLG ALSFISLRGL VNDVYVQIGL VMLIGLSAKN SILIVEFAEQ
LIEQGRSITE AAIEAGELRL RPILMTSFAF ILGVLPLYFA TGAGKIGRHS VGTAIVGGML
FSTVLNLIFI PVLYVIVKTL LGARTSGVAL EAEEVEEAA