Gene Acid345_4071 details

Gene Information

Locus tag: Acid345_4071
Symbol:
ID: 4072493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4817967
End bp: 4821134
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 11
GC content: 58%
IMG OID: 637986102
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_593145
Protein GI: 94971097
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
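
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4817967
END_BP = 4821134
GENE_LENGTH_BP = 3168
PROTEIN_LENGTH_AA = 1055


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 4821134 - 4817967 + 1 = 3168 bp, and 3168 / 3 - 1 = 1055 aa.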


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.702974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.202748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAATT TCTTCATTAG AAGGCCGATT GTGGCCATGG TGATCGCCAT TCTTATGGTG 
ATCATCGGCG TGGTCTCCAT GCTCCGCCTG CCCACCGCCC AATTCCCCGA CATCGCGCCG
CCGGAAGTTC AAGTGAAAGC TACCTATCCG GGCGCCGATG CAGAAACCGT TGAACAGGCA
GTCGCAACCC CGATCGAGCA GCAGATGAGC GGCGTGGACA ACATGAACTA CATGTTCTCC
ACTAACGCAA ACAATGGCGC GACGACCTTG ACCGTCAACT TCGACATCAA GACCGATCCC
AGCACCGACC AGATTCTGTC GCAGATGCGC ACCAACCAGG CCAACTCTCA GTTACCTGCG
GACGTCGTGA ACTTCGGTGT CACCGTGCAA AAGTCCACGA TGGCGCCTTT GATGCTCATC
ACGCTGTATT CGCCCAAGGG CACCTACGAC AACATCTTCC TGGCGAACTA CTCCTACATC
AACTTGAACG ATCAGTTGAC CCGCGTGCCC GGTATCGCGA GCGTTACCGT ATTCGGCGCC
GGCCAGTACG CGATGCGCGT GTGGGTGAAG CCAGACACGT TGGCGAAGTT AAGCGTGACT
GTGCCGGAGA TCATCAAGGT CATCCAGGCG CAGAACACCG TGAACCCAGC CGGCCAGATC
GGCGGCGAGC CGGTGCCGCA AGGACAAGAC TTCACCTACA ACGTCCGTGC GAAAGGCCGC
CTTCCTTCGG CGGAAGAATT CGAACAAATT GTCGTTCGTG CCAATCCTGA TGGTTCGATC
CTTCGTCTGA AAGATGTCGC TCGAATCGAA CTCGGCGCAC AGAACTACAA CATCATCGGA
CGCTACAGCG GCAAGCCCGC GGCTGTGGTC GCTGTCTATC AGCTACCCGG ATCGAACGCA
GTAAAAGCCG CAGCCGGCGT GCGGGCCTTG ATGGAAGAAG CCAAGACTCG CTTCCCGCAG
GACCTGGATT ACGACATCGC GCTCGACACG ACCGTTGCGG TCACCGAAGG CCTCAAGGAG
ATTCAGCACA CCCTCGCTGA AGCCATCGTC CTCGTGATCC TCGTTGTGTA CATCTTCCTG
CAAGGCTGGC GCACCACGCT GATCCCCCTG CTCGCCGTGC CGGTTTCTCT CGTCGGCACT
TTCGTCGTCT TCCCCTTGCT GGGATTTTCC ATCAACACTC TCTCCCTCTT CGGACTGGTG
CTGGCCATCG GCATCGTCGT GGACGATGCC ATCGTCGTAG TGGAAGCCGT CGAGCACCAC
ATCGAACATG GCCTCTCTCC AAAAGATGCC GCCTACAAGG CGATGGAGGA AGTCTCCGGC
CCGGTCGTCG CCATCGCGCT GATTCTGGCC GCCGTCTTCG TGCCCACCGC CTTCATCCCC
GGCATCACCG GCCGCCTCTA CCAGCAGTTC GCAATCACTA TCGCGATTTC CGTGATCTTC
TCCGCGTTCA ACGCTCTCTC GCTCAGCCCA GCGCTGGCCG CCCTTTTGCT GAAACCGAAG
AAAGAAGCAC GCGGACCGCT CGGTGCGTTC TTCCGCTGGT TCAACAAGTG GTTTGGCAGA
GCCACAGACG GTTACGTCAG CATTTGCGGG GGGATGATTC GCAAAGCCGC ACTTAGCATG
ATCCTCCTCG CGGGTCTTAC CGTGCTCGCC GGCTGGTTCG GCAAGAACCT GCCGAAGAGC
TTCCTGCCCG ACGAAGACCA GGGCTACGTG TTCGCGGGGT TGCAGCTGCC AAACGCTGCT
TCTCTCCAGC GCAACAGCGA CGCCGCCAAG AAGATCGAGG AAATGATCCT CAAGACTCCT
GGCGTGCACT CCGTGACGAC CGTCGCCGGA TACAGCATGT TGAGCGGCGT GCAGGCCACG
TACAGCACCT TCTTCTGGAT CACTCTCAAA GAGTGGAGCG AACGCAAAAC TCCGGAAGAG
TCGTATGAGG GAATCAAGAA GCACCTCAAC AAGGAGTTGG CGTCGGTAAC GGAAGGTGTT
GCGTTCTCGT TCCCGCCGCC TGCGATTCCC GGTGTAGGAG CTTCCGGCGG ATTCACCTTC
TTGCTCGAAG ACCGCGCCGG CAAAGATGTC GCATTCCTCA GCCAGAACCT GCAAAAGTTC
ATGGCCGCTG CCAGGAAACG TCCTGAGATC GCCGGAATCT CCACAACGGC GTTGCTCTCG
GTGCCCCAGG TTTATGTTGA CGTCGATCGC CCGAGGGTGA TCGCACAAGG CGTTCAACTC
AATGACGTTT ACCGCACCAT GCAGACCTTT ATGGGCGGCA CACTCGTCAA TTACTTCAAC
CGATTCGGCC GCCAGTGGCA GGTCTACGTG CAAGCCGAAG GCGATTACCG TACTAAGGCC
GAAAACGTTG GCCAGTTCTA CGTGAACAAC AACGATGGCC AATCCGTACC GCTGAGTGCC
GTCACTAGTA TCAAGCGAAC CTCCGGTCCC GAGTTCACTA TGCGCTACAA CTTGTACCGG
TGCGTTCAGA TCAATGGAAA TGCGGCGCCC GGCTACAGTT CGGAACAGGC AATTCAGGCA
CTTGAAGAGA CCTTCAAGGA AAGCATGCCC AGCGAGATGG GCTTCGACTA CATGGGTATG
TCGTACCAGG AAAAGAAAGC TGCGGAAGGC GTACCGGCTT CTGCAATCTT CGGCATGTCG
CTGCTGTTCG TGTTCCTCAT CCTCGCCGCG CAATATGAAA GCTGGTCTCT GCCTTTCAGC
GTACTGCTGG GCACGCCGAT CGCCGTGGCC GGCGCATTCG CGTTCCTCTA TTTCCGCGGA
ATGGAAAACA ACATCTACGT GCAGATCGGC CTCGTCATGC TCATCGGACT TGCTGCGAAG
AACGCGATTC TTATCGTGGA ATTCGCGAAA ATGGAATACG ACAAGGGCAA GTCTGCGGAG
GAAGCAGCAC TTATCGCAGC TAAGCTCCGT CTGCGACCGA TTCTCATGAC GGCGTTCGCC
TTCATTCTCG GTTGCGTTCC TCTGTGGGCG GCTTCCGGCG CCGGCGCCAT TTCCCGTCGC
GTCCTCGGAA CCGCGGTGAT TGGAGGAATG ATGGCAGCGT CCCTGCTCGC GATCTTCCTC
ATTCCGGTCA GCTTCGACGT CGTCGAACGT TTGTCGCACA TGGGAGGCAG TAAACATCCG
CCATCGAATG AAGGCGCAGA GACCACCGTC GCAGGCGGAG GGCACTGA
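
A GC content figure such as the one listed for this gene can be recomputed from a sequence printed in the 10-base block layout used above. The sketch below is illustrative only: the helper names are hypothetical, and it assumes GC content is the fraction of G and C bases over the total sequence length, which may differ from how the source page calculates it.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)
```

Pasting the full gene sequence into `gc_percent` as a multi-line string gives a value to compare against the GC content field.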
 
Protein sequence
MANFFIRRPI VAMVIAILMV IIGVVSMLRL PTAQFPDIAP PEVQVKATYP GADAETVEQA 
VATPIEQQMS GVDNMNYMFS TNANNGATTL TVNFDIKTDP STDQILSQMR TNQANSQLPA
DVVNFGVTVQ KSTMAPLMLI TLYSPKGTYD NIFLANYSYI NLNDQLTRVP GIASVTVFGA
GQYAMRVWVK PDTLAKLSVT VPEIIKVIQA QNTVNPAGQI GGEPVPQGQD FTYNVRAKGR
LPSAEEFEQI VVRANPDGSI LRLKDVARIE LGAQNYNIIG RYSGKPAAVV AVYQLPGSNA
VKAAAGVRAL MEEAKTRFPQ DLDYDIALDT TVAVTEGLKE IQHTLAEAIV LVILVVYIFL
QGWRTTLIPL LAVPVSLVGT FVVFPLLGFS INTLSLFGLV LAIGIVVDDA IVVVEAVEHH
IEHGLSPKDA AYKAMEEVSG PVVAIALILA AVFVPTAFIP GITGRLYQQF AITIAISVIF
SAFNALSLSP ALAALLLKPK KEARGPLGAF FRWFNKWFGR ATDGYVSICG GMIRKAALSM
ILLAGLTVLA GWFGKNLPKS FLPDEDQGYV FAGLQLPNAA SLQRNSDAAK KIEEMILKTP
GVHSVTTVAG YSMLSGVQAT YSTFFWITLK EWSERKTPEE SYEGIKKHLN KELASVTEGV
AFSFPPPAIP GVGASGGFTF LLEDRAGKDV AFLSQNLQKF MAAARKRPEI AGISTTALLS
VPQVYVDVDR PRVIAQGVQL NDVYRTMQTF MGGTLVNYFN RFGRQWQVYV QAEGDYRTKA
ENVGQFYVNN NDGQSVPLSA VTSIKRTSGP EFTMRYNLYR CVQINGNAAP GYSSEQAIQA
LEETFKESMP SEMGFDYMGM SYQEKKAAEG VPASAIFGMS LLFVFLILAA QYESWSLPFS
VLLGTPIAVA GAFAFLYFRG MENNIYVQIG LVMLIGLAAK NAILIVEFAK MEYDKGKSAE
EAALIAAKLR LRPILMTAFA FILGCVPLWA ASGAGAISRR VLGTAVIGGM MAASLLAIFL
IPVSFDVVER LSHMGGSKHP PSNEGAETTV AGGGH
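
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, hedged example: `translate` is a hypothetical helper, and the codon table is the standard set of codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, something this sketch does not model).

```python
from itertools import product

# Codons in TCAG order paired with the standard amino-acid assignments,
# which are the same in translation table 11. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGCAAATT TCTTCATTAG AAGGCCGATT"))  # MANFFIRRPI
```

The same call on the last 18 bases of the gene sequence (`GCAGGCGGAG GGCACTGA`) returns `AGGGH`, matching the end of the protein sequence, with the final `TGA` read as the stop codon.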