Gene Caul_4026 details


Gene Information

Locus tag: Caul_4026
Symbol:
ID: 5901488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4362023
End bp: 4365172
Gene Length: 3150 bp
Protein Length: 1049 aa
Translation table: 11
GC content: 67%
IMG OID: 641564547
Product: hydrophobe/amphiphile efflux-1 (HAE1) family protein
Protein accession: YP_001685649
Protein GI: 167647986
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
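
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not translated. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the untranslated stop codon


# Values taken from the record above.
gene_len = gene_length_bp(4362023, 4365172)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the results agree with the listed Gene Length (3150 bp) and Protein Length (1049 aa).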


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.40035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.354384
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCCC GTTTCTTCAT CGACCGGCCG ATCTTCGCGT GGGTCGTCGC GATCGTCATC 
ATGCTGGCGG GGGTCCTGGC CATCCGCACC CTGCCGATCG CCCAATATCC GAACATCGCC
CTGCCCCAGG TCTCGGTCAA CGCCATCTAT CCCGGCGCCT CGGCCAAGAC CGTCGAGGAC
AGCGTCACCC AGGTCATCGA GCAGAAGATG AAGGGCCTGG ACGGCCTGGA CTACATGTCG
TCGACCAGCG ACAGCTCGGG CTCGGCCACC ACCACCCTGA CCTTCAAGGC CGGCACCAAC
ATCGATATCG CCCAGGTGCA GGTGCAGAAC AAGCTGCAGA CGGCCACCGC CCTGCTGCCG
CAGGAAGTGC AGCAGCAGGG CCTGACCGTG GCCAAGTCGG CGCGCAACTT CCTGCTGATC
GTCGGGATGT ATTCGGAGGA TCCGAAGACC ACCAACACCG ACCTGGCCGA CTACATCGCC
AGCAATCTGC AAGACCCCCT GAGCCGCGTC GACGGCGTGG GCGAGGTCCA GCTGTTCGGC
GCGCAATACG CCATGCGCGT CTGGCTGGAC CCTACCAAGC TGGCCAGCTA CAGCCTGACC
CCCGCCGACG TGATCGCCGC GATCAAGGCC CAGAACGCCC AGGTCTCGGC CGGCCAGCTG
GGCGGCACGC CCAATCTGCC GGGCATTGGC CTGAACGCCA CGATCACGGC CCAGTCACGC
CTGAGCACCC CGGCCGAGTT CCAGAACATC ATCGTCAAGA ACAGCGCCGG CGGCGCCATC
GTGCATCTGC GCGACGTCGC CCGCGTGGAA CTGGGCGCGG AAAGCTACGG CTTCGGCGCC
AAGTACAATG GCAAGCCCGC CGCGGGCCTG GCCATCCGCC TGGCTCCCGG CGCCAACGCC
CTGGACACCG CCGACGCCAT CAAGGCGCGG ATGTCCCAGC TGGAAAAGAA CTTCCCGGCC
AACTTCAAGT ACGTGATCCC GTACGACTCC ACCCCGTTCG TGAGGCTGTC GATCGAGGAA
GTCGTCAAGA CCCTGATCGA GGCCATCGTC CTGGTCTTCA TCGTCATGTT CCTGTTCCTG
CAGAACTGGC GCGCGACCCT GATCCCGACC ATCGCCGTGC CGGTCGTCCT GCTGGGCACC
TTCGGCGTGC TGGCGGCCTT CGGCTACTCG ATCAACACCC TGACCATGTT CGGCCTGGTG
CTGGCCATCG GCCTGCTGGT CGATGACGCC ATCGTCGTGG TCGAGAACGT CGAGCGGGTG
ATGAGCGAGG AAGGGCTGTC GCCCATCGAG GCCACGCGCA AGTCGATGAA CGAGATCACC
GGCGCCCTGA TCGGCATCGC CCTGGTCCTG GCCGCGGTGT TCGTGCCCAT GGCCTTCTTC
GGCGGCTCGC AGGGCGTGAT CTATCGCCAG TTCTCGATCA CCATCGTCTC GGCCATGGCC
CTGTCGGTGC TGGTCGCCCT GATCCTGACG CCGGCCCTGT GCGCCACCCT GCTCAAGCCG
GTCGCCAAGG GTCACACCAT CGCCGACAAG GGCTTCTTCG GCTGGTTCAA CCGCAACTTC
AACGACGTCA GCCATCGCTA CCAGGGCACG GTTCGCGGCC TGCTGGGCAA GAGCGGCCGC
TGGCTGGTCG CCTATGGCGT GATCATCCTG CTCGTGGGCT TCCTGTTCAT CCGCCTGCCC
AGCGCCTTCC TGCCCGAGGA AGACCAGGGC ACGATGTACG CCCTCGTCCA ACTGCCGCCC
GGCGCGCCGG AAGAACGCAC CCAGGCCGTG CTCGACACCG TGCGATCACA CTTCCTGGTC
GATGAGAAGG ACTCCGTTCA GGCGGTCTTC ACCGTCGCCG GCTTCAGCTT CGCCGGCGCG
GGTCAGAACG CCGGCCTGGC CTTCATCCGC CTGAAGGACT TCGACCAACG CAAGGCCGCC
AATCTCAAGG CTCCGGCCAT CGCCGGCCGG GCCATGGGCA AGTTCATGCA GATCCGCGAC
GCCATGGTCT TCGCCATCGT GCCGCCGGCC GTGCCGGAAC TGGGCACCTC GTCGGGCTTC
GACTTCCAAT TGCAGGACGT CGGCGGCGTG GGTCACGAGG CCCTGGTCGC CGCCCGCAAC
CAGGTGCTGG GCATGGCCGC CCAGGACCCG ACCCTGGTCG GCGTACGTCC CAACGGCCAG
GACGACACGC CGCAGCTGAA GATCGACATC GACCAGGCCA AGGCCGGGGC GATGGGCCTG
ACCACCGCCG ACATCAACGC CGCGCTCAGC GCCGCGTGGG GCGGTTCCTA CGTCAACGAC
TTCATCGATC GCGGCCGGGT CAAGAAGGTC TATGTCCAGG CCGACGCGCC CTACCGCATG
CGTCCGGAAG ACCTGAACAC CTGGTACGTC CGCACCAGCG CCGGCACCAT GGCCCCGTTC
TCGTCCTTCG CCACCACCCA CTGGATCTAC GGCTCGCCGC GTCTCGAGCG CTACAACGGC
CTTTCCTCGC TCAACATCCA GGGCGCGCCC GCCCCGGGCA AGAGCTCGGG CCAGGCCATG
GCCGCCATGG AGAAGATCGC CGCCCAGCTG CCGCCGGGCG TCGGCTTCGA ATGGACCGGC
CTGTCGGCTC AGGAACGCGA GTCCGGCAAC CAGGCCCCGG CCCTGTACGG CATCTCGATC
CTGGTGGTGT TCCTGCTGCT GGCCGCTCTC TATGAAAGCT GGTCGATCCC GCTGTCGGTG
ATCATGGTCA TTCCGCTGGG CATCGTCGGC GCCCTGCTGG CCACCAGCCT GCGCGGCCTG
TCCAACGACA TCTACTTCCA GGTGGGCCTG CTGACGACCA TGGGCCTGGC GTCCAAGAAC
GCCATCCTGA TCGTCGAGTT CGCCAAGGAC CTGCACGAGA AGGGCATGGA CCTGGTCGAG
GCGACGGTGG AGGCGGTGCG CATCCGTCTG CGGCCGATCA TCATGACCTC GCTGGCCTTC
GTGTTCGGCG TGCTGCCCCT TGCCATCTCG AACGGGGCGG GCTCGGGCAG CCAGCACGCC
ATCGGCACCG GCGTGATCGG CGGCATGATC TCGGCCACCC TGCTGGCGAT CTTCTTCGTT
CCGCTGTTCT TCGTGATCGT CGAGCGGATC TTCAAGCCCA AGCCGCGTCA CCGTGACGAG
GCCCCCGCGA CTCCGGCCGA GAGCCACTGA
 
Protein sequence
MLSRFFIDRP IFAWVVAIVI MLAGVLAIRT LPIAQYPNIA LPQVSVNAIY PGASAKTVED 
SVTQVIEQKM KGLDGLDYMS STSDSSGSAT TTLTFKAGTN IDIAQVQVQN KLQTATALLP
QEVQQQGLTV AKSARNFLLI VGMYSEDPKT TNTDLADYIA SNLQDPLSRV DGVGEVQLFG
AQYAMRVWLD PTKLASYSLT PADVIAAIKA QNAQVSAGQL GGTPNLPGIG LNATITAQSR
LSTPAEFQNI IVKNSAGGAI VHLRDVARVE LGAESYGFGA KYNGKPAAGL AIRLAPGANA
LDTADAIKAR MSQLEKNFPA NFKYVIPYDS TPFVRLSIEE VVKTLIEAIV LVFIVMFLFL
QNWRATLIPT IAVPVVLLGT FGVLAAFGYS INTLTMFGLV LAIGLLVDDA IVVVENVERV
MSEEGLSPIE ATRKSMNEIT GALIGIALVL AAVFVPMAFF GGSQGVIYRQ FSITIVSAMA
LSVLVALILT PALCATLLKP VAKGHTIADK GFFGWFNRNF NDVSHRYQGT VRGLLGKSGR
WLVAYGVIIL LVGFLFIRLP SAFLPEEDQG TMYALVQLPP GAPEERTQAV LDTVRSHFLV
DEKDSVQAVF TVAGFSFAGA GQNAGLAFIR LKDFDQRKAA NLKAPAIAGR AMGKFMQIRD
AMVFAIVPPA VPELGTSSGF DFQLQDVGGV GHEALVAARN QVLGMAAQDP TLVGVRPNGQ
DDTPQLKIDI DQAKAGAMGL TTADINAALS AAWGGSYVND FIDRGRVKKV YVQADAPYRM
RPEDLNTWYV RTSAGTMAPF SSFATTHWIY GSPRLERYNG LSSLNIQGAP APGKSSGQAM
AAMEKIAAQL PPGVGFEWTG LSAQERESGN QAPALYGISI LVVFLLLAAL YESWSIPLSV
IMVIPLGIVG ALLATSLRGL SNDIYFQVGL LTTMGLASKN AILIVEFAKD LHEKGMDLVE
ATVEAVRIRL RPIIMTSLAF VFGVLPLAIS NGAGSGSQHA IGTGVIGGMI SATLLAIFFV
PLFFVIVERI FKPKPRHRDE APATPAESH