Gene Caul_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1087 
Symbol 
ID5898542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1150936 
End bp1154115 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content67% 
IMG OID641561569 
Producthydrophobe/amphiphile efflux-1 (HAE1) family protein 
Protein accessionYP_001682715 
Protein GI167645052 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTCT CGAAATTCTT CGTCAGCCGG CCGCGCTTCG CCGCCGTGCT GTCGATCGTC 
ATCTTCATCG CCGGGTTGGT GGCTCTGCCG CGCCTGCCGA TCTCGGAATA TCCCGAGGTG
GTGCCGCCGA CCGTGGTCGT GCGCGCCGCC TATCCGGGCG CCAACCCCGC CGTCATCGGC
CAGACCGTCG CCGCCCCGCT GGAGCAGGCG ATCAACGGCG TCGAGGGCCT GATCTACCAG
TCGTCGCAGT CGGCCTCCGA CGGCGCGATG ATCCTGACCG TCACCTTCGC CCTCGGGACC
GACCTCGATA AGGCCCAGGT GCAGGTGCAG AACCGCGTCG CCCAGGCCCT GCCGAAACTG
CCCCAGGAGG TGCAGCGCAT CGGCGTGACC ACCGACAAGG CCTCGCCCGA CCTGACCCTG
GTCGTGCACA TGATCTCGCC CAACAACCGC TACGACATGC TGTATCTCAG CAACTACGCG
CAGCTGAACG TCCGCGACCG CCTCAAGCGC GTCGACGGCG TGGGCGACGT GCAGATCTTC
GGGGCCGGCG CCTATTCCAT GCGGATCTGG CTGGACCCGG AAAAGCTGGC CTCGCTGAAC
ATGACGGCCG GTGACGTGGT GAAAGCCCTG CGCGAGCAGA ACGTCCAAGT CGCCGCCGGC
CAGCTGGGCG CGCCCCCCAC CCCGGGCGGC GCCGAGTTCC AGCTGTCGGT CAACGCCCCC
GGCCGCCTGA CCGACGAGGA GCAGTTCCGC AACGTCATCA TCCGCTCGGG CGACCATGGC
GAGATCACCC ACCTGGGCGA CGTAGCCCGC GTGGAGATGG GCGCCAACAA CTACGCCCTG
CGCAGCCTGC TCGACAACAA GTCGGCCGTG GCCATGCCGA TCTTCCAGCG TCCCGGCTCC
AACGCCCTGC AGATGGCCGC CGACATCAAG AAGACGATGA AGGAGCTGAA GAAGGAGTTC
CCCGAGGGCG TCGACTACGA GATCATCTAC GACACCACCG CCTTCGTTCA GGAGAGCATC
GACTCGGTGA TCCACACCCT GATCGAGGCC ATCATCCTGG TCGTGCTGGT GGTGGTGCTG
TTCCTGCAAG GCTGGCGCGC CTCGATCATT CCGCTGATCG CCGTCCCCGT CTCGCTGATC
GGCACCTTCG CCATCCTGCT GATGCTGGGC TTTGGCCTCA ACGCCCTGAC GCTGTTTGGC
CTGGTGCTGG CCATCGGCAT CGTCGTCGAC GACGCCATCG TCGTGGTCGA GAACGTCGAG
CGAAACATCA CCAACGGGCT GGAACCGGCG GCGGCCACCC GCAAGGCCAT GAGCGAGGTC
ACCGGACCGA TCATCTCCAC GGCCCTGGTG CTGTGCGCGG TGTTCATCCC CACCGCCTTC
ATCAGCGGCC TGTCGGGCCA GTTCTACCGC CAGTTCGCCC TGACCATCGC CATCTCGACG
GTGATCTCGG CCTTCAACTC CCTGACCCTG TCGCCGGCCC TGGCCGCCGT GCTCCTCAAG
AGCCACGACG CCCCCAAGGA CCGCTTCCAG AAGGTGATCG ACGCCGCCCT GGGTTGGCTG
TTCAACCCGT TCAACCGCTT GTTCGCCAAG GCGTCCAACG GCTATGTCCG CAATGTCGGC
CGCACCCTGG GCCGGTCGAC CGCCGGCCTG GTGGTCTACG GGATCCTGCT GGCCCTGACG
ATCGGCGCCT TCATGAAGAC CCCGGGCGGC TTCGTGCCGC AGCAGGACAA GGCCTACGTC
GTGGCCGTCG TGCAACTGCC CGACGCGGCC TCGCTGGACC GCACCGAGGC GGTTATCCGC
CAGATGGGCG ACATCGCCGC CAAGGTGCCG GGCATCAAGC ACTCGGTGGC CTTCCCCGGC
CTGTCGGCCA ACGGCTTCAT CAACAGCCCC AACTCCGGCG CGGTGTTCTT CCCGCTGGCC
GACTTCAAGG ACCGCAAGGA CAAGTCGATG TCGGCCAACG CCATCGTCGG CCAGCTGAAC
GCCAAGTTCG GGGCCATTCC CGACGCCCAG ATCGCGGTCT TCCCGCCGCC TTCGGTGCAG
GGGCTGGGCA CGATCGGCGG CTTCCGGATG CAGATCGTCG ACCGCGCCGG CCTGGGTCCT
GACGAACTCA ACAAGCAGAC CCAGAACCTG ATCGACAAGG CCCGCAAGGA CCCCGCCCTG
ACCGGCGTGT TCTCGACCTA CCAGGTGGGC GTGCCGAAGA TCGAGGCCAA CATCGACCGC
GAAAAGGCCC GGGCCTCGGG CGTCAGCCTG ACCGACCTCT TCGAGACCAT GCAGGTTTAT
CTGGGCTCGC TGTACGTCAA CGACTTCAAC CGCTTTGGCC GCACCTACGA GGTCAATGTC
CAGGCCGACC AGAAATTCCG CATGCAGCCC GAGCAGATGC TGCGCCTGCA AACGCGCAAC
GCCACCGGCC AGATGATCCC GCTGGGCGCC TTCGTCAGCT TCAAGGAGGC CACCGGCCCC
GACCGCGAGA GCCACTACAA CGGCGTGCTC ACGGCCGAGA TCAACGGCGG CCCGGCCCCC
GGCTACTCGA CGGGCCAGGC CCAGAAGGCG CTGGAGGACC TGGCCAAGCA GGAACTGCCC
AACGGCATGG GCTTCGAGTG GACCGAGCTG ACCTATCAGC AGATCCTGGC GGGCAACACG
GCGATCTATG TCTTCCCGCT GTGCGTGCTG CTGGCCTTCC TGGTGCTGGT CGCCCAGTAC
GAAAGCTGGA CCCTGCCGCT GGTGGTGATC CTGATCGTGC CGATGACCCT GCTGTCGGCC
CTGGCCGGCG TGCTGCTGAC CCATGGCGAC AACAACATCT TCACCCAGAT CGGGCTGATC
GTGCTGGTGG GGCTGGCCTG CAAGAACGCC ATCCTGATCG TCGAATTCGC CAAGGAGCGA
GAGGAGCATG GCGACACGCC GCTGCAAGCG GTGCTGGAAG CCTGTCGCCT GCGCCTGCGG
CCGATCCTGA TGACCTCGAT CGCCTTCATC ATGGGGGTGT GGCCGCTGGT CACCTCGCAC
GGGGCGGGCG CCGAGATGCG CCGGGCCATG GGCGTGGCGG TGTTCTCCGG CATGCTGGGG
GTGACGGTCT TCGGCCTGAT CCTGACCCCG ATCTTCTACT TCGTCATCCG CCGCAACACC
GCCCGCCGGG CGGCTCTGAA GGCGGCCAAG ACACCGACGA CCGCCGTGGA GGCGCACTGA
 
Protein sequence
MNFSKFFVSR PRFAAVLSIV IFIAGLVALP RLPISEYPEV VPPTVVVRAA YPGANPAVIG 
QTVAAPLEQA INGVEGLIYQ SSQSASDGAM ILTVTFALGT DLDKAQVQVQ NRVAQALPKL
PQEVQRIGVT TDKASPDLTL VVHMISPNNR YDMLYLSNYA QLNVRDRLKR VDGVGDVQIF
GAGAYSMRIW LDPEKLASLN MTAGDVVKAL REQNVQVAAG QLGAPPTPGG AEFQLSVNAP
GRLTDEEQFR NVIIRSGDHG EITHLGDVAR VEMGANNYAL RSLLDNKSAV AMPIFQRPGS
NALQMAADIK KTMKELKKEF PEGVDYEIIY DTTAFVQESI DSVIHTLIEA IILVVLVVVL
FLQGWRASII PLIAVPVSLI GTFAILLMLG FGLNALTLFG LVLAIGIVVD DAIVVVENVE
RNITNGLEPA AATRKAMSEV TGPIISTALV LCAVFIPTAF ISGLSGQFYR QFALTIAIST
VISAFNSLTL SPALAAVLLK SHDAPKDRFQ KVIDAALGWL FNPFNRLFAK ASNGYVRNVG
RTLGRSTAGL VVYGILLALT IGAFMKTPGG FVPQQDKAYV VAVVQLPDAA SLDRTEAVIR
QMGDIAAKVP GIKHSVAFPG LSANGFINSP NSGAVFFPLA DFKDRKDKSM SANAIVGQLN
AKFGAIPDAQ IAVFPPPSVQ GLGTIGGFRM QIVDRAGLGP DELNKQTQNL IDKARKDPAL
TGVFSTYQVG VPKIEANIDR EKARASGVSL TDLFETMQVY LGSLYVNDFN RFGRTYEVNV
QADQKFRMQP EQMLRLQTRN ATGQMIPLGA FVSFKEATGP DRESHYNGVL TAEINGGPAP
GYSTGQAQKA LEDLAKQELP NGMGFEWTEL TYQQILAGNT AIYVFPLCVL LAFLVLVAQY
ESWTLPLVVI LIVPMTLLSA LAGVLLTHGD NNIFTQIGLI VLVGLACKNA ILIVEFAKER
EEHGDTPLQA VLEACRLRLR PILMTSIAFI MGVWPLVTSH GAGAEMRRAM GVAVFSGMLG
VTVFGLILTP IFYFVIRRNT ARRAALKAAK TPTTAVEAH