Gene Caul_1336 details

Gene Information

Locus tag: Caul_1336
Symbol:
ID: 5898791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1416520
End bp: 1419663
Gene Length: 3144 bp
Protein Length: 1047 aa
Translation table: 11
GC content: 62%
IMG OID: 641561823
Product: hydrophobe/amphiphile efflux-1 (HAE1) family protein
Protein accession: YP_001682964
Protein GI: 167645301
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
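
The coordinates and lengths in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1416520
end_bp = 1419663

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the listed Gene Length (3144 bp) and Protein Length (1047 aa).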


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCGCT TCTTTATCGA CAGACCCATC TTCGCGTGGG TCATCGCCAT CGTGATCATG 
CTGGCCGGGG CGCTCGCCAT CCGGACCTTG CCGGTCGCGC AGTATCCTGC AATCGCACCG
CCCCAAGTCG CCGTCAGCGC AATGTATCCC GGCGCGTCGG CCAAGACGCT CGAAGACAGC
GTCACCCAGG TCATCGAACA GAAGATGAAG GGCATCGACG GCCTGGATTA CATGTCGTCA
ACGAGCGACA GCACCGGCGC TGCGACGGTC ACCCTGACTT TCAAAGCTGG CACGGATATC
GACATCGCGC AGGTGCAGGT CCAGAACAAG CTGCAGACGG CTACCGCGCT GTTGCCGCAG
GAAGTTCAGC AGCAAGGCCT AACCGTCGCA AAGTCGGCCC GTAACTTCCT GCTCGTGATC
GGTCTCTATT CTGACAATCC GAAAACAACC AGCAACGACC TTACCGACTA TATCGCCTCC
AACATCCAGG ACCCGCTGAG CCGGGTCGAT GGGGTCGGCG ACGTCCAGCT TTTCGGCGCC
CAGTACGCCA TGCGCATCTG GCTCGACCCG AACAAGCTGG CGTCCTACAG CCTGACGCCG
AGCGACGTGT CGGCCGCCAT CCGCGCCCAG AACGCGCAGG TTTCGGCCGG CCAGCTAGGC
GGTACGCCCA ACCTGCCGGG CACGGCGCTC AACGCGACGA TCACGGCGCA GTCGCGCCTT
CAGACGCCTG AAGAGTTCGA AAACATCATC CTGAAGAACA GCCAGGGCGG CGCCATTGTC
TATATGCGTG ACGTCGCGCG TGTGGAGATG GGCGCGGAAA GCTACAGCGC AGCCGCGAAA
TACAATGGAA AGCCCTCCGG CGGTCTGGCG ATCAAGCTGG CGCCGGGTGC CAATGCTTTG
GACACGGCCA AAGCGGTCAA GGCGCAGGTT GATGAGCTGG CGAAGAATTT CCCTTCTGAA
TACAAATACA TAGTTCCATA CGACTCGACG CCCTTTGTCG AACAGTCGAT CCACGAGGTG
GTCAAGACGC TGATGGAGGC CATTGTGCTG GTCTTCATCG TCATGTTCCT GTTCCTGCAA
AACTGGCGCG CGACGCTGAT CCCGACGATT GCCGTTCCCG TCGTCCTGCT CGGCACCTTC
GGCGTGCTGG CCATGTTCGG CTACTCGATC AACACCCTCA CCATGTTCGG CCTGGTGCTG
GCCATCGGCC TCCTGGTCGA TGACGCCATC GTCGTGGTCG AGAACGTCGA GCGGGTGATG
TCCGAAGAGG GCCTGTCGCC CAAGGAAGCC ACCCGCAAGT CGATGAACGA GATCACCGGC
GCCCTGATCG GCATCGCCCT GGTGCTGTCG GCGGTGTTCG TGCCGATGGC CTTCTTCGGC
GGCTCGCAAG GCGTCATCTA CCGTCAGTTC TCGATCACCA TCGTCTCGGC GATGATGTTG
TCTGTGGTGG TCGCCCTCGT GCTCACACCG GCGCTCTGCG CCACCATGCT CAAGCCTATC
GCCAAGGGCC ACATGGTCGC GGAGACGGGG TTCTTCGGCT GGTTCAACCG CAGCTTCAAC
GACGTCTCTG GTCGGTATCA GAACTCGGTG CGCGGCATCC TGGGCAAGAG CGTCCGCTGG
ATGGCGATCT ATGCCGCCAT CATTGTGGCC ATGGGTCTGC TGTTCGTTCG CTTGCCGAGC
GCCTTTCTAC CCGAAGAGGA TCAGGGGTTT ATGCTGACGC TGGTGCAGCT CCCGCCGGGT
GCCACTGAGG AGCGCACTCT GGGCGTCCTC GATCAGGTCC GCAGCCACTT CATGGAGAAC
GAGAAGGAGG CCGTGCAATC TGTGTTCACC GTCAACGGCT TCAGCTTCAG CGGGCCGGGG
CAGAACGCAG GCCTGGCGTT CATTCGGCTC AAGAATTTCG ACGAACGAAA GTCCGCTGAT
CTCAAGGTGC CGGCTCTGGC AGGTCGGGCC ATGGGTAAGT TCGCGCAGAT CCGCGATGCG
ATGGTGTTCG CCATCGTGCC GCCGGCTGTC ACCGAACTGG GCAACTCTTC GGGCTTCGAT
TTTCAGCTGA AGGATATTGG CGGGCTTGGC CATGAGGCCT TGGTCGCCGC CCGCAACCAG
ATGCTGGGTA TGGCCGCGCA GGATCCGGCC CTGGTTGGCG TACGTCCCAA CGGTCAGGAC
GACACCCCAC AGCTGAAGAT CGACATCGAC CAGGCCAAGG CCGGGGCCAT GGGCTTGACC
ACGGCCGACA TCAACGCCGC GCTGAGCACG GCCTGGGGCG GTTCGTATGT GAACGACTTC
GTCGATCGCG GCCGGGTCAA AAAGGTCTAT GTCCAGGCCG ACGCACCGTT CCGCATGACG
CCGGAAGATC TCAACAAATG GTACGTCCGC AACAGCAGCG GCCAGATGGT GCCCTTCTCG
GCCTTCGCCA CGAGCAACTG GATTTATGGC TCGCCGCGTC TTGAGCGCTA CAACGGCTTG
CCGTCAATGA ACATCCAGGG CTCCCCGGCG CCTGGCGCCA GTTCCGGTGA CGCGATCAAG
GCCATGGAGG CGATCGCTTC GAAACTCCCG GCCGGGATCG GTTACGAGTG GACGGGCCTC
TCCGCCCAGG AACTGGAAGC GGGCAATCAG GCTCCGGCCC TCTATGCCCT GTCCATCCTG
GTCGTGTTCC TGCTGCTGGC GGCCCTTTAC GAGAGCTGGA CGATCCCGCT GTCGGTGATC
ATGGTCATTC CGCTTGGCAT CGTCGGCGCC CTGCTGGCGA CCACCCTGCG CGGGCTGTCC
AATGACATCT ACTTCCAGGT GGGTCTGCTC ACGACCATGG GGCTGGCGGC CAAGAACGCC
ATCCTGATCG TCGAGTTCGC CAAGGAAATC TACCTGCGCA CCGGCAATCT CGTCGAGGCG
ACCCTGGACG CCGTCCGCAT CCGCCTACGT CCGATCATCA TGACATCGCT GGCCTTCATC
TTCGGCGTGT TGCCGTTGGC GATTTCCAAC GGGGCCGGCT CGGGCAGCCA GCACGCGATT
GGCACGGGCG TCATCGGCGG GATGCTGTCG GGGACCATCC TGGCAATCTT CTTCGTCCCG
CTGTTCTTCG TGCTCGTCGA ACGAATCTTC AAGGCCAAGC TCCGCAAAGA CGAGGCTTCC
GCCGCCCCGG TCGAGGGTCA CTAA
 
Protein sequence
MSRFFIDRPI FAWVIAIVIM LAGALAIRTL PVAQYPAIAP PQVAVSAMYP GASAKTLEDS 
VTQVIEQKMK GIDGLDYMSS TSDSTGAATV TLTFKAGTDI DIAQVQVQNK LQTATALLPQ
EVQQQGLTVA KSARNFLLVI GLYSDNPKTT SNDLTDYIAS NIQDPLSRVD GVGDVQLFGA
QYAMRIWLDP NKLASYSLTP SDVSAAIRAQ NAQVSAGQLG GTPNLPGTAL NATITAQSRL
QTPEEFENII LKNSQGGAIV YMRDVARVEM GAESYSAAAK YNGKPSGGLA IKLAPGANAL
DTAKAVKAQV DELAKNFPSE YKYIVPYDST PFVEQSIHEV VKTLMEAIVL VFIVMFLFLQ
NWRATLIPTI AVPVVLLGTF GVLAMFGYSI NTLTMFGLVL AIGLLVDDAI VVVENVERVM
SEEGLSPKEA TRKSMNEITG ALIGIALVLS AVFVPMAFFG GSQGVIYRQF SITIVSAMML
SVVVALVLTP ALCATMLKPI AKGHMVAETG FFGWFNRSFN DVSGRYQNSV RGILGKSVRW
MAIYAAIIVA MGLLFVRLPS AFLPEEDQGF MLTLVQLPPG ATEERTLGVL DQVRSHFMEN
EKEAVQSVFT VNGFSFSGPG QNAGLAFIRL KNFDERKSAD LKVPALAGRA MGKFAQIRDA
MVFAIVPPAV TELGNSSGFD FQLKDIGGLG HEALVAARNQ MLGMAAQDPA LVGVRPNGQD
DTPQLKIDID QAKAGAMGLT TADINAALST AWGGSYVNDF VDRGRVKKVY VQADAPFRMT
PEDLNKWYVR NSSGQMVPFS AFATSNWIYG SPRLERYNGL PSMNIQGSPA PGASSGDAIK
AMEAIASKLP AGIGYEWTGL SAQELEAGNQ APALYALSIL VVFLLLAALY ESWTIPLSVI
MVIPLGIVGA LLATTLRGLS NDIYFQVGLL TTMGLAAKNA ILIVEFAKEI YLRTGNLVEA
TLDAVRIRLR PIIMTSLAFI FGVLPLAISN GAGSGSQHAI GTGVIGGMLS GTILAIFFVP
LFFVLVERIF KAKLRKDEAS AAPVEGH
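
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in permitted start codons). The `translate` function and `CODON_TABLE` name are hypothetical helpers written for this illustration, not part of any database tooling; the function strips the whitespace used to group the sequence into blocks of ten and stops at the first stop codon.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_line = "ATGTCTCGCT TCTTTATCGA CAGACCCATC TTCGCGTGGG TCATCGCCAT CGTGATCATG"
print(translate(first_line))
```

Applied to the first 60 bases of the gene sequence, this yields the first 20 residues of the listed protein sequence (MSRFFIDRPI FAWVIAIVIM).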