Gene Information |
Locus tag | Caul_4026 |
Symbol | |
ID | 5901488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4362023 |
End bp | 4365172 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641564547 |
Product | hydrophobe/amphiphile efflux-1 (HAE1) family protein |
Protein accession | YP_001685649 |
Protein GI | 167647986 |
COG category | [V] Defense mechanisms |
COG ID | [COG0841] Cation/multidrug efflux pump |
TIGRFAM ID | [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family |
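The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS whose length
    includes one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above (Caul_4026).
length_bp = gene_length(4362023, 4365172)
length_aa = protein_length(length_bp)

print(length_bp)  # 3150
print(length_aa)  # 1049
```

Both results match the Gene Length (3150 bp) and Protein Length (1049 aa) fields of the record.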
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.40035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.354384 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCCC GTTTCTTCAT CGACCGGCCG ATCTTCGCGT GGGTCGTCGC GATCGTCATC ATGCTGGCGG GGGTCCTGGC CATCCGCACC CTGCCGATCG CCCAATATCC GAACATCGCC CTGCCCCAGG TCTCGGTCAA CGCCATCTAT CCCGGCGCCT CGGCCAAGAC CGTCGAGGAC AGCGTCACCC AGGTCATCGA GCAGAAGATG AAGGGCCTGG ACGGCCTGGA CTACATGTCG TCGACCAGCG ACAGCTCGGG CTCGGCCACC ACCACCCTGA CCTTCAAGGC CGGCACCAAC ATCGATATCG CCCAGGTGCA GGTGCAGAAC AAGCTGCAGA CGGCCACCGC CCTGCTGCCG CAGGAAGTGC AGCAGCAGGG CCTGACCGTG GCCAAGTCGG CGCGCAACTT CCTGCTGATC GTCGGGATGT ATTCGGAGGA TCCGAAGACC ACCAACACCG ACCTGGCCGA CTACATCGCC AGCAATCTGC AAGACCCCCT GAGCCGCGTC GACGGCGTGG GCGAGGTCCA GCTGTTCGGC GCGCAATACG CCATGCGCGT CTGGCTGGAC CCTACCAAGC TGGCCAGCTA CAGCCTGACC CCCGCCGACG TGATCGCCGC GATCAAGGCC CAGAACGCCC AGGTCTCGGC CGGCCAGCTG GGCGGCACGC CCAATCTGCC GGGCATTGGC CTGAACGCCA CGATCACGGC CCAGTCACGC CTGAGCACCC CGGCCGAGTT CCAGAACATC ATCGTCAAGA ACAGCGCCGG CGGCGCCATC GTGCATCTGC GCGACGTCGC CCGCGTGGAA CTGGGCGCGG AAAGCTACGG CTTCGGCGCC AAGTACAATG GCAAGCCCGC CGCGGGCCTG GCCATCCGCC TGGCTCCCGG CGCCAACGCC CTGGACACCG CCGACGCCAT CAAGGCGCGG ATGTCCCAGC TGGAAAAGAA CTTCCCGGCC AACTTCAAGT ACGTGATCCC GTACGACTCC ACCCCGTTCG TGAGGCTGTC GATCGAGGAA GTCGTCAAGA CCCTGATCGA GGCCATCGTC CTGGTCTTCA TCGTCATGTT CCTGTTCCTG CAGAACTGGC GCGCGACCCT GATCCCGACC ATCGCCGTGC CGGTCGTCCT GCTGGGCACC TTCGGCGTGC TGGCGGCCTT CGGCTACTCG ATCAACACCC TGACCATGTT CGGCCTGGTG CTGGCCATCG GCCTGCTGGT CGATGACGCC ATCGTCGTGG TCGAGAACGT CGAGCGGGTG ATGAGCGAGG AAGGGCTGTC GCCCATCGAG GCCACGCGCA AGTCGATGAA CGAGATCACC GGCGCCCTGA TCGGCATCGC CCTGGTCCTG GCCGCGGTGT TCGTGCCCAT GGCCTTCTTC GGCGGCTCGC AGGGCGTGAT CTATCGCCAG TTCTCGATCA CCATCGTCTC GGCCATGGCC CTGTCGGTGC TGGTCGCCCT GATCCTGACG CCGGCCCTGT GCGCCACCCT GCTCAAGCCG GTCGCCAAGG GTCACACCAT CGCCGACAAG GGCTTCTTCG GCTGGTTCAA CCGCAACTTC AACGACGTCA GCCATCGCTA CCAGGGCACG GTTCGCGGCC TGCTGGGCAA GAGCGGCCGC TGGCTGGTCG CCTATGGCGT GATCATCCTG CTCGTGGGCT TCCTGTTCAT CCGCCTGCCC AGCGCCTTCC TGCCCGAGGA AGACCAGGGC ACGATGTACG CCCTCGTCCA ACTGCCGCCC GGCGCGCCGG AAGAACGCAC CCAGGCCGTG CTCGACACCG TGCGATCACA CTTCCTGGTC 
GATGAGAAGG ACTCCGTTCA GGCGGTCTTC ACCGTCGCCG GCTTCAGCTT CGCCGGCGCG GGTCAGAACG CCGGCCTGGC CTTCATCCGC CTGAAGGACT TCGACCAACG CAAGGCCGCC AATCTCAAGG CTCCGGCCAT CGCCGGCCGG GCCATGGGCA AGTTCATGCA GATCCGCGAC GCCATGGTCT TCGCCATCGT GCCGCCGGCC GTGCCGGAAC TGGGCACCTC GTCGGGCTTC GACTTCCAAT TGCAGGACGT CGGCGGCGTG GGTCACGAGG CCCTGGTCGC CGCCCGCAAC CAGGTGCTGG GCATGGCCGC CCAGGACCCG ACCCTGGTCG GCGTACGTCC CAACGGCCAG GACGACACGC CGCAGCTGAA GATCGACATC GACCAGGCCA AGGCCGGGGC GATGGGCCTG ACCACCGCCG ACATCAACGC CGCGCTCAGC GCCGCGTGGG GCGGTTCCTA CGTCAACGAC TTCATCGATC GCGGCCGGGT CAAGAAGGTC TATGTCCAGG CCGACGCGCC CTACCGCATG CGTCCGGAAG ACCTGAACAC CTGGTACGTC CGCACCAGCG CCGGCACCAT GGCCCCGTTC TCGTCCTTCG CCACCACCCA CTGGATCTAC GGCTCGCCGC GTCTCGAGCG CTACAACGGC CTTTCCTCGC TCAACATCCA GGGCGCGCCC GCCCCGGGCA AGAGCTCGGG CCAGGCCATG GCCGCCATGG AGAAGATCGC CGCCCAGCTG CCGCCGGGCG TCGGCTTCGA ATGGACCGGC CTGTCGGCTC AGGAACGCGA GTCCGGCAAC CAGGCCCCGG CCCTGTACGG CATCTCGATC CTGGTGGTGT TCCTGCTGCT GGCCGCTCTC TATGAAAGCT GGTCGATCCC GCTGTCGGTG ATCATGGTCA TTCCGCTGGG CATCGTCGGC GCCCTGCTGG CCACCAGCCT GCGCGGCCTG TCCAACGACA TCTACTTCCA GGTGGGCCTG CTGACGACCA TGGGCCTGGC GTCCAAGAAC GCCATCCTGA TCGTCGAGTT CGCCAAGGAC CTGCACGAGA AGGGCATGGA CCTGGTCGAG GCGACGGTGG AGGCGGTGCG CATCCGTCTG CGGCCGATCA TCATGACCTC GCTGGCCTTC GTGTTCGGCG TGCTGCCCCT TGCCATCTCG AACGGGGCGG GCTCGGGCAG CCAGCACGCC ATCGGCACCG GCGTGATCGG CGGCATGATC TCGGCCACCC TGCTGGCGAT CTTCTTCGTT CCGCTGTTCT TCGTGATCGT CGAGCGGATC TTCAAGCCCA AGCCGCGTCA CCGTGACGAG GCCCCCGCGA CTCCGGCCGA GAGCCACTGA
|
Protein sequence | MLSRFFIDRP IFAWVVAIVI MLAGVLAIRT LPIAQYPNIA LPQVSVNAIY PGASAKTVED SVTQVIEQKM KGLDGLDYMS STSDSSGSAT TTLTFKAGTN IDIAQVQVQN KLQTATALLP QEVQQQGLTV AKSARNFLLI VGMYSEDPKT TNTDLADYIA SNLQDPLSRV DGVGEVQLFG AQYAMRVWLD PTKLASYSLT PADVIAAIKA QNAQVSAGQL GGTPNLPGIG LNATITAQSR LSTPAEFQNI IVKNSAGGAI VHLRDVARVE LGAESYGFGA KYNGKPAAGL AIRLAPGANA LDTADAIKAR MSQLEKNFPA NFKYVIPYDS TPFVRLSIEE VVKTLIEAIV LVFIVMFLFL QNWRATLIPT IAVPVVLLGT FGVLAAFGYS INTLTMFGLV LAIGLLVDDA IVVVENVERV MSEEGLSPIE ATRKSMNEIT GALIGIALVL AAVFVPMAFF GGSQGVIYRQ FSITIVSAMA LSVLVALILT PALCATLLKP VAKGHTIADK GFFGWFNRNF NDVSHRYQGT VRGLLGKSGR WLVAYGVIIL LVGFLFIRLP SAFLPEEDQG TMYALVQLPP GAPEERTQAV LDTVRSHFLV DEKDSVQAVF TVAGFSFAGA GQNAGLAFIR LKDFDQRKAA NLKAPAIAGR AMGKFMQIRD AMVFAIVPPA VPELGTSSGF DFQLQDVGGV GHEALVAARN QVLGMAAQDP TLVGVRPNGQ DDTPQLKIDI DQAKAGAMGL TTADINAALS AAWGGSYVND FIDRGRVKKV YVQADAPYRM RPEDLNTWYV RTSAGTMAPF SSFATTHWIY GSPRLERYNG LSSLNIQGAP APGKSSGQAM AAMEKIAAQL PPGVGFEWTG LSAQERESGN QAPALYGISI LVVFLLLAAL YESWSIPLSV IMVIPLGIVG ALLATSLRGL SNDIYFQVGL LTTMGLASKN AILIVEFAKD LHEKGMDLVE ATVEAVRIRL RPIIMTSLAF VFGVLPLAIS NGAGSGSQHA IGTGVIGGMI SATLLAIFFV PLFFVIVERI FKPKPRHRDE APATPAESH
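The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup with a GC-fraction calculation and a rough completeness check. All helper names are hypothetical, and the start-codon set (ATG, GTG, TTG) is an assumption covering only the common bacterial start codons rather than the full list permitted by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: only the most common bacterial start codons are accepted here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: whole codons, a common start codon, and a terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence above, used as a short sample.
sample = clean_sequence("ATGCTTTCCC GTTTCTTCAT")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.4
```

Applied to the full gene sequence, `gc_fraction` can be compared against the GC content field, and `len(clean_sequence(...))` against the Gene Length field; the same cleanup works for the protein sequence.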