Gene Caul_5122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5122 
Symbol 
ID5897408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp39812 
End bp43030 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content64% 
IMG OID641555225 
Producthydrophobe/amphiphile efflux-1 (HAE1) family protein 
Protein accessionYP_001676556 
Protein GI167621771 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACATCT CCAAGTACTT TATCGACCGG CCCATCTTCG CAAGCGTGCT TTCGGCCGTC 
GTCCTGTTGG GCGGGATCAT CTCGGTCTTC AAGCTGCCGA TCTCTGAATA TCCCGACGTC
ATTCCGCCCC AGGTCCTGGT CCACGCCGAG TTTCCCGGCG CCAATCCCAA GGTTATCGCC
GAGACGGTCG CCGCCCCGAT CGAGGAGAAG ATCAACGGCG TCCAGGACAT GCTCTACATG
CAGTCCCAGG CCAACAGCGA CGGCAAGATG ACGACGACCG TCACCTTCAA GCTGGGCACC
AATCCCGACC TCGCCCAACA GCTGGTGCAG AACCGCGTCA CCCAGGCCGT CCCCCACCTT
CCCGAGGACG TACAGCGTCT GGGCGTGACG ACGGTCAAGG CCTCCTCGAC CATGACCTTG
GCCGTGCAGA TCAGCGGCGA TCAAAATCAT GACCTGGCCT ATCTGCGCAA TTACGCCCTG
ATCAACATCA AAGACCGCTT GGCCCGGATC CCTGGCGTCG GCGACGTGCA GCTCTACGGG
GCGGGCGACT ACGCCATGCG GATATGGCTC GACCCCCAGA AGCTGGCCCA GCGCAACTTG
ACCGCCACCG ACGCCGTCGC GGCCATCCGC GAGCAGAACG TCCAGGTGTC CGCCGGCATG
ATCGGCGGCA CGCCGAACGT TCCGGGCGTG CCGCTGCAGC TGGACGTCAA CGTCAAGGGC
CGCCTGCAGA GCGTCGAAGA GTTTGGCGAG ATCGTCGTCA AGACGGGTCC GGGCGGCGGC
GTCATCTACC TGCGTGACGT GGCCCGGATC GAGTTGGGCG CGGCCGAGTA TGGCCAGCGA
ACGGCGCTCA ACAACGGCCG CTCGATGGCG ATCTGGATCT TCCAGCTGCC GGGCGCCAAC
GCCCTGCAGA TCGCCGACGC CGTGCGCAAG ACCATGGCCG AGGAAATCGG TCCGGAGATG
CCGGCCGGCG TCAGCTATCG GATCGTCTAC GACCCCACCC GGTTCGTCAA AGCCAGCATC
GAGGCGGTGA TCCACACCCT GCTGGAAGCC GTCGCCCTGG TCGTCCTGGT GGTGATCATG
TTCCTCCAGA CCTGGAGGGC CTCGGTCATT CCCCTGCTGG CCGTGCCCGT GGCGATCGTC
GGCACCTTCG CCTTCCTGCT GGGCTTTGGC TATTCGATCA ACGCTCTGTC GCTGTTTGGA
TTGGTGCTGG CCATCGGCAT CGTGGTCGAT GACGCCATCG TCGTTGTCGA AAACGTCGAA
CGAAACCTCG AAGCGGGGCT CTCGCCCAAG GCGGCCACCT ACAAGGCCAT GCAGGAAGTC
AGCGGGCCGA TCATCGCCAT CGCCCTGACC CTGATCGCCG TGTTTGTGCC GCTAGCCTTC
ATGTCGGGGC TGACCGGACA GTTCTACAAG CAGTTCGCCG TAACGATCGC CGTCTCCACG
GTGATTTCCG CCTTTAATTC CCTGACGCTG TCGCCTGCCC TGGCGGCTCT GCTTCTACGC
GCGCCCCATC AGCCCAAGGA TTGGCTGACG CGGGTGATCG ACAGGCTTTT TGGACCCGCC
TTCAACGTCT TCAACACCGT CTTCAAGCGC GGCTCTCACG CGTACGGATC GGGCGTCACC
AGCGCCCTGG GCCGCAAGTC GATCATGCTG GTGCTCTACG CGGTGCTGGT CGGCGGAGCC
GTCTTGATGG GTAAGCTCGT GCCGGGCGGG TTCGTGCCGG CCCAGGACAA AGGCTACCTG
ATCGCCGTCG CCCAACTTCC CGACGGCGCA TCGCTGGACC GCACCGACGC GGTGCTGCGC
CAGATGTCGG ACCTCTCCAA GGGGGCGCCG GGCGTCCAGG ACGCCGTCCA GCACCCGGGC
CTGTCGATCA ATGGCTTTAC GACCTCGTCC AGTTCGGGCG TCGTGTTCCT GGGGCTGACG
CCGTTCGAAG AGCGGTACGG CCACGGCAAG CCGCTCGCGG CCGACCAGAT CGCCACGGCG
ATGACGGAGC GTTTTGGCGC GATCAAGGGC GCCAAGATCG GGGTGTTCAA TCCGCCACCG
GTTCTGGGTC TGGGCACCCT AGGCGGCTTC AAGTTTCAGA TCGAGGATCG CGGCGCCCAG
GGCTACGCCG CGCTCAACGA CGCCACGAAC GCCTTCATCA AGGCCGCGGC GCAGGAGCCG
GCCCTCGGTC CGATGTTTTC CAGCTACCAG GTCAACGTCC CGCAGCTGAA CGTCGACGTA
AATCGGGTGA AAGCCAAGCA ACTGGGCGTG TCGGTGACCG ACATCTTCAC GACGATGCAG
ATCTATCTGG GCTCGCTCTA CGTCAACGAC TTCAACCGCT TTGGCCGCGT CTACCAGGTC
CGAGCCCAGG CCGACGCCCC GTTTCGGGCC TATGCCGACC ACATCGGTTT GCTCAAAACC
CGCAATGCGG CCGGTGAGAT GGTGCCGCTG GGCAGCTTCT TGACCGTTAC GCCGGGCTAC
GGCCCTGAGA TGGTCGTCCG CTACAACGGC TTCACAGCCG CCGATATCAA TGGCGGTCCC
GCGCCGGGCT ATTCCTCCGA CCAGGCCAAG GCCGCCGTCG AACGCGTCGC GGCCAAGACC
TTGCCGGCCG GGTTCAAGTT CGAGTGGACC GACCTGACCT ATCAGCAGAT CCTGGCGGGC
AATTCGGCGC TGTGGGTGTT CCCGGTCAGC CTGCTGCTGG TGTTCATGGT CCTGGCCGCT
CAGTACGAGA GCCTGACCCT GCCGCTGGCC GTGATCCTGA TCGTACCGAT CAGCGTCTTT
GCGGCGCTGT TTGGGGTCTG GCTGCTGCGC GGCGACAACA ACATCTGCAC GCAGATCGGC
CTGATGGTGC TGGTGGGCCT GTCGGCCAAG AACGCCATCC TGATCGTGGA GTTCGCCCGC
GACCTGGAAA TGCAGGGCCG CAGCATCGTT GACGCCGCCG TCGAGGCCAG CCGCATGCGG
CTGCGACCGA TCTTGATGAC CTCGTTCGCC TTCATCATGG GTGTCATCCC CATGGTGCTG
TCGAGTGGCG CGGGCGCGGA AATGCGCCGG GCCATCGGGG TGGCGGTGTT CTTCGGAATG
CTCGGGGTGA CGCTGTTTGG CCTGATGCTG ACGCCGGTGT TTTACGTGCT GCTGCGGCTT
CTGGCCGGGG CTCCGCCGAT CATCGACAAG AACCACACCC ACACCCCCAT CATCGAGTTG
GGCGAGCACG TCGACGAGCC GCTCGAGGCT CAAGCGTGA
 
Protein sequence
MNISKYFIDR PIFASVLSAV VLLGGIISVF KLPISEYPDV IPPQVLVHAE FPGANPKVIA 
ETVAAPIEEK INGVQDMLYM QSQANSDGKM TTTVTFKLGT NPDLAQQLVQ NRVTQAVPHL
PEDVQRLGVT TVKASSTMTL AVQISGDQNH DLAYLRNYAL INIKDRLARI PGVGDVQLYG
AGDYAMRIWL DPQKLAQRNL TATDAVAAIR EQNVQVSAGM IGGTPNVPGV PLQLDVNVKG
RLQSVEEFGE IVVKTGPGGG VIYLRDVARI ELGAAEYGQR TALNNGRSMA IWIFQLPGAN
ALQIADAVRK TMAEEIGPEM PAGVSYRIVY DPTRFVKASI EAVIHTLLEA VALVVLVVIM
FLQTWRASVI PLLAVPVAIV GTFAFLLGFG YSINALSLFG LVLAIGIVVD DAIVVVENVE
RNLEAGLSPK AATYKAMQEV SGPIIAIALT LIAVFVPLAF MSGLTGQFYK QFAVTIAVST
VISAFNSLTL SPALAALLLR APHQPKDWLT RVIDRLFGPA FNVFNTVFKR GSHAYGSGVT
SALGRKSIML VLYAVLVGGA VLMGKLVPGG FVPAQDKGYL IAVAQLPDGA SLDRTDAVLR
QMSDLSKGAP GVQDAVQHPG LSINGFTTSS SSGVVFLGLT PFEERYGHGK PLAADQIATA
MTERFGAIKG AKIGVFNPPP VLGLGTLGGF KFQIEDRGAQ GYAALNDATN AFIKAAAQEP
ALGPMFSSYQ VNVPQLNVDV NRVKAKQLGV SVTDIFTTMQ IYLGSLYVND FNRFGRVYQV
RAQADAPFRA YADHIGLLKT RNAAGEMVPL GSFLTVTPGY GPEMVVRYNG FTAADINGGP
APGYSSDQAK AAVERVAAKT LPAGFKFEWT DLTYQQILAG NSALWVFPVS LLLVFMVLAA
QYESLTLPLA VILIVPISVF AALFGVWLLR GDNNICTQIG LMVLVGLSAK NAILIVEFAR
DLEMQGRSIV DAAVEASRMR LRPILMTSFA FIMGVIPMVL SSGAGAEMRR AIGVAVFFGM
LGVTLFGLML TPVFYVLLRL LAGAPPIIDK NHTHTPIIEL GEHVDEPLEA QA