Gene Caul_1280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1280 
Symbol 
ID5898735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1342688 
End bp1345801 
Gene Length3114 bp 
Protein Length1037 aa 
Translation table11 
GC content69% 
IMG OID641561765 
Productacriflavin resistance protein 
Protein accessionYP_001682908 
Protein GI167645245 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0345102 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATA TTTCCCGCTG GGCGATCAAG AACCCGATCC CCGTCATCCT GCTGTTCCTG 
CTGCTGACCC TGGCCGGGAT CTCCGGCTTC AGCAGCATGC GGATCAACGA CAATCCCGAC
GTCGACCTGC CCTTCGTGGT GGTCACCGCC TCGCGCCCCG GCGCCGCGCC CACCGAGCTG
GAAACCCAGG TCACCCGGCT GATCGAGGAC TCGATCGCCG GTCTGGGCCA GGTGCGCCAC
ATCAGCTCGA CGGTCGGTGA CGGCTATTCC TCGACCTTCA TCGAGTTCGA GCTGGGCGTC
GACCACGAGC GGGTGACCAA CGACGTCCGC AACGCCATGG CCAATCTGCA GAGCTCCCTG
CCGCAGGACA TGCAGATCCC CAACGTCACC CGCATCGACA TCTCGGGCGA CGCGCTGATC
ACCTATGTGG TCCAGGCCCC CACCCTGACG CCCGAGCAAC GCAGCTGGTT CGTCGACAAC
GACGTCAGCC GCGCCCTGCT GGCCATCAAG GGCGTCGGCG AGGTCAACCG CCAGGGCGGC
GTCAGCCGCG AGATCCAGGT GGCGCTCGAC CCCGACCGCC TGTCGGCCCG CGGCGTCACG
GCCGCCCAGG TCAGCCAGGC CCTGCAATCG GCCAACGCCG ACCTGCCCGG CGGCCGTGTC
ACCGTCTCGG GCTCCGAACG CGCCATCCGC ACCCTGGGCG CGGCCGGCTC GGTCGACCAG
CTGCGCGAAA CCCGCGTCCA GCTGGGCAAC GGCGAGACCG TGCGCCTGGG CGACCTGGGC
GACGTCGAGG ACCGCTGGGC CGAACCGCGC AACCGCGCGC GGTTCAACAA CCAGGAGGTC
GTGACCTTCA ACATGGTCCG CTCGCGCGGC GCCAGCGAGG TCAAGGTCGC CGAGAAGGTC
CGCAAGGAAG TCGAGAAGCT CGACAAGGCC CATCCCGAGC TGAAGATCGT CGAGCTGACC
TCGAACGTGA AATATATCGA GCAAAGCTAC TACGCTTCGC TCGAGGCCCT GGGACTGGGC
GCCCTGCTGG CGGTGCTGGT GGTGCTGCTG TTCCTGCGCG ACTGGCGGGC GACGTTTCTG
GCGGCCGTGG CGATCCCGCT GTCGCTGTTG CCCACCTTCG CGGTCATGGC CCCGCTGGGC
CAGTCGCTGA ACGGCGTCAC CCTGCTGGCC CTGTCTCTGA CGGTCGGGAT CCTGGTCGAC
GACGCCATCG TCGAGATCGA GAACATCGTC CGGCACATGC GCGGGGGCAA ATCGCCCTAT
GACGCGGCCA TGGAGGCCGC CGACGAGATC GGCCTGGCCG TGGTGGCCAC CACCTTCACC
ATCGTGGCGG TGTTCGCGCC GGTCGGCTTC ATGCCCGGCA TCGTCGGCCA GTTCTTCAAG
GCCTTCGCCC TGGCGGCCTG CATCTCGGTG CTGTTCTCGC TTCTGGTCGC CCGCATGCTG
ACGCCGCTGA TGGGCGCCTA CATGCTCAAG GCCCACAACA AGCCTGACGC CGATCCGTTC
TGGATGGGTC CCTATCTTAG GGCCCTGAAC TGGGCGCTGG GCCACAAGAT CATCGTCGCC
CTGCTGGCCT TGCCGATCCT GGTCGGCTCG TTCCTGCTGG CCCAGAAGCT GCCGTTCGAG
TTCCAGCCGG CGGCCGACCG CGGCCGCGCG CCGTTCAGCG TCGAACTGCC CCCGGGCGCC
ACCCTCGACG AGACCGACGC CATCGTGATG CGGATGACCC GCGAGTTGGA GGCCCGGCCC
GAAGTGACCG GCGTCTACGC CTCGGTTGGC AGCGACGGCG TCAACCAGGC CCGCGTCACG
GCCGACCTTG TGCCGAAGGG CCAGCGGCTC AGCCAGCGCG ACTTCAGCCG CCAGATGGTC
GACCAGTTCA AGGCGATCCC CGGCGCGCGG ATCGGGGCCG GCGGCAACAA CGGCGGCGGT
CCCAGCGACG GCTCGGCCTA CACCCTGCGC CTGGTCGGCG ACAACGGTCC GCAACTCGAG
GCGGCGGCCC GTGCGATCGA GGCCCAGATG CGCGGCGTCA AGGGCCTGGC CAACGTGGTC
AACACCGCGG CCATCGCCCG TCCCGAGATC ATCGTCTCGC CCAAGCCCGA CCAGGCCGCC
CGGGCCGGCG TCTCGGCCGG CGCGATCTCG CAGGCCGTGC GCGTGGCCAC GATCGGCGAC
ATCGACCAGT CGTTGCCCAA GTACAATCTG GGCGACCGAC AAGTGCCGAT CCGCCTGCGC
CTGACCGACG ACGCGCGCGA GAACATGTCG GTGCTGGAGA CCCTTCAGGT TCCCTCCAGC
CACGGCGGTT CGGTGCCGCT GAACGCGGTG GCCGACGTGC GCTTCGGGGC TGGTCCCTCG
CAGATCTCGC GCCAGGACCG CTCGCGGGTG GCCGAGATCA CCGCCGAGCT CGACGGCATC
GTCGTCGGCG ACGCGGCCAA GGCCGTCCAC GCCCTGCCAG CGGTCAAGAG CCTGCCGCCT
GGCGTGAAGG AAGCCCCGGC CGGCGACGCC GAGTTCATCG CCGAGATGCT GACCGGCTTC
GTCGCCGCCT TCGTGACCGG CATTCTCCTG ATGTACGTCG TGCTGGTGCT GCTGTTCCGC
AGCTTCGCCC ACCCGGTGAC CATCATGGCC GCCCTGCCCC TGGCCATCGG CGGCGCCTTC
GCTCTGCTGG TGCTGGCCCA CTCGAGCTTC TCGATGTCGA CCCTGATCGG CATCCTGATG
CTGATGGGCA TCGCGGCCAA GAACTCGATC CTGCTGGTCG AGTACGCGAT CATGGCCATG
AAGGGCGGCA TGACCCGCCG CGAGGCCCTG GTCGACGCCG CCCACAAGCG GGCCCGTCCG
ATCATCATGA CCACGGTCGC CATGGCCGCG GGCATGATGC CGGTGGCGCT CGGCTTCGGC
GCCGACGTGG AGTTCCGGGC GCCGATGGCC ATCGCGGTGA TCGGCGGGCT GATCACCTCG
ACCTTCCTGT CCCTGCTCTA CATCCCGGTG GTGTTCGTGT TCATGGACAT GCTGAAGAAC
TGGTCCAGCA AGATCATCGA CAAGCTGTTC CAGCATCAGA AGCGCCCGCC CAGCCACGCG
CCCCACGGCC CTGGCCCCGC GCCCGAGCCG ATCGTCAGCG GCCGGGAGCT TTAA
 
Protein sequence
MNNISRWAIK NPIPVILLFL LLTLAGISGF SSMRINDNPD VDLPFVVVTA SRPGAAPTEL 
ETQVTRLIED SIAGLGQVRH ISSTVGDGYS STFIEFELGV DHERVTNDVR NAMANLQSSL
PQDMQIPNVT RIDISGDALI TYVVQAPTLT PEQRSWFVDN DVSRALLAIK GVGEVNRQGG
VSREIQVALD PDRLSARGVT AAQVSQALQS ANADLPGGRV TVSGSERAIR TLGAAGSVDQ
LRETRVQLGN GETVRLGDLG DVEDRWAEPR NRARFNNQEV VTFNMVRSRG ASEVKVAEKV
RKEVEKLDKA HPELKIVELT SNVKYIEQSY YASLEALGLG ALLAVLVVLL FLRDWRATFL
AAVAIPLSLL PTFAVMAPLG QSLNGVTLLA LSLTVGILVD DAIVEIENIV RHMRGGKSPY
DAAMEAADEI GLAVVATTFT IVAVFAPVGF MPGIVGQFFK AFALAACISV LFSLLVARML
TPLMGAYMLK AHNKPDADPF WMGPYLRALN WALGHKIIVA LLALPILVGS FLLAQKLPFE
FQPAADRGRA PFSVELPPGA TLDETDAIVM RMTRELEARP EVTGVYASVG SDGVNQARVT
ADLVPKGQRL SQRDFSRQMV DQFKAIPGAR IGAGGNNGGG PSDGSAYTLR LVGDNGPQLE
AAARAIEAQM RGVKGLANVV NTAAIARPEI IVSPKPDQAA RAGVSAGAIS QAVRVATIGD
IDQSLPKYNL GDRQVPIRLR LTDDARENMS VLETLQVPSS HGGSVPLNAV ADVRFGAGPS
QISRQDRSRV AEITAELDGI VVGDAAKAVH ALPAVKSLPP GVKEAPAGDA EFIAEMLTGF
VAAFVTGILL MYVVLVLLFR SFAHPVTIMA ALPLAIGGAF ALLVLAHSSF SMSTLIGILM
LMGIAAKNSI LLVEYAIMAM KGGMTRREAL VDAAHKRARP IIMTTVAMAA GMMPVALGFG
ADVEFRAPMA IAVIGGLITS TFLSLLYIPV VFVFMDMLKN WSSKIIDKLF QHQKRPPSHA
PHGPGPAPEP IVSGREL