Gene Caul_0806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0806 
Symbol 
ID5898261 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp856925 
End bp860053 
Gene Length3129 bp 
Protein Length1042 aa 
Translation table11 
GC content69% 
IMG OID641561287 
Productacriflavin resistance protein 
Protein accessionYP_001682435 
Protein GI167644772 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.326964 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGCG GACAAAAGAC CGGCGAGCTG CGCATCTCCG CCTGGGCGAT CAAAAACCCG 
ATCCCCGTCG CGGTCCTATT CATCGCCCTG ATCATCGCCG GGATCGGCTG CTACATGGGC
CTGCCCATCA AGCAGTTCCC CAATGTCGAG TTCCCCGCCG TGACCGTGAC GGTCACCCAG
AGCGGCGCGG CGCCCGGCGA GATGGAGACC CAGGTCACCC GCCCCATCGA GGACGCGGTG
GCCAGCATCT CCAACGTCAA GACCATCCGC TCGTCTGTGG TCCAGGGGAC CTCGACCACC
ACGATCGAGT TCAACCTCGG CGAGGACCTG CAGAAGGTCA CCGACGAGGT GCGCTCCAAG
ATCGACCAGA CGCGCTCGCT GCTGCCCAAG GAGGTCGACG AGCCGATCGT CCAGCGCCTG
GAGATCACCA GCGCCCCGAT CATCACCTAC GCCGTCGCCG CGCCGAACAT GAGTTCGACC
GAGCTGTCCT GGTTCATCGA CGACACCGTC ACCCGCGCCC TGCAAGGCGA AAAGGGCGTG
GCCCAGGTGG CCCGGGTCGG CGGCGTCGAC CGCGAGATCA ACGTCGTCAT CGATCCCGAC
CGCATGGCCT CGTTCGGCGT CACCGCCCCG CAGTTGAACC AGGCCCTGGC CAGTTTCAGC
GTCGACGCCC CCGGCGGCCG CGCCAAGATC GGCGGCCGCG AGCAGACCCT GCGCGTGCTG
GGCGCGGCCA CCACGGTCGA GCAGCTGCGA AACATCACCA TTCCGATCAC CGGCGGTCGC
TATGTGAGGC TGACCGACGT AGCCCAGGTC GGCCAGGGCT CGGAAGAGGA GCGCGGCTTC
GCCCGCCTCG ACAACAGGCC CGTGGTCGCC TTCCAGGTGA TGAAGACCCG CGACTCCAGC
GATGTGGCCG TCGAGGACCG CGTCAAGGCG GCCGTCGACA GGCTGGAGGC CAAGCAGCCC
GGCGTCAGCT TCGTGAAGAT CTTCTCGACC GTCGACGAGA CCCGCGCCAG CTTCGCGGCC
ACCGAGCACA CCCTGCTGGA AGGCATGCTG CTGGCCTCGC TGGTGGTGTT CCTGTTCCTG
CGCGAGTGGC GGGCCACCCT GATCACCGCC ATCGCCATGC CGGTGTCGCT GATCCCCACC
TTCGCCTTCA TGGCGATCAT GGGCTTCTCG CTCAACGTCG TGACCCTGCT GGCCCTGACC
CTGGTCATCG GCATCCTGGT CGACGACGCC GTGGTCGAGA TCGAGAACAT CGAGAAACGC
GTCGCCCGCG GCCAGCGGCC GTTCCAGGCG GCGATGGAGG GCGCGGACTC CATCGGCCTG
GCCGTCGTGG CCACCACCTT CACCATCGTG GCGGTGTTCG TGCCGGTGTC CTACATGCCC
GGCACGCCCG GCCAGTTCTT CAAGGAATTC GGCCTGACCG TGGCGATGGC GGTGCTGTTC
TCGCTGGTCG TGGCCCGCCT GCTGACCCCG CTGCTGGCCG CCTATTTCCT CAAGCCCGCC
AAGGATCCTC ACCCCCGCCC CGAGTTCAAG GGCTTCTATC GCGGCGTGCT GGACTGGTCG
CTGGATCACC GCTTCCTCAG CATCATCATG GGCACGGTGA TCCTGGTCGG CTCGTTCGCC
TTGGTGAAGT TCATTCCCAC CGCCTTCCAG CCGGCGGGCA ACGCCAACTA CTACTATCTG
AAGGTCCAAG GTCCGCCCGG GGCGACGACC GCCGACATGG AGCGCACCGT CCAGGCGGTG
ACCACCATGT TCCGCAAGCG CCCCGAGACC GCCCACGTCT TCGCCCAGGT CGGCTCGAAC
ATCGGCAGCG GCTGGGGCGG TCAGAGCGGC GCCGACATCC GCGACGCCAC CATCACCGTC
GTGCTGAACG GCAAGCGCGA CCTGACGGTC ACCCAGATCA AGCAGGTGGT CCGCAACGGC
CTGCACGACA TTCCCGACGC TCGCGTCAAC CTGCTGGGCG ACTGGGGCAC TTCGGAGGTC
CAGACCATCC TGATCTCCGA CGACGGCCCG CTGCTGGAAC GCACCGCCGC CCAGATCGAG
CGCGAGATGC AGTCGCTGAG CACCGTGGCC GATCCGCGCC CGTCCTCGCC GCCCAGCGGC
CCGGAGATCG TCATCCGGCC CAAGCCCGAC GAGGCCGCCC GCCTGGGCGT CAGCGCCGCC
GACATCGCCG CCATCGCCCG CGTGGCCACG GTCGGCGACA TCGACGCCAA TGTCGCCAAG
ATGACCCAGG GCGAGCGCCG GATCCCGATC CGCGTGCGCC TGCCCGCGGA AACCCGCGCC
GACCTCGACG CCCTGGGCGC CCTGCGCGTG CCCACGGCCG GCGGCGGCTC GACCCGGCTC
GACACGGTCG CCGACCTGTC GTTCCAGGCG GGCCCGGCCA AGATCGACCG CTTCGCCCGC
AAGCGGCAGG TGACCATCGA GGCCGACCTC GCCAACGGCG CCCAGCTGGG CCAGGCAATG
GCCGACGTGG GCAAGCTGCC GACCATGAAG AGCCTGCCGG CCAGCGTCGG GCCGGCCACG
GCTGGCGACC AGGAGGCGTT CGTCGAACTG TTCACCGGCT TCGCCGTCGC CCTGCTCTCG
GCGGTCGGCC TGGTGTTCGG CGTGCTGGTG CTGCTGTTCC GCAGCTTCTT CAAGCCGATC
ACCATCCTGT CGGCCCTGCC CCTAGCGATC GGCGGCGCGT TCCTGGCCCT GCTGGTCACC
GGCCAGTCGC TGTCGATGCC GTCGCTGATC GGCTTCCTGA TGCTGATGGG CCTGGCGGCC
AAGAACTCGA TCCTGCTGGT CGAATACGCC ATCGAGCAGG AGCGGGCTGG CATGAGCCAG
CGCGACGCCA TCCTCGACGC CTGCCACGAG CGAGCCCGGC CGATCGTCAT GACCACCCTG
GCGATGATGG CCGGCATGCT GCCCACGGCG CTGGGCATCG GCACGGGTTC GGAGTTCCGC
CAGCCCATGG CCGTGGCGGT GATCGGCGGC CTGATCACCT CGACCGTGCT GTCGCTGGTG
CTGGTGCCGG TGGTTTATGA GATCGTCGAC GACATCGAAC AATGGCTGAC GCCGAAGCTC
TCGCGCTGGA CCACCCCGCG CGAGGCGGCT GGGGCGACCG GCGCGGCCTC GCCGGTGGAT
CGACTCTAG
 
Protein sequence
MAGGQKTGEL RISAWAIKNP IPVAVLFIAL IIAGIGCYMG LPIKQFPNVE FPAVTVTVTQ 
SGAAPGEMET QVTRPIEDAV ASISNVKTIR SSVVQGTSTT TIEFNLGEDL QKVTDEVRSK
IDQTRSLLPK EVDEPIVQRL EITSAPIITY AVAAPNMSST ELSWFIDDTV TRALQGEKGV
AQVARVGGVD REINVVIDPD RMASFGVTAP QLNQALASFS VDAPGGRAKI GGREQTLRVL
GAATTVEQLR NITIPITGGR YVRLTDVAQV GQGSEEERGF ARLDNRPVVA FQVMKTRDSS
DVAVEDRVKA AVDRLEAKQP GVSFVKIFST VDETRASFAA TEHTLLEGML LASLVVFLFL
REWRATLITA IAMPVSLIPT FAFMAIMGFS LNVVTLLALT LVIGILVDDA VVEIENIEKR
VARGQRPFQA AMEGADSIGL AVVATTFTIV AVFVPVSYMP GTPGQFFKEF GLTVAMAVLF
SLVVARLLTP LLAAYFLKPA KDPHPRPEFK GFYRGVLDWS LDHRFLSIIM GTVILVGSFA
LVKFIPTAFQ PAGNANYYYL KVQGPPGATT ADMERTVQAV TTMFRKRPET AHVFAQVGSN
IGSGWGGQSG ADIRDATITV VLNGKRDLTV TQIKQVVRNG LHDIPDARVN LLGDWGTSEV
QTILISDDGP LLERTAAQIE REMQSLSTVA DPRPSSPPSG PEIVIRPKPD EAARLGVSAA
DIAAIARVAT VGDIDANVAK MTQGERRIPI RVRLPAETRA DLDALGALRV PTAGGGSTRL
DTVADLSFQA GPAKIDRFAR KRQVTIEADL ANGAQLGQAM ADVGKLPTMK SLPASVGPAT
AGDQEAFVEL FTGFAVALLS AVGLVFGVLV LLFRSFFKPI TILSALPLAI GGAFLALLVT
GQSLSMPSLI GFLMLMGLAA KNSILLVEYA IEQERAGMSQ RDAILDACHE RARPIVMTTL
AMMAGMLPTA LGIGTGSEFR QPMAVAVIGG LITSTVLSLV LVPVVYEIVD DIEQWLTPKL
SRWTTPREAA GATGAASPVD RL