Gene Caul_3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3080 
Symbol 
ID5900535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3340201 
End bp3343338 
Gene Length3138 bp 
Protein Length1045 aa 
Translation table11 
GC content66% 
IMG OID641563583 
Productacriflavin resistance protein 
Protein accessionYP_001684705 
Protein GI167647042 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.907454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCCG ACCTTTCCGT CCGCCGCCCG GTCTTCGCGG CCGTTGTCGC GATCATCCTC 
TGCGTCATTG GCCTGGCTGC CTTCGGCAGC CTGCCGATCC GGGAACTGCC CAACGTCGAT
CCGCCCGTGG TGTCGATTTC GACCAGCTAT ACCGGCGCCT CCGCCGAGGT CATCGAAGAG
CGCATCACCC AGGTCATCGA GCGCCAGGTG GCCGGGATCC AGGGCATCGA CCGGGTCAAC
AGCTCCTCGC GTGACGGCCG CTCGCAGATC ACCATCACCT TCACTCTGGA CCGCGACCTC
GACGCCGCCG CCAACGACGT GCGCGACGCC GTTAGCCGGG TGACCTCGAA CCTGCCCGAC
CAGGCCGATC CGCCGCAGAT CGCCAAGGCC CAGGCCGACA GCTCGCCGAT CGTCATCCTC
AACCTGACCT CGACGACGCT GTCGCCGCTG GAGCTGGCCG ATTACGCCGA CCGTTATCTG
GTCGAGCGGT TGTCGACCGT GCCGGGCGTG GCCCAGGCCG GTCTCAACGG CCAACTCTAC
GCCATGCGCA TCTGGCTCAA TCCCGACGCC ATGGCCGCGC GTGGCGTCAC CGTCGACGAC
GTCGAGACCG CGCTGAACAA CCAGAACGTC GAACTGCCGG CCGGGGCGCT GGAAAGCGAC
GCCAAGGACT TCACCATCCG CGTGGCCCGC AGCTATTCGC GGCCCGAGGA CTTCGTGAAG
CTGCCGCTGC GCCCGGCCGA CGCCAATGGC TTCGTGGTCC GCCTGGGCGA CGTGGCGCGT
GTGGAGGAAG GCTCCAACGA GCGCCGCAAG ATCTTCCGCG GCAACGGCGT CAATCAGGTC
GGCATCTTCC TGACCCGCCA GTCGCAAGCC AACGACGTCG CCATTTCCCG GGTGGTCGAC
AAGGAACTTC TGGAAATCAA CAAGACGCTG CCGCCTGGCA CTCAACTGAT AAAGGCCGTC
GACAACTCGA TCTTCACCGC CGAGGCGATC CATGAGGTGT GGGTCACCAT GGGCATTTCG
ATCGGCCTGG TGGCGCTGGT GAACCTGGTG TTCCTGGGCA GTTGGCGTTC GGCCCTTATC
CCGTCGATCA TCGCGCCGAT CTGCATCCTG TCGACCTTCA TCATCCTGGC CCCGCTGGGT
TTCTCGCTGA ACCTGCTGAC CCTGCTGGCC CTGGTGCTGG CCGTTGGCCT GGTGGTCGAC
GACGCCATCG TGGTGGTCGA GAACATCCAG CGCCGGGTGG ACGAGGGGGA GCCGGCCAAT
GTCGCGGCCC TGCGCGGCAC CCGGCAGGTC TTTTTTGCCG TCGTGGCGAC CACCATCGTG
CTGATTTCGG TGTTCGCGCC GCTGATGTTC CTGCCTGGCT ATATCGGCCG GTTGTTCGTC
GAACTGGCCG TGGCGATCGC GGCCGCCGTG GGGTTCTCGG CTCTGCTGGC GCTAAGCCTG
TCGCCGATGC TGGCCTCCAA GCTGCTGAAA CCGGCCAGCG GTGGCAGCTG GTTCAACCGC
AAGGTCGACT ATGCGATGGA CAGGCTGAAG GACAGCTATC GTCATTCGCT GGAACGCCTG
CTGGCCGCGC GGTCCGCCAG CATCGGCGCA GGCTTGGTGG TGCTGTTCCT GGCCCTGGTG
GCCGGGGGGC TGTTCGCGGT CATGCCGCAG GAATTGGTGC CGGCCGAGGA TCGCGGCCGC
GTCGACATCG CCATCAACGG ACCCGAGGGG GCGGGCTTCG ACTACACCGC CAGGGCCGCC
GCCGCCGTCG AGAAGCAACT GGAGAAATAC CGCGCCGACG GGGTCGCCTC GCGGACCATC
ATCGCCGCGC CACGCTTTGG CCAGAACCAG TTCAACACCG CAAACGGCGT GCTGGTGCTC
AAGGACTGGG GCGAACGCGA CAAGACCGCC GATCAGGTGG CCGCCGAGCT GAACAAGACC
CTCAGCAAGA TCACCGGCGT GCGCGCCGTG GCCAGCGTGC GCGGCGCCTT CCAGCGCGGC
GGCGGCGGGG GCGGCGGCAC CAATGTCGAC CTGATCGCCA CCGGAAATGA CTATGTCGAG
CTGGCCAACT GGCTGAAGCC GATCCTGGCC GCCTCCCAGG ACAATCCGGG CCTGTCGCGC
CCACGCATCG ACTACGAGCC GACCTCGCCG CGCCTGTCCG TGCAGATCGA CCGCGACAAG
GCCGCCACCC TGGGCATATC GGCCCAGTCG ATCGGCCGGG TCCTTGAGAC GATGTTCGGC
TCGCGCCGGG CCACGACCTA CATCAAGAAC GGCCAGGAAT ACGACGTCAT CCTGCAGTCC
AGTCTCGATC AGCGTCGCAG CTTGGCTGAC CTCGACCGGC TCCAGGTGCG CACCCAGTCT
GGCGCGCTGG TGCCGCTGTC GACCGTGGTC ACCACGCAGT TGCGCGGTGA CGCCCCCGAC
CGTCCGCGTG TCGACCGTCT GCGCTCGGTG ACCCTCACGG CCCAACTGGC GCCCGGCTAC
ACGGTTGGCG ATGCGGTGAA GTTCTTCCAG GACCAGGCGG GCGCCAATCC GAGCCCCGGC
GTCAGCATCA AGTGGGGCGG TCAGGCCAAG GACTTCCTGG AGGCGTCCGG CGCCGTCGGC
ATCGCCTTTG GCTTGGCCCT GCTGCTGGTG TTCCTGGTGT TGGCCGCTCA GTTCGAAAGC
TGGATCCATC CGGCCGTGAT TATGCTGACC GTGCCGCTGG CCGCGCTGGG GGGGCTGTTT
GGCCTGCTGA TGACCGGTTC GACCATCAAC ACCTATAGCC AGATCGGTCT GATCATCCTG
GTCGGCATCG CCGCCAAGAA CGGCATCCTG ATCGTCGAGT TCGCCAATCA ACTGCGCGAC
GAGGGCCTGA AGGTGAAGGA GGCGGTGATC GAGGCGGCCG CCCTGCGTTT GCGCCCGATC
ATCATGACCT CGATCGCCAC CGCCATGGGC GCCTTGCCCC TGATGTTGTG GACCGGGGCC
GGGGCGGGCA GCCGCAAGAC GATCGGGGCG GTGATCTTCA CCGGCGCGAT CTTCTCGACC
CTGCTGACGT TGTTCATCGT GCCGGTATTC TACAACCTGC TGGCGCGCTT CACGAAGTCA
CCGGAGTGGA CGGCGCGGCA GATCGAAGAG TATGAAGCCC GCGAGAAGAG TGGTGAGGGG
CAAGCCTCAC CCGTCTGA
 
Protein sequence
MLSDLSVRRP VFAAVVAIIL CVIGLAAFGS LPIRELPNVD PPVVSISTSY TGASAEVIEE 
RITQVIERQV AGIQGIDRVN SSSRDGRSQI TITFTLDRDL DAAANDVRDA VSRVTSNLPD
QADPPQIAKA QADSSPIVIL NLTSTTLSPL ELADYADRYL VERLSTVPGV AQAGLNGQLY
AMRIWLNPDA MAARGVTVDD VETALNNQNV ELPAGALESD AKDFTIRVAR SYSRPEDFVK
LPLRPADANG FVVRLGDVAR VEEGSNERRK IFRGNGVNQV GIFLTRQSQA NDVAISRVVD
KELLEINKTL PPGTQLIKAV DNSIFTAEAI HEVWVTMGIS IGLVALVNLV FLGSWRSALI
PSIIAPICIL STFIILAPLG FSLNLLTLLA LVLAVGLVVD DAIVVVENIQ RRVDEGEPAN
VAALRGTRQV FFAVVATTIV LISVFAPLMF LPGYIGRLFV ELAVAIAAAV GFSALLALSL
SPMLASKLLK PASGGSWFNR KVDYAMDRLK DSYRHSLERL LAARSASIGA GLVVLFLALV
AGGLFAVMPQ ELVPAEDRGR VDIAINGPEG AGFDYTARAA AAVEKQLEKY RADGVASRTI
IAAPRFGQNQ FNTANGVLVL KDWGERDKTA DQVAAELNKT LSKITGVRAV ASVRGAFQRG
GGGGGGTNVD LIATGNDYVE LANWLKPILA ASQDNPGLSR PRIDYEPTSP RLSVQIDRDK
AATLGISAQS IGRVLETMFG SRRATTYIKN GQEYDVILQS SLDQRRSLAD LDRLQVRTQS
GALVPLSTVV TTQLRGDAPD RPRVDRLRSV TLTAQLAPGY TVGDAVKFFQ DQAGANPSPG
VSIKWGGQAK DFLEASGAVG IAFGLALLLV FLVLAAQFES WIHPAVIMLT VPLAALGGLF
GLLMTGSTIN TYSQIGLIIL VGIAAKNGIL IVEFANQLRD EGLKVKEAVI EAAALRLRPI
IMTSIATAMG ALPLMLWTGA GAGSRKTIGA VIFTGAIFST LLTLFIVPVF YNLLARFTKS
PEWTARQIEE YEAREKSGEG QASPV