Gene Caul_3308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3308 
Symbol 
ID5900763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3587024 
End bp3590164 
Gene Length3141 bp 
Protein Length1046 aa 
Translation table11 
GC content67% 
IMG OID641563814 
Productacriflavin resistance protein 
Protein accessionYP_001684933 
Protein GI167647270 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCTT CGCAGCCCTT CATCCAGCGG CCGGTCGCCA CGGCGCTGTT CATGGCGGCC 
ATCGTCCTGG CTGGCCTGGT CGGCTTCAAG CTGCTGCCGC TGTCGGCCCT GCCGCAGGTC
GACTATCCCA CGATCCAGAT CCAGACCCTC TATCCCGGCG CCAGCCCCGA GGTGATGAGC
CAGACCGTCA CCGCGCCGCT GGAGCGTCAG CTGGGCGAGA TGGCGGGCCT GGCGCGAATG
AGCTCGGTCA GCACGGCCGG CGCCTCGATC ATCACCCTGC AGTTCAGCCT GGGCGAAGGC
CTGGACGTCG CCGAGCAGGA GGTGCAGGCG GCGATCAATG GCGCGAACAG CCTGCTGCCG
GCTGACCTGC CGGCGCCGCC GGTCTACGCC AAGGTCAACC CGGCCGACGC CCCGGTCCTG
ACCCTGGCGG TGACATCCGA CACCCTGCCG CTGACGCAGG TGCAGAACCT GGTCAACACC
CGCCTGGCCC AGAAGATCAG CCAGGTCTCG GGCGTCGGTC TGGTCAGCTT GAGCGGCGGC
CAGCGCCCGG CCATCCGCAT CCAGGCCGAC ACCCAGGCCA TGGCCAGCTA CGGCGTCACC
CTGGCCAATG TGCAGAGCGC GATCAGCAAC GCCAACGCCA ACAGCGCCAA GGGCAGCTTC
GACGGCCCCA CCCGCAGCTA TACGATCAAC GCCAATGACC AGCTTCTGAC CGTCGAGGAC
TATTCCAGCC TGATCGTCTC CTACAAGAAC GGCGCCCCGA TCCGCCTGCG CGACATCGCC
CAAGTGGTGC AGAGCGCCGA GAACACCCGC CTGGGCGCCT GGGCCAACAC GACGCCGGCC
ATCATCGTCA ACGTCCAGCG CCAGCCGGGC GCCAACGTCA TCGCCACGGT CGACGCCATC
AAGGCCAAGC TGCCGGAGCT GGAAGAGGCG CTGCCCTCGA CCCTGGAGGT CAAGGTGTTG
GCCGACCGCA CCACCGGCAT CCGCGGCTCG GTCCACCACG TGGAGATGGA GCTGCTGCTG
GCGGTCGTGA TGGTCGTGGT GGTGATCTTC TTCTTCCTGC ACAGCCTGCG CGCCACCCTG
ATCGCTGGCT TGGCCGTGCC GATCTCGCTG ATCGGCTCAT GCGGGGTGAT GTATCTGATG
GGCTTCTCGC TCAACAACCT GTCCCTGATG GCCCTGACCA TCGCGACGGG GTTCGTGGTC
GACGACGCCA TCGTCATGAT CGAGAACATC TCGCGGCACC TGGAAAAGGG CGTCAAGCCG
ATGGCGGCGG CCCTGCGGGG GGCGCGCGAG ATCGGCTTCA CGATTATCTC GCTGACCGTG
TCGCTGATCG CGGTGCTGAT CCCGCTGCTG TTCATGGGCG ACGTGGTGGG ACGGCTGTTC
CGCGAATTCG CCATCACCCT GGCGATCACC ATCCTGATCT CGGCCGTGGT CTCCCTGACC
CTGACCCCGA TGCTGTCGGC CCGCTGGCTC AAGCCGGAGG GCGAGGAGAA GCAGGGCCGC
ATTTCTCAAA AGTCCAAGGC GCTGTTCGAC AAGGTCGAGC ACCATTACGA GCGCGGCCTG
ACCTGGGTGC TGAAACGGCA GAAGGCCACC CTGGTGGTCG CCGTGGCCAC CTTCGCCCTG
ACCGCCGCGC TCTACATGGT GATCCCCAAG GGCCTGTTCC CGACCCAGGA CACCGGCCAG
CTCCAGGCCC GCACCGAAGT CGAGCAGTCG GTGTCCTACG ACCGCATGGC CGGGCTGCAG
CAGCAGGCGG CGCGGGCCAT CCTCGACGAC CCGGCGGTGG AGAGCCTCAG TTCGTTCATC
GGCGTCGACG GCGCCAACAA TTCGATGCTC CACACCGGCC AGATGCTGAT CAACCTGAAG
GCCGACCGCA AAGGCTCGCA GGAAAAGATC ATGCAGCGGC TGCGCGACCG CGTGGCCGCC
GTGCCGGGCG TGGCGCTCTA TCTGCAGCCC ACCCAGGACC TGACGATCGA CGCCGAGACC
GGCCCGACCC TCTACCGCCT GTCGCTGGAA GGCGCCGACA CCGCCACGGT CAAGGCATGG
GCCGGCAAGC TGGCCGAGCG GCTGGCGACG GTAAAGTCGG TGCGCAACGT CTATAGCGAC
GCCAGCGCCA CGGGGGCGGC GGCCTATGTC GATATCGACC GCGACACCGC CGCGCGACTT
TCGATCACCG CCTCCGATAT CGACGACGCC CTCTACAGCG CCTTCGGCCA GCGGATCGTC
TCGACGATCT TCACCGAGAC CAACCAGTAC CGCGTGATCC TCGAGGCCAA GCCCGGGCTG
CTGACCGCGC CGCAGTCGCT GGGCCTGCTG AACCTGAAGA CGGGCGGCGG CCAGCCCACG
CCGCTGTCGG CGATCTCGAC GATCAAGGAA CAGACGGCCC CGCTGCAGGT GACCCACGTC
GCCCAGTTTC CGGCCACCAC CATCGGCTTC GACACCGCGC CCGGCGTCTC GCTGGGCAAG
GCGGTCGACG ACATCCGCGC GGCGATGAGG GATATCGGCA TGCCGGTGGC TGTCAGCCAC
ACCTTCCTGG GCGCGGCCGG GGCCTATGAG AACTCGCTCA GCAACCAGCT GTGGCTGATC
CTGGCGGCCG TGGTCTGCGT CTACATCGTG CTGGGCGTGC TCTACGAGAG CTACATTCAC
CCGCTGACCA TCCTGTCGAC CCTGCCGTCG GCGGGGGTGG GCGCGCTACT GGCCCTGTGG
ATCACCGGCA ACGACCTGGG CGTGATCGGC ATCATCGGCA TCATCCTGCT GATCGGCATC
GTCAAGAAGA ACGCGATCAT GATGATCGAC TTCGCGATCG ACGCGCAGCG TAACCAGGGC
AAGAGCCCGC GCGACGCCAT CTTCCAGGCC GCCATCCTGC GCTTCCGCCC GATCCTGATG
ACCACTCTGG CGGCCCTGTT CGCGGCCGTG CCGCTGATGC TGAGCTTTGG CGAAGGCGGC
GAGCTGCGCA GGCCGCTGGG CATCGCCATC TTCGGCGGCC TGCTGGTCAG CCAACTGCTG
ACCATGTTCA CCACCCCGGT GATCTACCTG GCCTTCGACC GCCTGGCCCC AGCCGACAAG
GGGATCCACC GCGACCGTGA CGAGACGCCG CCGGTCGGCG ACCGCCTCGA TCCCGACCCG
ATCCCGGGGG CGCCGTCGTG A
 
Protein sequence
MNPSQPFIQR PVATALFMAA IVLAGLVGFK LLPLSALPQV DYPTIQIQTL YPGASPEVMS 
QTVTAPLERQ LGEMAGLARM SSVSTAGASI ITLQFSLGEG LDVAEQEVQA AINGANSLLP
ADLPAPPVYA KVNPADAPVL TLAVTSDTLP LTQVQNLVNT RLAQKISQVS GVGLVSLSGG
QRPAIRIQAD TQAMASYGVT LANVQSAISN ANANSAKGSF DGPTRSYTIN ANDQLLTVED
YSSLIVSYKN GAPIRLRDIA QVVQSAENTR LGAWANTTPA IIVNVQRQPG ANVIATVDAI
KAKLPELEEA LPSTLEVKVL ADRTTGIRGS VHHVEMELLL AVVMVVVVIF FFLHSLRATL
IAGLAVPISL IGSCGVMYLM GFSLNNLSLM ALTIATGFVV DDAIVMIENI SRHLEKGVKP
MAAALRGARE IGFTIISLTV SLIAVLIPLL FMGDVVGRLF REFAITLAIT ILISAVVSLT
LTPMLSARWL KPEGEEKQGR ISQKSKALFD KVEHHYERGL TWVLKRQKAT LVVAVATFAL
TAALYMVIPK GLFPTQDTGQ LQARTEVEQS VSYDRMAGLQ QQAARAILDD PAVESLSSFI
GVDGANNSML HTGQMLINLK ADRKGSQEKI MQRLRDRVAA VPGVALYLQP TQDLTIDAET
GPTLYRLSLE GADTATVKAW AGKLAERLAT VKSVRNVYSD ASATGAAAYV DIDRDTAARL
SITASDIDDA LYSAFGQRIV STIFTETNQY RVILEAKPGL LTAPQSLGLL NLKTGGGQPT
PLSAISTIKE QTAPLQVTHV AQFPATTIGF DTAPGVSLGK AVDDIRAAMR DIGMPVAVSH
TFLGAAGAYE NSLSNQLWLI LAAVVCVYIV LGVLYESYIH PLTILSTLPS AGVGALLALW
ITGNDLGVIG IIGIILLIGI VKKNAIMMID FAIDAQRNQG KSPRDAIFQA AILRFRPILM
TTLAALFAAV PLMLSFGEGG ELRRPLGIAI FGGLLVSQLL TMFTTPVIYL AFDRLAPADK
GIHRDRDETP PVGDRLDPDP IPGAPS