Gene Caul_5106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5106 
Symbol 
ID5897298 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp22468 
End bp25617 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content65% 
IMG OID641555209 
Productacriflavin resistance protein 
Protein accessionYP_001676540 
Protein GI167621755 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCA ATCTCTCCGC CGTCGCCGTC AAACATCGGG CGGTGACGCT GTTTCTGATC 
ATCGCGATCA CGATCGCGGG TGGTTTTGCC TTCATGAAAC TGGGCCGCGC CGAGGATCCC
AGCTTCACGA TCAAGGTGCT CACGATCGTC AGCGCCTGGC CCGGCGCCAC GGCTGAGGAG
ATGCAAAACC AGGTCGCCGA ACCGCTGGAA AAGCGGCTTC AGGAGCTGAA GTACTACGAC
CGGACCGAGA CCTTCACCCG TCCGGGTCTG GCCTTCATTA CCCTGACGCT GAAGGACAAC
ACCCCGCCCA AGGCCGTGGC CGAGGAGTTC TACCAGGCGC GCAAGAAGAT GGGCGATGAG
GCGCCCAAGC TTCCCCGCGG CGCGGTGGGT CCGTTCGTCA ATGACGAATA TTCGGACGTC
ACCTTCGCCC TCTACGCGCT CAAGGCCAAG GGCGAGCCGC CCCGGTTGAT GGCGCGCGAG
GCCGAGACGA TCCGCCAGCG CCTGCTGCAC GTGCCCGGCG TCAAGAAGGT CAACATCATC
GGCGAGCGTC CCGAGCGGAT CTATGTCGAG TTCTCCTATG CTCGCCTGGC CAATCTCGGC
GTTTCGGCGC GCGATATCTT CTCGGCCCTG GCCACGCAAA ACCTGGTGAC GCCGGCCGGC
TCGATCGAGA CTCAAGGCCC GCAGGTCCAG GTGCGCCTGG ATGGCGCGTT CGACGATCTG
CAAAAGATCA AGGACACGCC GATCGTCGCG GCGGGCCGCA CGCTCAAGCT CTCCGACATC
GCCGACGTCA AGCGCGGCTA TGAGGATCCG GCCACCTTCC TGATCCGCCA CAACGGCGAG
CCCGCCGTCG AGCTGGGCGT GATCATGAAA GAGCGTTGGA ACGGTCTGGT GCTGGGCAAG
GCGCTTGAAG GCGAGGTCCA GAAGATCCAG TCGGAAATGC CGATGGGCCT GTCGCTGGTG
AAGATCACCG ACCAGGCCGT GAACATCAGC GAGTCGATCA ACGAATTCAT GCTGAAGTTC
TTCGTGGCCC TGGCCGTGGT GATCACCGTC AGCCTGATCA GCCTGGGCTG GCGCGTGGGC
ATCGTGGTCG CCCTGGCCGT CCCGCTAACG CTGTCGGGCG TGTTCTTGAT CATGTTGGTG
ACCGGCCGCG ACTTCGACCG CATCACCTTG GGCGCCCTCA TCCTGGCGCT TGGCTTGCTG
GTTGATGACG CAATCATCGT CATCGAGACC ATGGTGGTGA AAATGGAGGA GGGATTCGAC
CGGATCGCCG CCTCCAGCGC CGCCTGGGTC AACACCGCCG CGCCGCGTCT GGCCGGCGCC
CTGGTCACGG CCATCGGCCT GATGCCAGTC GGCTTTGCGC CCTCCAGCGC CGGCGAATAC
GCCGGCAACA TCTTCTGGAT CGTGTTCTTC GCCCTGTTGA TCTCCTGGGC CGTGGCCGGC
GCCTTCACGC CCTATCTCGG CGTCAAGCTT CTGCCCGACA TCAAGCCGAT CCCCGGTGGC
CACGAAGCCA TCTACGGCAC GCCCAACTAT CAAAAGCTCC GTGGCTTGAT CAGCGGCGCC
GTCCACCACA AGGGCGTCGT GGTCGCCGTG GTGGTGGCGG TCTTCGTCGT CGCCGTCATG
GGCATGGGCC ACCTGAAACA GCAGTTCTTC CCGGAATCGG ACCGCCCCGA GGTCTTCGTC
GAGGTGCAGA TGCCCGAAGG CACCGGCATC GAGCAAACGA CCGCGGCGGT GGAGAAGGTG
GAAGCCTGGC TGCGCAAGCA GCCCGAGACC GAGATCGTCA CCAGCTATGT CGGCGCTGGG
GCCCCGCGCT TCTACCTGGC GATCTCGCCG GAATTGCCCG ACCCCTCGTT CGCCAAGATC
GTCGTGCTGA CCAAAAACCC CAAGGCCCGC GAGGAGCTCA AGCACCGCGT GCGCCACGCC
GTCGCCGCCG GCTTGGCGCC CGAAGCCAAG GTCCGCGCCA CCCAGATCGT CTTTGGTCCC
TACTCGCCGT TCCCGGTCGC CTTCCGCGTC TCTGGCCCCG ATACGGCCAA GGTTCGTGAG
ATCGCCGACC AGGTCAAAGC CGTGATGATC GCCAGCCCCA ACATGCGTCA GGTCAACACC
GACTGGAGCG AGCGGGTGCC CACCGTGCAC TTTGTCCTCG ACCAGAACCG CCTGCGCGCC
TTGGGCCTGT CCTCGAACGA CGCGTCTGAA CAGCTGCAGT TCCTGCTGAC CGGCGCGCCG
ATCACCCAGG TCCGCGAAGA CATTCGCACC GTCGAGGTCG TCGCCCGCAG CGCCGGCGGC
GAGCGCCTGG ATCCGGCTCG CCTGGGTGGC TTCACCCTGG TCGGTTCGGC GGGTCAACGC
GTGCCGCTCG ACCAGATCGG CAAGGTCGAG ATCCGCATGG AGGATCCCAT TCTCCACCGA
CGCGACCGCA TGCCGACCAT CACCGTGCGC GGCGATATCA GCGAGGCCAA GCAGCCGCCG
GACGTGTCGA TGGAAATCAT CGGCAAGATC AAGCCGATCA TGGACAAGCT GCCCGAGGGC
TATGCGATCG AACCGGGCGG ATCGCTGGAA GAAGCCGGCA AGGCCAACGT CGCTCTGGCT
TCGGTCTTCC CGATCATGCT GGTGCTGATG CTGACGGTGA TCATGTTCCA GGTCCGGTCC
TTCCCGGCCA TGTTCATGGT TATCCTGACC GCGCCGCTGG CGCTGGTGGG GATGGTGCCG
ACCCTGATGG TGTTCGGCTC CCCGTTCGGG TTCAACGCCA TCCTGGGCTT CTTTGGCCTG
GCCGGGATCA TCATGCGCAA CACGCTGATC CTCATCGGCC AGATCCACGC CAACGAGCAC
GAGGGGCTGG ATCCGTTCCA CGCCGTGGTC GAGGCGACGG TTCAGCGGGC CAGGCCAGTG
ATCCTCACCG CCCTGGCCGC GGTGCTGGCC TTCATCCCGC TGACCCTATC GGTGTTCTGG
TCATCCCTGG CCTTCACCCT GATCGGCGGC ACCATCGGTG GGACGATCCT CACCCTGGCC
TTCCTGCCGG CCCTCTATGC CCTGTGGTTC AACATCCGCG AGGGCAAGTC GGATGGCGGG
GATGAGCAGA CCGGGATCCT CGGTCGCCTC GTCGGCCGGG CGATGAGCGG GCGGCGGCGC
CAATCTCCGG CTCCGGGCGC CGCGATCTAG
 
Protein sequence
MSFNLSAVAV KHRAVTLFLI IAITIAGGFA FMKLGRAEDP SFTIKVLTIV SAWPGATAEE 
MQNQVAEPLE KRLQELKYYD RTETFTRPGL AFITLTLKDN TPPKAVAEEF YQARKKMGDE
APKLPRGAVG PFVNDEYSDV TFALYALKAK GEPPRLMARE AETIRQRLLH VPGVKKVNII
GERPERIYVE FSYARLANLG VSARDIFSAL ATQNLVTPAG SIETQGPQVQ VRLDGAFDDL
QKIKDTPIVA AGRTLKLSDI ADVKRGYEDP ATFLIRHNGE PAVELGVIMK ERWNGLVLGK
ALEGEVQKIQ SEMPMGLSLV KITDQAVNIS ESINEFMLKF FVALAVVITV SLISLGWRVG
IVVALAVPLT LSGVFLIMLV TGRDFDRITL GALILALGLL VDDAIIVIET MVVKMEEGFD
RIAASSAAWV NTAAPRLAGA LVTAIGLMPV GFAPSSAGEY AGNIFWIVFF ALLISWAVAG
AFTPYLGVKL LPDIKPIPGG HEAIYGTPNY QKLRGLISGA VHHKGVVVAV VVAVFVVAVM
GMGHLKQQFF PESDRPEVFV EVQMPEGTGI EQTTAAVEKV EAWLRKQPET EIVTSYVGAG
APRFYLAISP ELPDPSFAKI VVLTKNPKAR EELKHRVRHA VAAGLAPEAK VRATQIVFGP
YSPFPVAFRV SGPDTAKVRE IADQVKAVMI ASPNMRQVNT DWSERVPTVH FVLDQNRLRA
LGLSSNDASE QLQFLLTGAP ITQVREDIRT VEVVARSAGG ERLDPARLGG FTLVGSAGQR
VPLDQIGKVE IRMEDPILHR RDRMPTITVR GDISEAKQPP DVSMEIIGKI KPIMDKLPEG
YAIEPGGSLE EAGKANVALA SVFPIMLVLM LTVIMFQVRS FPAMFMVILT APLALVGMVP
TLMVFGSPFG FNAILGFFGL AGIIMRNTLI LIGQIHANEH EGLDPFHAVV EATVQRARPV
ILTALAAVLA FIPLTLSVFW SSLAFTLIGG TIGGTILTLA FLPALYALWF NIREGKSDGG
DEQTGILGRL VGRAMSGRRR QSPAPGAAI