Gene Caul_0241 details

Gene Information

Locus tag: Caul_0241
Symbol:
ID: 5897515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 267094
End bp: 268665
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 69%
IMG OID: 641560725
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001681876
Protein GI: 167644213
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
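
The coordinate and length fields above agree with each other if the start and end positions are read as 1-based and inclusive, and if the gene length counts the terminal stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(267094, 268665)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the two printed values are the 1572 bp and 523 aa listed in the record.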


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.232516
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCC AGACCTTCAC CGACACCGAG CGCCGGCTGA CGCTGGGCGC CCTGATGATC 
GTCTTTCTGC TCAGCGCCCT GGACCAGACG GTGGTCTCCA CGGCCATGCC GCGGATCATC
GCCGAGCTCA ACGGCCTGAC GCTCTATGCC TGGGTCACCA CCGCCTACCT GCTGACCTCA
ACGGTGATGG TGCCGATCTG GGGCAAGCTG GGCGACATCT ATGGCAGGAA GCCCGTCCTG
CTGGCCGGCA TCGGCATCTT CCTGGCCGGC TCATGGCTGG CGGGCCTGTC GGGCGAGTTC
GGCGACCTGC TGGGCATGAG CGGCATGGTC CAGCTGATCG TCTTCCGCGC CTTGCAGGGC
ATCGGCGGCG GGGCGCTATT CACCACCGCC TTCGCGATCA TCGCCGACCT TTATCCGCCG
CGGGAGCGGG GCAAGTTCGC CGGCATCTTC GGTTCGGTGT TCGGCCTGGC CAGCGTGCTG
GGTCCGCTGA TCGGCGGCTA TTTCACCGAC CACGGGACGG TGCAGCTGGG CTCGCACCTG
ATCGCCGGCT GGCGCTGGGT GTTCTATGTC AACCTGCCGC TCAGCCTGCT GTCGCTGTTC
ATGATCCTGG TCAAGATGCC GCCGCTCGAG CACCGGCGCT CCGGCGCGGT CGACTACGTC
GGCGCCATCC TGCTGGTCGC CGCCTTCGTG CCGCTGCTGC TGGCGCTCAG CCTGGGCGGT
CACGACTTCG CCTGGAGCTC GCCCCAGAGC CTGGGCCTGC TCGCCTTCGC CGCCGTCGCG
CTGATCCTCT TCCTCTACGC CCAGACCAAG GCCAGCAATC CACTGGTGCC GCTGCGGCTG
TTCGGCAACC GGGTGTTCGC CACCGCCAAC CTGGCCGGCT TCCTGATCTC CATGGCCTTC
CTCGGCGTGG TGACCTTCCT GCCGCTCTAC ATGCAGCTGG GCCTGGGCGT CGACGCCACG
ACCAGCGGCC TGGCCATCCT GCCGCTGATG GGCGGGCTGA TCGTCGCCTC GACCGCCGCC
GGCCAGATGG TCAGCAAGAC CGGGCGCTAC AAGCCGCTGA TGATCGTTGG CGCCGTGTTG
CTGATGACCG GGGTCTGGCT GCTCAGCCGG GTGACCGTCC ACACCACCCT GCCCGACCTG
TGCTGGCGGA TGGCCATCGT CGGCCTGGGC CTGGGACCGG GCCAGAGCCT GTTCAACATC
GCCACCCAGA ACGCCGTCGA GGTGCGCGAC ATCGGCGTGG CCACCAGTTC CAACCAGTTC
TTCCGCCAGA TCGGCTCGAC GATCGGCGTG GCGGTGTTCG GCGCCCTGCT GACCCATCGC
CTGGCCAACG AGGGCCAGGG CCTGGACCTG GGCGCCCTGC AGGGTCTGGC CCTGAAGGCC
ACCGCGACCG GCGCCGCCCG TCACGCCGAC CCGGCCCTGG CCCAGGCCCT GACCCACGCG
ATCACCGGCG TGTTCTTCGC GGGCCTGTTC GTGATCGGCC TAGGCTTGGT GGTGATCTTC
CTGATCCCAG AGCTGCCGCT GCGCGGCCGG CAACCAGGGC CGGAGCCGGT GCTGGAGAAG
GAGCCGGTTT AG
 
Protein sequence
MTTQTFTDTE RRLTLGALMI VFLLSALDQT VVSTAMPRII AELNGLTLYA WVTTAYLLTS 
TVMVPIWGKL GDIYGRKPVL LAGIGIFLAG SWLAGLSGEF GDLLGMSGMV QLIVFRALQG
IGGGALFTTA FAIIADLYPP RERGKFAGIF GSVFGLASVL GPLIGGYFTD HGTVQLGSHL
IAGWRWVFYV NLPLSLLSLF MILVKMPPLE HRRSGAVDYV GAILLVAAFV PLLLALSLGG
HDFAWSSPQS LGLLAFAAVA LILFLYAQTK ASNPLVPLRL FGNRVFATAN LAGFLISMAF
LGVVTFLPLY MQLGLGVDAT TSGLAILPLM GGLIVASTAA GQMVSKTGRY KPLMIVGAVL
LMTGVWLLSR VTVHTTLPDL CWRMAIVGLG LGPGQSLFNI ATQNAVEVRD IGVATSSNQF
FRQIGSTIGV AVFGALLTHR LANEGQGLDL GALQGLALKA TATGAARHAD PALAQALTHA
ITGVFFAGLF VIGLGLVVIF LIPELPLRGR QPGPEPVLEK EPV
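
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table that translation table 11 shares with the standard genetic code. It assumes the input is an in-frame CDS that starts at its first base, and it does not handle the alternative start codons that table 11 also allows (this gene starts with ATG). All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS; whitespace is ignored, a final stop is dropped."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()  # the stop codon is not part of the protein
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("ATGACCACCC AGACCTTCAC CGACACCGAG"))  # MTTQTFTDTE
```

The same call on the last four codons of the gene, `GAGCCGGTTTAG`, returns `EPV`, which matches the end of the protein sequence.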