Gene Caul_5066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5066 
Symbol 
ID5902528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5464463 
End bp5466061 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content68% 
IMG OID641565587 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001686684 
Protein GI167649021 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.613725 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG CCGCCCTCCC CGCCCAAGCC GCCAAGCCCG CGGCTCCGGT CGCCAGCGAG 
GTCGACTGGA CCAAGCTGTT CCTCGGCTTC GGGGCGATGG TCATCGGCCA GTTCATGGCC
ATCCTCGACA TCCAGATCGT CGCGGCCTCG CTGCCGCAGA TCCAGGCCGG GGTCGGCGCC
AGCGCCGACC AGGTCAGCTG GATCCAGACC GCCTACCTGA TCCCCGAAGT CGTGATGATC
CCTCTGTCGG GGTACCTGTC GCGGCTCTGG GGGACACAGC GCGTCTATCT GATGTCGTGC
CTGGGCTTCA TCGTCATGAG CGTGCTGACG GGGCTGAGCT CGTCGATCGA CATGATGATC
ATCACCCGGG CCCTGCAGGG GTTCATCGGC GGGGCGATGA TCCCGACGGT GTTCGCGGTG
GCCTTCACCG CCTTCCCGCC CGAGCGCCGG GTGACCGCCA GCGTGATCAT GGGCCTGATC
GTCACCCTGG CCCCGACGGT CGGCCCCACC CTGGGCGGCC ACCTGACCGA ATGGCTCAGC
TGGCGCTGGC TGTTCTTCAT CAACGTGCCG ACCGGCCTGG TCGTGCTGTT CGGCGTCGCG
CGCTGGGGCC AGTTCGACAA GGGCGACCCC AGCCTGGCCA AGGGTTTCGA CTGGTTTGGC
CTGGCGGTGA TGGCCACCTT CCTGATGAGC ATGCAGTTCG TGCTCGAGGA AGGCTCCAAG
GACGGCTGGT TCGAGGACAC CGGCATCCTG GCCCTGACCG TGGTGGCGGT GGTGTCAGGG
GTGGTCTTCG TCTGGCGGTC GCTGAGCTAC AAGAACCCGA TCGTCGAGCT GCGGGCCTTC
GCCAACCGCA ATTTCAGCAT CGGCGTGGCC ATGACGGCGG TGTCGGGGGC CAGCCTGTTC
GGCGGCACCT TCCTGCTGCC GCAGTTCCTG GGCCGGGTGC GGCACTATTC GGCGTCCGAG
GTTGGCACGA CGATGGTGGT CTCGGGCCTG TCGATGTTCG CCACCGGTCC GCTCGCCGGC
CGGCTGGTGC GCCAGATGGA CCCGCGCGCG CCCATGTTCA TCGGCTTCAT GCTGGCCGGC
TGGGGCATGT ACATGGCCCA TGGCGTGACC AAGGACTGGG GCTTCTGGCA GTTCGCCGGG
GTCCAGGCCT GTCGGGGCGT CGGGGTGATG ATCGCCATGA TCGCCACCCA GCAGGTGACG
ATGAGCACCC TGCCCCAGAA CATGGTCAAG AACGCCTCGG GCCTGGTGAA CCTGTCGCGC
AACACCGGCG GCGCGGTCGG CCTGGCCCTG CTGGCCACGG CGATCACCAA CCAGACGGCG
CTCTACTACA TGGGCCTGTC GGACCAGGTC AGCCAGGGCG ACGCCCGGAT GGCCGGCATG
ATGCAGGGCC TGGCCGCGCG GATGGCCCAG CTCGGCGTCT CCAACCCCGA CGGCGCCGCC
CGCAAGGCGA TCAGCGGCAT GTTGGAGCAG CAGGCCACCG TGCTGGCCTT CGGCGACAGC
TTCACCTTGC TGGCCTACGG CTGCTTCATC GCCGCCGCCG TCTCGCTGCT GGCCAAGCCG
GCCAAGAACG CTCCACCCCC GCCGTCGGAC GCGCACTGA
 
Protein sequence
MTDAALPAQA AKPAAPVASE VDWTKLFLGF GAMVIGQFMA ILDIQIVAAS LPQIQAGVGA 
SADQVSWIQT AYLIPEVVMI PLSGYLSRLW GTQRVYLMSC LGFIVMSVLT GLSSSIDMMI
ITRALQGFIG GAMIPTVFAV AFTAFPPERR VTASVIMGLI VTLAPTVGPT LGGHLTEWLS
WRWLFFINVP TGLVVLFGVA RWGQFDKGDP SLAKGFDWFG LAVMATFLMS MQFVLEEGSK
DGWFEDTGIL ALTVVAVVSG VVFVWRSLSY KNPIVELRAF ANRNFSIGVA MTAVSGASLF
GGTFLLPQFL GRVRHYSASE VGTTMVVSGL SMFATGPLAG RLVRQMDPRA PMFIGFMLAG
WGMYMAHGVT KDWGFWQFAG VQACRGVGVM IAMIATQQVT MSTLPQNMVK NASGLVNLSR
NTGGAVGLAL LATAITNQTA LYYMGLSDQV SQGDARMAGM MQGLAARMAQ LGVSNPDGAA
RKAISGMLEQ QATVLAFGDS FTLLAYGCFI AAAVSLLAKP AKNAPPPPSD AH