Gene Caul_4620 details


Gene Information

Locus tag: Caul_4620
Symbol:
ID: 5902082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4997588
End bp: 4999000
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 74%
IMG OID: 641565139
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001686238
Protein GI: 167648575
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
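
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4997588
end_bp = 4999000
gene_length_bp = 1413
protein_length_aa = 470

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop codon.
codons, remainder = divmod(gene_length_bp, 3)

print(span_bp == gene_length_bp)        # True: 4999000 - 4997588 + 1 = 1413
print(remainder == 0)                   # True: 1413 is divisible by 3
print(codons - 1 == protein_length_aa)  # True: 471 codons minus 1 stop = 470 aa
```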


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGGTC CCCACCATCC TCCTTCCTAT GAAGGCGCCG TCCGCGCCTG TCCCGTCCCC 
GACACGCCCG CGCCCCGCAA GCGCCTGGCC CTGGCCGCCA CCGTGCTCGG CTCGAGCCTG
GCCTTCATCG ACGGATCGGT GGTCAATGTC GCCCTGCCGG CCATGCGCAT CGCCCTGAAG
GCCGACCCCG GGCAGGTGCA GTGGATCGTC AACGCCTACC TGCTGCTGCT CGGGGCCCTG
GTGCTGATCG GCGGCTCGGC CGGCGACCGC TACGGGCGGC GCAAGATGTT CGTGCTCGGC
GTCGTGCTGT TCGCCGCCGC CTCGCTGGCC TGCGCCCTGG CCCCGGACGC CGGCTGGCTG
ATCGCCGCGC GCGCGGTGCA GGGCGTCGCC GCCGCCCTGC TGGTCCCGGC CAGCCTGGCC
ATCCTGGGCT CGACCTTCAG CGAGGCGGAA CGCGGCGGGG CGATCGGGGC CTGGGCGGGG
TTCGGGGCTG TGACCACGGC GATCGGCCCG GTGCTGGGCG GCTGGCTGGT CGACCACGTG
TCGTGGCGCG CCATCTTCCT GATCAACGTT CCGCTGGCCG TCGCCACCGT CTGGCTGGCC
CTGGCGGCCG TGCGCGAGAG CCGCGATCCG GAGGTCAAGC ACCTGGACTG GCTGGGCGCG
GTGCTGGCCG CCGCCGGCCT GGGCGCGGTG ACCTGGGGTC TGACGGCGGT GGGCGCGCGC
GGCTGGACCA GCGGCGTGAT CTGGACCGCC CTGGCGCTCG GCGTCGCCCT GCTGGCCGGC
TTCGTGGCCA GCCAGGGCCG CCAGAAGCAT CCGATGATGC CGCTGTCGCT CTATCGCTCC
AGGACCTTCA GCGGCGCCAA CCTGCTGACC CTGGCGCTGT ATTTCGGCCT GACCGGCGCC
CTGTTCTTCC TGCCGTTCGA ACTGATCGCC CGCCACGGCT ATTCGGCCGC CGCCGCCGGA
GCGACCCTGC TGCCGTTCTC GCTGGTCATG GGGGTCCTGT CGGGCGTGGC CGGCAAGCTG
TCCGACCGGT TCGGCGCCCG GCCCATGCTG ACCATCGGCC CGATCCTGGC CGGGGCCGGT
TTCGGTCTGC TCGGCGCGCC GTGGCTCGGC TCCGGCTACT GGACCGGCGT GCTGCCCGCC
GTCCTGGTGC TGGCCCTCGG CATGACCGTC GCCGTCGCGC CCCTGACCAG CACGGTGATG
GGCGCGGTCG CCCCCAGCCA CGCCGGGGTC GCCTCCGGGG TCAACAACGC CGTGGCCAGG
ATCGCCGGCC TGCTGGCCGT GGCGGTCCTG GGCCTGGTCT ATTTCGCGCC GGGTTCGGCG
GGCTATCCCC GAGTGATGGG GATCAGCGCC CTGGCGGCGG TGGCGGCCGG GATCGTGGGG
TGGCTGATGA TCGAGAAGAA GCCGGCGCAC TGA
 
Protein sequence
MAGPHHPPSY EGAVRACPVP DTPAPRKRLA LAATVLGSSL AFIDGSVVNV ALPAMRIALK 
ADPGQVQWIV NAYLLLLGAL VLIGGSAGDR YGRRKMFVLG VVLFAAASLA CALAPDAGWL
IAARAVQGVA AALLVPASLA ILGSTFSEAE RGGAIGAWAG FGAVTTAIGP VLGGWLVDHV
SWRAIFLINV PLAVATVWLA LAAVRESRDP EVKHLDWLGA VLAAAGLGAV TWGLTAVGAR
GWTSGVIWTA LALGVALLAG FVASQGRQKH PMMPLSLYRS RTFSGANLLT LALYFGLTGA
LFFLPFELIA RHGYSAAAAG ATLLPFSLVM GVLSGVAGKL SDRFGARPML TIGPILAGAG
FGLLGAPWLG SGYWTGVLPA VLVLALGMTV AVAPLTSTVM GAVAPSHAGV ASGVNNAVAR
IAGLLAVAVL GLVYFAPGSA GYPRVMGISA LAAVAAGIVG WLMIEKKPAH
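
The GC content field can be recomputed from the gene sequence as it is printed in this record, in space-separated blocks of ten bases. The following is a minimal sketch under that assumption. The helper names `clean_sequence` and `gc_fraction` are hypothetical and do not come from the source database, and no result for the full gene is claimed here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return gc_count / len(seq)


# Usage on a short toy input (not the gene itself): 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75

# For the full gene, paste all lines of the "Gene sequence" block into one string
# and compare round(100 * gc_fraction(...)) with the GC content field.
```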