Gene Caul_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2689 
Symbol 
ID5900144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2922647 
End bp2924182 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content65% 
IMG OID641563180 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001684314 
Protein GI167646651 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0761069 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGG CCGGCAACGG CAAGGGCGAC GCGGCCAACC GGGTCCCGAT CACGGTCGCC 
GTCATGCTGG CGACGATCAT GAACTCGCTG GACACGACGA TCGCCAACGT CGCCCTACCC
CACATCCAGG GCAGCGTCTC GGCTTCGGCC GAGCAGATCA CCTGGGTGCT GACCTCGTAT
ATCGTCGCCG CGACGATCAT GACCCCGCTG ACCGGCTTCT TCGCCGATCG GGTCGGCCGC
AAGATGGTGT TCCTGGTGTC GATCGCCGGT TTCACCGTCG CCTCGATGCT GTGCGGCGTC
GCCACCAGCC TGGTCGAGAT CGTTCTGTTC CGCCTGCTGC AGGGCCTGTT CGGCGCCGCT
CTGATCCCGC TCTCCCAAGC TGTGCTGCTC GACATCAACC CGCCCGAGAA GCACGGCTCG
GCCATGGCCA TCTGGGGCGC CGGGGCGGTG CTGGGGCCAA TCCTCGGGCC GGCCCTGGGC
GGCTGGCTGA CCGACAATCT CGACTGGCGC TGGGTGTTCT TCATCAATCT GCCGATCGGC
ATCCTGGCCT TCTGCGGGGT GTTCTTCTTC CTGTCTGAAA AGAAGAGCCC CGAGAAGAAG
CGGTTCGACG TGCTGGGCTT CGCCAGCCTG GCCCTGGCCA TCGGCGGCTT CCAGATGATG
CTCGATCGCG GTCCCAGCCA GGACTGGTTC GCCTCGTCCG AAATCTGGCT CTACCTGATC
GTCGGGATCA TCGCCCTGTG GATCTTCGGC GTGCAACTGG CCACCGCAGC CAAGCCGTTC
GTCGACCGCG CCCTGCTGGC CGATGTCAAT TTCATCACCT CCTGCGTGTT TGGCTTCTTC
ATCGGCATTC TGCTCTACAG CGTGCTCGCC CTACTGCCGC CGATGATGCA GAACCTGATG
GGCTATCCGG TGGCCTTCAC GGGCCTGGTC AGCATGCCGC GCGGCATCGG CTCGTTCATC
GCCATGTTCG CCGTCGGCCA ATTGATCGGC CGCATGAGCA TCAAGCTGAT CCTGTTGATC
GGCCTGGCGG TCAGCGCCGT CTCGCTGTGG ATGATGACCC AGTTCACCCT GGGCATGGAC
ACCCGCCTGA TCATCGTCTC GGGGTTCCTG TCCGGCGTCG GCACCGGCCT GATCTTCGTG
CCGCTCAGCA CCATCGCCTT CGCCACGGTT CGCCCGCAGC ACCGGGCCGA AGGCGCGGGC
CTGTTCACCC TGATCCGCAA CATCGGCTCG GCCGCCGGCA TCTCGATCAT GCAGGCCCGC
TTCGTCAGCG GCATCGAGGT CCACCACGCC AAGCTGGTCG AGCACGCCCG ACCCGACAAT
CCGCTGTTCC ACGCCTATGC GCCGCTGGTC TTCCAGGCCC AGGACGCCAT GGCCCGGTTC
AACGGCGTCA TCACCCGCCA GGCCTCGATG CTGTCCTATA TCGACGACTT CCAGCTGATG
CTGGGCATCA CCATCCTGTG CGCGCCCATG ATCCTCCTGA TGCGAACCCC CAAGAAGACC
TCGGGGGGAG AGACCGTCCA TGTCGCCGAA CACTAA
 
Protein sequence
MTGAGNGKGD AANRVPITVA VMLATIMNSL DTTIANVALP HIQGSVSASA EQITWVLTSY 
IVAATIMTPL TGFFADRVGR KMVFLVSIAG FTVASMLCGV ATSLVEIVLF RLLQGLFGAA
LIPLSQAVLL DINPPEKHGS AMAIWGAGAV LGPILGPALG GWLTDNLDWR WVFFINLPIG
ILAFCGVFFF LSEKKSPEKK RFDVLGFASL ALAIGGFQMM LDRGPSQDWF ASSEIWLYLI
VGIIALWIFG VQLATAAKPF VDRALLADVN FITSCVFGFF IGILLYSVLA LLPPMMQNLM
GYPVAFTGLV SMPRGIGSFI AMFAVGQLIG RMSIKLILLI GLAVSAVSLW MMTQFTLGMD
TRLIIVSGFL SGVGTGLIFV PLSTIAFATV RPQHRAEGAG LFTLIRNIGS AAGISIMQAR
FVSGIEVHHA KLVEHARPDN PLFHAYAPLV FQAQDAMARF NGVITRQASM LSYIDDFQLM
LGITILCAPM ILLMRTPKKT SGGETVHVAE H