Gene Caul_3188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3188 
Symbol 
ID5900643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3451921 
End bp3453486 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content67% 
IMG OID641563692 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001684813 
Protein GI167647150 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.477871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACC AGGACTCCTC TCCCGCAGCC GCCAGCGGCC CAGCGCCGCT GACCGGCATG 
ATGCTGGCGG TCACCTCGAT CGCCCTGGCG CTGGGCACTT TCATGCAGGT GCTGGACAGC
ACCATCGCCA ACGTCTCGAT CCCGACCATC GCCGGCAATC TGGGGGTCAG CACCAGCCAG
GGCACCTGGG TGATCACCTC GTTCGCGGTG GCCAACGGCG TCTCGGTGCC GCTGACCGGC
TGGCTGATGG GTCGCTACGG CGTCGTGAAG ACTTTCGTGG TCTCGGTGCT GCTGTTCACC
CTCGCCTCGT TCCTGTGCGG CGTTTCGTGG AACCTGCCGT CGCTGATCGG CTTTCGGATT
CTGCAGGGCC TGGTCTCGGG TCCGATGATC CCGGGCTCGC AGGCCCTGCT GATCATGATC
TTTCCGGCCA GCAGGCGGGG CACGGCCCTG GCCATCTGGT CGATGACCAC ACTGGTGGCG
CCGATCTGCG GCCCGATCCT GGGCGGCTAC ATCTCCGACA ACATCGCCTG GGAATGGATC
TTCCTGATCA ACGTACCCGT CGGCCTGCTC TGCGCCTTCC TGTGCTGGCG CGGGATGAAC
AACCGCGAGA CCCCGACCCG CAAGGTGCCG ATCGACACCA CCGGCTTCAT GCTGCTGCTG
GTCTGGGTCG GCGCCCTGCA AGTGATGCTC GACACCGGCA AGGACGCCGA CTGGTTCAAC
TCGCCGGCCA TCGTCGTCGA GACCCTGGTG GCCATCGTCG GCTTCATCGC CTGGGTGATC
TGGGAGCTGA ACGAGAAGCA TCCGATCGTC GACCTGTCGC TGTTCAAGTC CAAGAACTTC
GCCCTGGGCA CGGTCGCCTT CTGCCTGGGC TACGCGGTGT TCTTCGGCAG CAATCTGCTG
CAGCCGCTGT GGCTGCAAAC CCAGATGCAC TACATCGCCA CCTGGGCCGG CCTGGTCGCC
GCCCCCAGCG GCGTGGTGGC CGTGCTGCTG ACCCCGTTCG CCGCCCGCAT CATGCAGAAG
GTCGACGCCC GCTGGACCGC CACCCTGTCG CTGGCCGCGT TCGCCCTGTC GTTCTACATG
CGCTCGGGCT TCACGCCGGA CGTGGACTTC AAGGCCCTGG TTTGGCCGAT GCTGGTGCAG
GGGGTGGCGA TGAGCACCTT CTTCCTGTCG ATGGTGACCA TCTCGCTGAA CGGCGTGTCG
CCCCAGCAAC TGCCGTCGGC CTCGGGCCTG TCGAACTTCT CGCGGATCAC CGCGGGCAGC
TTCGCGGCCT CGCTGACCAC GACGATCTGG GACCGGGGCG AAAGCCTGCA CCAGAACCGC
ATCGCCGAAT CCATGGCCTC GAACGACCCG GCCTGGCTGG CGGCCGTGGA CCACATGCAG
GCCGCGGGCC TGAGCCACGC CCAGGCCGTG GGCGCGGTGA CCGCCCAGGT CGTCAACCAG
GCTTACCTCC TGTCGACCCT CGACTTCTTC CGCGCTTCGG CTTGGCTGGC GGTCCTGCTG
ATCCCATGCA TCTGGCTGAC CAAGAAGGCG ATGAGCGGCG GCGGCGCGCA CGCGGCGGCC
GACTAG
 
Protein sequence
MADQDSSPAA ASGPAPLTGM MLAVTSIALA LGTFMQVLDS TIANVSIPTI AGNLGVSTSQ 
GTWVITSFAV ANGVSVPLTG WLMGRYGVVK TFVVSVLLFT LASFLCGVSW NLPSLIGFRI
LQGLVSGPMI PGSQALLIMI FPASRRGTAL AIWSMTTLVA PICGPILGGY ISDNIAWEWI
FLINVPVGLL CAFLCWRGMN NRETPTRKVP IDTTGFMLLL VWVGALQVML DTGKDADWFN
SPAIVVETLV AIVGFIAWVI WELNEKHPIV DLSLFKSKNF ALGTVAFCLG YAVFFGSNLL
QPLWLQTQMH YIATWAGLVA APSGVVAVLL TPFAARIMQK VDARWTATLS LAAFALSFYM
RSGFTPDVDF KALVWPMLVQ GVAMSTFFLS MVTISLNGVS PQQLPSASGL SNFSRITAGS
FAASLTTTIW DRGESLHQNR IAESMASNDP AWLAAVDHMQ AAGLSHAQAV GAVTAQVVNQ
AYLLSTLDFF RASAWLAVLL IPCIWLTKKA MSGGGAHAAA D