Gene Caul_5403 details


Gene Information

Locus tag: Caul_5403
Symbol:
ID: 5897187
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010333
Strand:
Start bp: 118760
End bp: 120373
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 65%
IMG OID: 641550693
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001672179
Protein GI: 167621671
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
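
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 118760
end_bp = 120373
gene_length_bp = 1614
protein_length_aa = 537


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the interval 118760 to 120373 spans 1614 bp, and 1614 bp corresponds to 538 codons, that is 537 amino acids plus one stop codon.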


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACAGA CTGAGGCCCT CTTTGCTCGC TATGGGTCGA TGTACCGCTG GCTCGCGACG 
GCGGCGGCCA TGGTTTCAGC GATCGCCGTG GTGTTGTCAT CCACCATTGT GAACGTGGCT
GTCCCGGCGA TCATGGGGCA GTTCGGCATC GATCAGACCC GGGTGCAGTG GCTCGCGACT
GGTTTCCTCG CCGCGATGAC GGTCACCATG CTGCTCTCTG CATGGTGCGA GCGCACCTTC
GGTCAGCGGC AAACCATGAT GGTGGCCCTG GGCGTGTTCC TGTTCGGATC GGTCCTTGGC
GGGCTGGCCC CGGATGATAC GGTGCTCATA GTCTCCCGGG TGATCCAGGG CGCTGCAGCG
GGTGTCATCC AGCCGCTGGC GATGGTCGTG ATGTTCCAGG TGTTTCCCCC CGACAAGCGC
GGCGCGGCTA TGGGCATCTT CGGGATTGGC GTGGTGCTCG CCCCGGCGCT GGGGCCGTGG
GTCGGCGGAC TGCTGATGGA TAGCTTCGAC TGGCGGTTCC TTTTCTATTT GGGGATTCCC
TTCGGCCTCG GCGGCATCAC GCTGGCGAAC CTTTTCCTGC CCGATCGCGA TCCTGCGGCT
GCGAAGGCCC GACTGGACTG GCCGGCCCTG GTGTGGCTCA GCGTGGCGCT GCTCGCGTTG
CTGGAAACCA TGTCCAGTGG ACAGCGCCAG GGCTGGGCCT CCACGCCCGT CCTCGCGGGG
TTTGCGATCA CGATGACCTG CGCGGCGGCC TTTCTTGTCC GCGAAAGTCG GATTCCGCAT
CCGCTGCTCG ACCTGCGGGT GTTCGCCAGT CCCCCCTTCG CTGCGGCCGC AGCGGTCAGC
TTTGTGCTGG GGGCCGGACT GTACGGCACG ACCTATCTGC TCCCGGTCTT CGTCCAGCAG
ATTCAAAACT TCACCCCAAC CCAAGCGGGT CTGCTGCTGA TGCCGGCCGG CTTCGTCCTG
GTCATCGTCT TTCCCGCCGC GGGCATGATG AGCGACCGGC TGTCGCCAGG ATTGCTGATC
GGGGCGGGCA TGGCGATTTT TGCCTATTCG TCGTGGCTAA CGGCCCACGT GAGCGCCGAC
ACAGCCTTCT GGACGCTCGC TTGGTGGACG GTGATTGGCC GCATTGGCAT GGGCTTTGTG
TTCCCCTCGC TGAGTTCTGC AGCTCTCAAG GTGCTGCCGC TGGAACTGTT ATCCCAGGGC
TCAGGCTCGA TGAACTTCAC CCGGCAATTG GGCGGAGCGC TTGGCACGAG TATTCTGGGC
GTTATCATTG AGCGACGAAC CGCGTTCCAC GGCGACGCCC TCGCCGCGAC CCAAACGTCG
GATAATTCGA CTACAGGCAA CTATCTCATG GAAGCCGCGC GAACCGTCCA CGCGATGGGA
CTCGGGCCTT TGGAGCAACT CCCGGCCGCC GGCTGGTTGC TGGGGCAAGG AGTTTACTAC
CAGGCGGCGG CGGCCGCCTA CCGAGACGGG TTCCTGATCA CGGCTATGGT CTTCGCTGCG
GCCCTCGCGC CGACGTTCGT TCTTCATCGT GCTTTGAGCC GCGTCGCGGG GCGATCCCTG
ACAGGATCCG GTTCGGCTTC GGGGGGCGAG GCCTCGCGGG CGACGAAGGT ATAA
 
Protein sequence
MPQTEALFAR YGSMYRWLAT AAAMVSAIAV VLSSTIVNVA VPAIMGQFGI DQTRVQWLAT 
GFLAAMTVTM LLSAWCERTF GQRQTMMVAL GVFLFGSVLG GLAPDDTVLI VSRVIQGAAA
GVIQPLAMVV MFQVFPPDKR GAAMGIFGIG VVLAPALGPW VGGLLMDSFD WRFLFYLGIP
FGLGGITLAN LFLPDRDPAA AKARLDWPAL VWLSVALLAL LETMSSGQRQ GWASTPVLAG
FAITMTCAAA FLVRESRIPH PLLDLRVFAS PPFAAAAAVS FVLGAGLYGT TYLLPVFVQQ
IQNFTPTQAG LLLMPAGFVL VIVFPAAGMM SDRLSPGLLI GAGMAIFAYS SWLTAHVSAD
TAFWTLAWWT VIGRIGMGFV FPSLSSAALK VLPLELLSQG SGSMNFTRQL GGALGTSILG
VIIERRTAFH GDALAATQTS DNSTTGNYLM EAARTVHAMG LGPLEQLPAA GWLLGQGVYY
QAAAAAYRDG FLITAMVFAA ALAPTFVLHR ALSRVAGRSL TGSGSASGGE ASRATKV
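
The sequences above are shown in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the DNA. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for all codons; the special handling of alternative start codons in table 11 is not modelled here, which is an acceptable simplification for this gene because it starts with ATG.

```python
# Standard codon assignments in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record above.
first_line = "ATGCCACAGA CTGAGGCCCT CTTTGCTCGC TATGGGTCGA TGTACCGCTG GCTCGCGACG"
dna = clean_sequence(first_line)
peptide = translate(dna)
```

For the first line of the gene sequence, `peptide` is `MPQTEALFARYGSMYRWLAT`, which matches the first two blocks of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field.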