Gene Cagg_0237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0237 
Symbol 
ID7269151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp296660 
End bp298525 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content59% 
IMG OID643565106 
Productdrug resistance transporter, EmrB/QacA subfamily 
Protein accessionYP_002461621 
Protein GI219847188 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCA CAACATCACC GGCTTCCACG ACGGTAACGC GAATCGATTA CGCAGTTACC 
CTCGATCAGC GCAGCAAAGT GCTGATCTTA ATCGGAGTCT TGCTCGGTTT GTTTCTCTCG
GCGCTCGATC AGACTATCGT CTCTACCGCT CTACCACGGA TCGTAGCCGA TCTCAAGGGA
ATTGAACTGA TCGGTTGGGT CTCGACCGGC TATCTGTTGG CTTCGACGTC GATGGTGCCG
ATCTACGGCA AACTGTCAGA CATCTACGGA CGTAAATATG TTCTGTTGTT TGGGATCGCG
GTCTTCTTGC TTGGCTCGCT GCTGTGCGGG CTGGCCGGTG ACATGACCCA ACTGGTCTTC
TATCGCGGTC TCCAAGGCTT CGGTGCCGCA GCCCTAACCT CGACCGCGTT CGCCATTCCC
GCCGACCTCT TCGCACCGGC GGAACGGGCG CGCTATATGG GTCTCTTCGG TGCCGTGTTT
GGTCTGTCGA GTGTTGTGGG GCCGTTTATC GGTGGCCTTC TCACCGACAA TCTGAGCTGG
CATTGGGTGT TCTTTGTCAA TTTGCCGTTA GGGGTGGTGG CGCTCGGTTT TATTATCGCC
AAGCTACCGC GCCTACACAG TGGTCTCAAA CCGGCGATTG ACTATGCCGG TGCGGCGACG
TTGTTGTTGA CCGTGATCCC GTTCTTGTTG GCGTTGACCC TCGACAAGAA CGACTTTTCG
TGGACATCAC CGTTCATTAT CAGTCTGTTT GCCGTCAGTG CGATTGGCTT GATCTTCTTC
CTGCTCATCG AGCGGCGGGC CGAATCACCT ATCTTGCCAC TGCATCTGTT CCGCAATCGC
ACCTTTACGC TCACGACCAT CATCGGCTTC ACCGTCGGTG CAACCCTGTT CGCCGCGATC
TTCTTCCTCT CGCTCTATCT GGTCAATGTA CTCGGCGTGA GTGCCACTGC TGCCGGTACG
ACCCTTATCC CGCTCACCCT TAGCCTTGTG GTGGGGGCGA TGGTGTCATC GCAGATCGTG
CAGCGTACCG GGCGCTATAA GTGGGTGATC GTTGGTGGGA TGGCGATAAT TGTCGCTGCG
CTGTGGTGGC TCACCACCCT CACCCCTGAC ACGTCGGTCT GGATGGTACG ACTGCGGATG
ATCGCACTCG GATTGGGGTT AGGCCCCTCG ATGCCGATCC TCAATCTGGC GATGCAGAAC
GCGGTGCCGC GCACCGACAT GGGTGCAGCG ACGGCTAGCC GACAATTCTT CCAGCAGCTC
GGCCAAGTGG TCGGTTCGGC GGTCTTTGGT GCGCTGCTTA CCGGCGTGTT GACAACAACC
CTCACTGCGA ATCTGGCCCC GATTCAGGCC CAGTTGCCGC CGGAAATGGC TGCCCGCTTC
GACAGCAGCA CGTTGCGCAA CGGGATGGGT GCCGGTGAGG GGGCAAGTGG CGAAACGGTT
GATCCGGCCG TGCGGATCGA GCAGGCGATT GCCGATCAGT TTGCGTCGCG CCGTGATTTA
CTGACCCGCG CCTTGCGCGA TGCCGATCCG GCCGCAATTG AGGCATTGCG TGCCGACCCC
CAACTGCCCG ATCAACAGAA GGCGATGCTG GATATGATCG GGAACTTGCC GGCAGCGGCC
CGCGCGCAGG CGCTCGACCG GGTATTGGCC CAACTTGACC GGGCTGAGCA GAAGGCTCGC
GCTGAGGGGC GGGCAATCGG CGAGCAGATT AGCGCAGCGC TCAAAGATGC ATTTACCAAG
AGCGTGACGA CAATCTACTG GTACGCAATC TGGCTGGCGG TGATCGGGTT GGCACTGGCG
CTGTTCATTC CCGAATTGCC CCTTAAGCAG AATTATGGCG AAGATTTACC GCCATTGATG
GAGTAG
 
Protein sequence
MTTTTSPAST TVTRIDYAVT LDQRSKVLIL IGVLLGLFLS ALDQTIVSTA LPRIVADLKG 
IELIGWVSTG YLLASTSMVP IYGKLSDIYG RKYVLLFGIA VFLLGSLLCG LAGDMTQLVF
YRGLQGFGAA ALTSTAFAIP ADLFAPAERA RYMGLFGAVF GLSSVVGPFI GGLLTDNLSW
HWVFFVNLPL GVVALGFIIA KLPRLHSGLK PAIDYAGAAT LLLTVIPFLL ALTLDKNDFS
WTSPFIISLF AVSAIGLIFF LLIERRAESP ILPLHLFRNR TFTLTTIIGF TVGATLFAAI
FFLSLYLVNV LGVSATAAGT TLIPLTLSLV VGAMVSSQIV QRTGRYKWVI VGGMAIIVAA
LWWLTTLTPD TSVWMVRLRM IALGLGLGPS MPILNLAMQN AVPRTDMGAA TASRQFFQQL
GQVVGSAVFG ALLTGVLTTT LTANLAPIQA QLPPEMAARF DSSTLRNGMG AGEGASGETV
DPAVRIEQAI ADQFASRRDL LTRALRDADP AAIEALRADP QLPDQQKAML DMIGNLPAAA
RAQALDRVLA QLDRAEQKAR AEGRAIGEQI SAALKDAFTK SVTTIYWYAI WLAVIGLALA
LFIPELPLKQ NYGEDLPPLM E