Gene Caci_7431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_7431 
Symbol 
ID8338801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp8618208 
End bp8619809 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content70% 
IMG OID644960511 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003118098 
Protein GI256396534 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCTCA TCGCCTCCAC TCCGGTGGAC CGTTTGAAAG AGCCCTATGC ACGCAGGTGG 
TGGGCCCTCC TCGTGCTGTG CCTCAGCCTG CTGATCGTGG TCATGGCCAA CACCTCGCTG
ATCGTCGCCG CCCCCGACAT GACCCGCGAT CTGCACCTGA GCAGCGCCGA TCTACAGTGG
GTGATCGACG GCTACACCGT CCCGTACGCG GCGCTGATGC TCGTGCTCGG CGCGGTCGGC
GACAAGTTCA GCCGCCGGGG GGCGCTGATC GCCGGGCTGG TGGTCTTCGC CGGCGGCGCG
GTGGCCGGAA GCCTGGTCCA CACAGCGATC GCGGTCATCG TGGCCCGCGC GACGATGGGC
ATAGGCGCGG CGGTCATCAT GCCCGCGACG CTGTCGCTGC TGGTCGCCAC CTTCCCGCGC
GCCGAACGCG CCAAGGCGAT CACCGCCTGG AGCGCGACCT CGGGCCTGGC CATAGCCCTG
GGCCCGCTGC TGGCCGGCTG GCTTCTGCAG CAGCACAGCT GGGGCTCGAC GTTCCTGATC
AACGTGCCGA TCGCCGCGAT CGCGATCGTG GCGGCGCTGG TCGTCGTACC GCCCTCGCGC
GCGGCGGCGA TGGGCCGCCT GGACCTGGTC GGCGGGCTGC TGTCGGTGAT CACGGTCGGC
TCGCTGGTCT ACGCGATCAT CGAAGGCCCG CACTTCGGCT GGGGCGCGGC GGCGCTCGGC
GCGCTCGCGG CGTCCGCGGT GGGCCTGGTC GGCTTCGTGC TCTGGGAACT GCGCCACCCG
CACCCGATCC TGAACGTCCG CAAGTTCGCC GACCGGATGT TCAGCGGATC GGTGCTGGCA
GTGCTGTTCT TCTTCCTGGG CGCGTACGGC ACCATCTACT ACGCGACCCA GCACCTGCAG
TTCGTCCTGG GCTACAACGC CCTGTCGACC GGCGTGCGCC TGCTGCCGCT GGCCGGCGCC
GTGTTCGTCG GCGCGGCGCT CACCAACCGC CTGACACCGC GGCTGGGCAT GAAGCTGGTC
GTGGTGCTCG GCATGGCGCT GGGCACGGCC GCGATCCTGC TGCTCGCCCG CGTCGGCGAC
GGCGCGACGT ACACCGACTT CCTGCCCACG CTCGCGATGC TCGGCCTGGC CATCGGCCTG
AGCACCGCAC CGTGCACCGA CACCATCATG GGCGCCTTCC CCGAATCAGA GCTCGGAGTC
GGCGGCGGCA TCAACGACAC GGCGCTGGAA CTCGGCGGCT CGCTGGGCAT CGCGATCCTG
GGATCGATCC TGGCCACGAC CTACCGCGAC AAGCTCGCTC CCGTCATCGC CGGTCACCTC
CCGGAGCAGG GCGCGCACGT CGCGAAGGAC TCCATCGGCG GCGCCCTGGC CGTCGCGGAT
CAGGTAGCCC ACAGCCCCGC CGGCGGTCCT GGCCAAGCCC AGGCGCTTGT GACGGCGGCG
GACCACGCCT TCACCCACGC GGTCGCCCAC ACCAGCCTGA TCGGCGGAAT CATCCTCGCG
GTCGGCACCG TGCTGGTGGC GGTGATCCTG CCGCGCCATT CCACCGCGGA GCCTGAGGAT
CAGCACTCTG ACATGGAGCG CGTAGAAGTC GAAGTCGGCT AG
 
Protein sequence
MSLIASTPVD RLKEPYARRW WALLVLCLSL LIVVMANTSL IVAAPDMTRD LHLSSADLQW 
VIDGYTVPYA ALMLVLGAVG DKFSRRGALI AGLVVFAGGA VAGSLVHTAI AVIVARATMG
IGAAVIMPAT LSLLVATFPR AERAKAITAW SATSGLAIAL GPLLAGWLLQ QHSWGSTFLI
NVPIAAIAIV AALVVVPPSR AAAMGRLDLV GGLLSVITVG SLVYAIIEGP HFGWGAAALG
ALAASAVGLV GFVLWELRHP HPILNVRKFA DRMFSGSVLA VLFFFLGAYG TIYYATQHLQ
FVLGYNALST GVRLLPLAGA VFVGAALTNR LTPRLGMKLV VVLGMALGTA AILLLARVGD
GATYTDFLPT LAMLGLAIGL STAPCTDTIM GAFPESELGV GGGINDTALE LGGSLGIAIL
GSILATTYRD KLAPVIAGHL PEQGAHVAKD SIGGALAVAD QVAHSPAGGP GQAQALVTAA
DHAFTHAVAH TSLIGGIILA VGTVLVAVIL PRHSTAEPED QHSDMERVEV EVG