Gene Caci_1069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1069 
Symbol 
ID8332404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1212203 
End bp1213633 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content69% 
IMG OID644954217 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003111836 
Protein GI256390272 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.794083 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCA CGACTGCCAC GACCCCCGCT ACCACCACAG CCCGTTCTGC TCGCCTGATG 
GGGGCGATCC TGGCGCTGGT GCTGCTCGCT GACGCCCTGG ACGTGATCGA CGCGACCGTC
ACGAACATCG CCGCGCCCAC GATCGCCCGC GATCTGCACG GCGGCGTGGG GCTGATCAAG
TGGCTGGGGA CGGCGTACAT GCTCGCGATG GGCGTCCTGC TGGTGGTCGG CGGACGGCTG
GGCGACAAGT ACGGCCAGCG CAGGCTCTTC CTGATCGGCA TCGCCGGGTT CGGGGTGGCC
TCGGCGGTCG CCGGGCTCTC GCCCGACCCG ACCCTGCTCA TCGCCGCGCG GGTCGCTCAG
GGTGCGTTCG GGGCGCTGCT CACGCCGCAG GGCATGGCGA TCATGGTCAA GACCTTCAGT
CCCGAGCTGC TCACCAAGGC GTTCGCTCTG TTCGGTCCGG TGCTCGGCAT GGCGTCCGTC
GCAGGTCCCG TCCTGGCCGG GTTCATCATC AGCGCCGACC TGTTCGGGCT GTCGTGGCGC
CCGATCTTCC TCATCAACGT CGTGCTCGGC GGCGTCGGGC TGGTCGTCGC GGCCAGGATC
CTGCCGCGCG ACGACGGCGA CCGGTCCGTC GTGGTGGACG GCTGGGGTTC CGGACTGCTC
GCCGCGACCA TGTTCGGGCT GCTGTACGGC CTGATCGAGG GCTCGACCAA TGGCTGGAGC
GTGCTCCCGA TCGCCTCGAT CGTGGCCGGC GTCCTGTTCT TCGGCGCCTT CGCCTACCGC
CAGCGCACCG CCGCCCACCC GCTCGTCGCG CCCTCGCTGC TGCGGAACAA GGGTTTCACG
TCCGGGATGA TCGTCGGGCT GGTCGTCTTC GCCGCCACCA CCGGCCTGAT CTACGTGCTG
TCGCTATTCA TGCAGGAGGG CTTGCACACC GGACCGCGCG ACACCTCGCT CGCCCTGGTG
CCGCTCACCC TCGGCATCAT CGCCTCCGCG TTCGCCGCGA TGGGAGGTCT GGTCGCCAAG
CTCGGCCGGA CCCTCGTCTT CATCGGGCTC GGGGTCGTTC TCCTAGGCTG CGGTTGGATC
CTGGCGCTTG TTGCCACCTC TGGGACGAGC GTCAGCCTGT GGGCGTTGGC GCCGGCGTTG
TCCGTCACCG GTGTCGGCTT GGGCCTGTGC TACAGCACGA TCTACAACGT CGCCCTCGGA
GAGGCGAGCC CTGAGGAAGC CGGAAGCGCC AGCGGCTCGA TCAGCTCGAT CCAGCAGCTC
GCTGCGGGGA TCGGCTCGGC CGCGGTCACC TCGGTCTTCT TCCAGGCGGC GACGTCCGGC
TTCGGGCACG CCATGAAGGT CAGCCTCATC GTGACCCTGA TCGTGACCGC GCTCAGCATC
CCGGTCGTCA CCTTGATGCC GCGCAAGGCT GCGGCGGAAC CTCAGGAGTA G
 
Protein sequence
MTATTATTPA TTTARSARLM GAILALVLLA DALDVIDATV TNIAAPTIAR DLHGGVGLIK 
WLGTAYMLAM GVLLVVGGRL GDKYGQRRLF LIGIAGFGVA SAVAGLSPDP TLLIAARVAQ
GAFGALLTPQ GMAIMVKTFS PELLTKAFAL FGPVLGMASV AGPVLAGFII SADLFGLSWR
PIFLINVVLG GVGLVVAARI LPRDDGDRSV VVDGWGSGLL AATMFGLLYG LIEGSTNGWS
VLPIASIVAG VLFFGAFAYR QRTAAHPLVA PSLLRNKGFT SGMIVGLVVF AATTGLIYVL
SLFMQEGLHT GPRDTSLALV PLTLGIIASA FAAMGGLVAK LGRTLVFIGL GVVLLGCGWI
LALVATSGTS VSLWALAPAL SVTGVGLGLC YSTIYNVALG EASPEEAGSA SGSISSIQQL
AAGIGSAAVT SVFFQAATSG FGHAMKVSLI VTLIVTALSI PVVTLMPRKA AAEPQE