Gene Caci_4127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4127 
Symbol 
ID8335481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4667296 
End bp4668753 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content71% 
IMG OID644957230 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003114832 
Protein GI256393268 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.773525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGAC GCCCGGCGGA GACTTCGCGT CCCCACTACA ACGTCATCTT CGCGGTCCTG 
CTCATCGGCA TCTCCGCCTA CGCGGTCCTG CAGTCGCTCG TCGCCCCGGT CCTGGCGACC
TTCATCACCG CCCTGCACAC GACGCAGGAC ACGGCGACGT GGCTGATGAC GGCCTACCTG
CTGTCCGCCT CGGTCGCCAC CCCGATCCTC GGCCGCATCG GCGACAAGGT CGGCAAGGAG
CGGATGCTGG TCCTCACGCT GCTCGCGCTG ACCCTCGGCT CGGGGCTGGC GGCGCTGTCG
CACTCCGTCG CCCTGATGAT CATCGCGCGC GCGATCCAGG GCCTCGGCGG CGGACTGCTG
CCGCTGTCCT TCGGCATCAT CCGCGACGAG TTCCCGCCGG AGAAGGTGAA CTCCGCGATC
GGTCTCGGCT CGGCGACGGT CGCCGTCGGC GGCGGTCTCG GACTGCTGAT CGCGGGCCCG
ATCGTCACGC ACCTGAACTA CCACTGGCTG TTCTGGATAC CGATGGTGCT GACCGCGATC
GCCACCGTCG CTTGCTGGCG CTTCGTGCCG GAATCCCCGG TGCGCACGCC CGGGAAGATC
AGCTGGGGCG CGGCGGTCCT GCTCTCGGCC TGGCTCGTGA TGCTGCTGCT GGCCGTCAGC
GAGGGCCCGA CCTGGGGCTG GGGCTCGACG AAGGTGATCG GGCTGTTCCT CGGCGCGGTG
GTCTGCCTGC CGCTGTGGAT CCTCACCGAG CTGAAGTCGA GCGCGCCGCT GATCGACATG
CGGATGATGC GCCTGCCGGC GGTGTGGACG ACCAACGTCG TGGCGCTGCT GTTCGGCGTC
GGGATGTACA CGGTGATGAC GTTCCTGCCG CAGCTCGTGC AGACCCCCCG CGCCACGGCC
GGCTACGGGC TCAGCGCCAG CATCACGCAG TCCGGCGTCT ACCTGCTGCC CATGACGATC
GGCATGTTCC TGCTCGGCAT CGCCGCCGCG CCGCTGGCCA AGCGCATCGG GCTGAAGGCC
GTGCTGGTCC TCGGCTGCGC GGTCAGCATC CCCGGCTTCG CCGCCCTCGC CTTCGGGCAT
TCGCAGGGCT GGGAGATCTA CCTGGCGTGC GGACTGCTCG GCATCGGCAT CGGCCTGGCG
TTCGCCTCGA TGTCCGCGAT CGTGGTCCAG TCGGTCCCGG CCGCGCAGGT CGGCGTCGCC
AGCGGCATGA ACGCCAACAT CCGCACCATC GGCGGCGCGT TCGGCAGCAG CGTGGCGGCG
AGCGTCCTGG CCACCGGCGT CACCGCCGCC AACCCCCTGC CGAAGGACGC CGGCTACACG
CACGTGTTCT GGCTCCTCGC CGCCGCGGCG GTCCTCGCGA CCCTCGCGGC CCTGATCATC
CCGGCGGTCA AGGCGCGATC GGCGCCGACC ATCGACGAGC TGAGCGTGGA CGACGGCGCG
GTTCCCGCCG CCGCTTAG
 
Protein sequence
MPRRPAETSR PHYNVIFAVL LIGISAYAVL QSLVAPVLAT FITALHTTQD TATWLMTAYL 
LSASVATPIL GRIGDKVGKE RMLVLTLLAL TLGSGLAALS HSVALMIIAR AIQGLGGGLL
PLSFGIIRDE FPPEKVNSAI GLGSATVAVG GGLGLLIAGP IVTHLNYHWL FWIPMVLTAI
ATVACWRFVP ESPVRTPGKI SWGAAVLLSA WLVMLLLAVS EGPTWGWGST KVIGLFLGAV
VCLPLWILTE LKSSAPLIDM RMMRLPAVWT TNVVALLFGV GMYTVMTFLP QLVQTPRATA
GYGLSASITQ SGVYLLPMTI GMFLLGIAAA PLAKRIGLKA VLVLGCAVSI PGFAALAFGH
SQGWEIYLAC GLLGIGIGLA FASMSAIVVQ SVPAAQVGVA SGMNANIRTI GGAFGSSVAA
SVLATGVTAA NPLPKDAGYT HVFWLLAAAA VLATLAALII PAVKARSAPT IDELSVDDGA
VPAAA