Gene Caci_0689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0689 
Symbol 
ID8332019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp800060 
End bp801301 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content73% 
IMG OID644953841 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003111465 
Protein GI256389901 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.549457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCTCGA CCATCGAAGC GGGTGTCGGC ACCGAAGACA TCCGCAAGGA CACTGTGTTC 
GGCCGCATCG GCGGCGCGCT ACGCGAACGC GAATTCCGGT GGTGGTTCGC CGGGCAGATC
ACGTCGGCGT CCGGGGTGAT GGCGCAGGGG GTGGCGCTGT CGTGGTGGAT GCTGCAGCGG
ACCGGCGACG CGGTGTGGCT CAGCGTACTG ACGGTGTGCA CGATGGGCCC GACGCTGATC
GGCGGCGCGT GGGCCGGGGC GGTGGTGGAC CACGCGGATC GCCGGCGGCT GCTGATCGGC
ACGCAGACGG TCCTGATGGG CATCGCCGCG GCGCTGACCG TCCTGGCCGC GACCGACACG
CTCGCGGTGT GGAACGTGCT GGTCGCCTCG GTGCTGGCCG GGACGACCAT GGCCGTGGAC
TCGCCCGCGC GGCAGGTGTA CGTCGTGGAC CTGGTCGGCG CCGACGGCGT CGCGAGCGCG
GTCGGGCTGT GGGAGGTGGC GCTGAACACC TCACGGGTCG TGGGTCCGGG CTTGGGCGGC
GCGCTGCTCG CAGGTCCCGG CGCCACCGCG TGCTTCGGCG TGAACGCGTT TTCTTATCTG
GCGCCGCTGA TCGTGCTGCT CCGGATGAAG CCCCGGACGA CCGCGCAGGT TCGGACGCGC
GGACGTGCCC GGGGCGCCGC GCGTGACGGC ATCCGCTACG CGTTCCGCTC GCCGGTCATC
AGGGCTCTGC TTCCGATGTC GACGGCTTCC GGCTTGATCT TCGGCATGGG TATCGCGCTG
CCGCCGCTGG TCCAGCGTGC TCTGCACCAG GGCGGCGGCG GGTACGGCGC GATGATGGCG
GCGTTCGGCG TCGGCGGGCT GCCCGGGGCG CTGCTGGCCG CCGCCCAACC CGAGCCGACC
GGCCGCCGCG TGCGCTGGCT CGCGCTGGCG ACCGCGGCGG CGGTGATCGG GACCGCGGTG
GCGCCGGTGA TGGCGGTCGC GTTGGTGGGG ATGGTGGCCC TCGGCCTGAC GTCGATCTGG
TTCATCGCCT CGGCCAACAC CCTGGCGCAG TTGCGGTGCG CGCCGGACAT GCGCGGCCGG
GTGATGAGCC TGTGGGGCGT GGCGATGATG GGGACCGCGC CGATCACCGG GTTCGGCGTC
GCGGCGGTGG TGCAGTACGT CGGACCGCGC GAGGGGTTCT CCATCTCGGG CATCGCGCTC
GGGCTGGCCG CCGTCGTCGG CTGGCGGGCG TTGCGCGACT AG
 
Protein sequence
MSSTIEAGVG TEDIRKDTVF GRIGGALRER EFRWWFAGQI TSASGVMAQG VALSWWMLQR 
TGDAVWLSVL TVCTMGPTLI GGAWAGAVVD HADRRRLLIG TQTVLMGIAA ALTVLAATDT
LAVWNVLVAS VLAGTTMAVD SPARQVYVVD LVGADGVASA VGLWEVALNT SRVVGPGLGG
ALLAGPGATA CFGVNAFSYL APLIVLLRMK PRTTAQVRTR GRARGAARDG IRYAFRSPVI
RALLPMSTAS GLIFGMGIAL PPLVQRALHQ GGGGYGAMMA AFGVGGLPGA LLAAAQPEPT
GRRVRWLALA TAAAVIGTAV APVMAVALVG MVALGLTSIW FIASANTLAQ LRCAPDMRGR
VMSLWGVAMM GTAPITGFGV AAVVQYVGPR EGFSISGIAL GLAAVVGWRA LRD