Gene Caci_5238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5238 
Symbol 
ID8336592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6030426 
End bp6031625 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content71% 
IMG OID644958336 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003115938 
Protein GI256394374 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0376392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCTACT TCGCGTTGCT TCGCCTACCC TTCATCGCCC GGTTGCTCAG CGGCTCGCTC 
GTCGGGCGGC TGCCGGGTGG CATGTCCATC CTGATGATTC CGCTGGCTTT GCGCCGCTCG
GGGATGGATT ACGGTTTCGT CGGCATCGCC TCTGGCGCCT TGGCGGTCTC CACTGCGATC
GGCGGGCCGG TGTTGGGGCG GCTGGTGGAT CGGGTCGGGC AGGTCAGGGT ATTGGTGCCG
GCGGCGGCGG CTTCGGGCGT CGGGTTCGCG ATTTTGGCGG TGGCGCCGGG GAATCGGCTG
AGTGTGCTGG TCGGGGTGGT GCTGGCGGGA GCCGCCGCGC CGCCGTTGGA GCCGTGCATC
CGCATTCTGT GGCCCTCACT CGTGCCGGCT GAGGGTCTGG CGTCGGCGTA CTCGCTGGAT
TCGGCGGCGC AGGAGCTGAT CTTCGTGTCC GGGCCGTTGG TGGTCGCCGG GTGTGTGGCG
TGGGCTGCGC CGGTGGCGGC GTTGTGGTTG GGAGCGGCGC TCGGCGCGGT CGGGGTGCTG
GTGGTCGCGA CCAGCAAACC GGTGCGCGAG TGGCGTCCGG AGGTGCGCGA GGCGCACTGG
CTCGGGCCGC TGCGGAGTCC GGGGTTGGTG GTGCTGCTGA CCTCGCTGAT CGGGCTGGGT
ATCGCGATCG GGACGCTGAA CGTCGTGCTC GTGGACTACG CCGAGCACCA CAAGTTCCCC
GGCGGGGCGG GCACCTTGAT GGCGATCAAC GCCTTCGGCT CGCTGATCGG CGCGCTGGTC
TACGGCGCGC GCAAGTGGCC CGGCACGGCG ACCGGCCACC TGCTGACATT GCGCGTCGGC
CTCGGCGCCG CCTACGCCCT GCTCCTGCTG GTCCCGGCGC CGCCGCTGAT GGTGGCGATC
ATGGTGGTCT CCGGAGTCTG CTTCGCGCCC TCGCTGACCG TGCTGTTCAT GCTCACCGGC
GAGCTGGCGC CGGCCGGCAC CGCCACCGAG GCCTTCGCCT GGCTGATCAC GCTGTTCAAC
GTCGGCGCGG CCGCCGGCGC GGCGATCAGC GGCTTCGTCA TCGCGCACGC CGGGCTGTCT
CCGGCGGCGG TGACCGCGGT CGCCGGGATA GCCGCGGCGG TGCTCGTCCA GCTGGCCGGG
CGCCGCTACC TGCATCAGTC CGAGCCCGAG GGTCCGGTGG AAACTCCGGT GTACGCCTGA
 
Protein sequence
MGYFALLRLP FIARLLSGSL VGRLPGGMSI LMIPLALRRS GMDYGFVGIA SGALAVSTAI 
GGPVLGRLVD RVGQVRVLVP AAAASGVGFA ILAVAPGNRL SVLVGVVLAG AAAPPLEPCI
RILWPSLVPA EGLASAYSLD SAAQELIFVS GPLVVAGCVA WAAPVAALWL GAALGAVGVL
VVATSKPVRE WRPEVREAHW LGPLRSPGLV VLLTSLIGLG IAIGTLNVVL VDYAEHHKFP
GGAGTLMAIN AFGSLIGALV YGARKWPGTA TGHLLTLRVG LGAAYALLLL VPAPPLMVAI
MVVSGVCFAP SLTVLFMLTG ELAPAGTATE AFAWLITLFN VGAAAGAAIS GFVIAHAGLS
PAAVTAVAGI AAAVLVQLAG RRYLHQSEPE GPVETPVYA