Gene Caci_6224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_6224 
Symbol 
ID8337587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp7163793 
End bp7164944 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID644959325 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003116919 
Protein GI256395355 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.806406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGTGC TGCGGGGTCC TCGATGTCGA GGCCTGCGAG GCGGTCTTCG AGCAGGTCGC 
GATAGCGGTG CAGGTGGGTG CGCGACGTTG CAGGAACTCC CTCCCTTCCT GTCCGCGAAG
TTCCACGTCG GACCGGCGGT GACCGGCTTG GTCGTGGGCG CCGCCTTCGC CGCGACGGCG
GCCGGGCGCC CCTTCGCCGG CCGGTGGGGC GATGCGGGAC GGTCGCGCGC GGTGGTCGTC
CTCGGCGGGC TGCTGGCGTC GGCAGCCGGT GCGGGCACGG TGTTCGCGCC CGATGTCGCA
GTGCTCTTAC TATGCCGCCT CGTGATGGGT GCGGGGGAGG CGGCGCTGTT CTCCGGCGCC
CTGCCGTGGG TGCTCGCCGA CGTGCTGCCC GAGCAGCGCG GCCGGGTGGC GGGCTGGTTC
GGCTTGTCCA TGTGGGGCGG GCTGACGCTG GGACCGGTGC TCGCCGCGAC CATCGGGGAG
GCCGCGAACG CCGACGCGGT GTGGTGGACG GTCGCGGCGT TGCCGTTGGT GTCGGCGGTC
CTGGCGGCGA CCGCTCGCAA GCCGGAATTC GTTACTACCA AACGGATTCC GGGCCCGCTC
GTACCTCCAG GTGTAAGTCT GCCGGGGTCG ATCATCGGCT TGGCAGCATA CGGATACGGC
ACGCTCGCGG CGGTGCTCGT GCTCTCGCTG GGGCACTCTG CGGGCAAGGT CGTGGCGTTG
CCCGTCTTCG CCTGCGCGTT CCTGGTGGTC CGAGGGTTCG GGAGCCCGCT GGTCGATCGT
CACGGCGGCG CGGTGGTCGC CTGCGTCACC TTGGTGGTGC AGGCCGCGGG GCTGATCTTG
TTGGCGGCGG CGCCGGATCT CGTCGTCACG CTCGCAGGCG CCGCGATCAC CGGCGCCGGG
CTCGGACTCA TCTATCCGGC GACCGCCGCG ATGACGTTGG GCATGGCTCG GGCGGAGACG
GCGGGAGCGG CGGTCGGGGC GATGACCTCG TTCTGGGACC TCGGGATTCT GGTCGCGGGT
CCGCTCAGCG GAACGATCGC GGCGGCTGCC GGTTTCCACG CCGCGTTCGG CGTTGCCGTG
GTGACGACGG TGCTCGCGGT GATCTTGGCT GTCCGGACCT CGATGAGCGG CGTCGCTGCG
TCCGTGCACT GA
 
Protein sequence
MPVLRGPRCR GLRGGLRAGR DSGAGGCATL QELPPFLSAK FHVGPAVTGL VVGAAFAATA 
AGRPFAGRWG DAGRSRAVVV LGGLLASAAG AGTVFAPDVA VLLLCRLVMG AGEAALFSGA
LPWVLADVLP EQRGRVAGWF GLSMWGGLTL GPVLAATIGE AANADAVWWT VAALPLVSAV
LAATARKPEF VTTKRIPGPL VPPGVSLPGS IIGLAAYGYG TLAAVLVLSL GHSAGKVVAL
PVFACAFLVV RGFGSPLVDR HGGAVVACVT LVVQAAGLIL LAAAPDLVVT LAGAAITGAG
LGLIYPATAA MTLGMARAET AGAAVGAMTS FWDLGILVAG PLSGTIAAAA GFHAAFGVAV
VTTVLAVILA VRTSMSGVAA SVH