Gene Caci_4580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4580 
Symbol 
ID8335934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5212697 
End bp5214118 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content73% 
IMG OID644957681 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003115283 
Protein GI256393719 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.259189 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGCG CCGATCCGAT AGGACAGAGG ACGCGCGCCG AGCCGCAGGC TGCCGCCGCC 
ACGCCGCTCC CCCGCGCGCT GCTGCTCATC GGCGTGGTTA TGGCGGTGGT CGGCAGCGTC
GGCGGGCCGC TGATCACGAG CGTCGCGACG ACGCTGCACG TCTCGCTCGC GGCGGCGCAG
TGGACGCTGA CCGTCTCGCT GCTCACCGGG GCGGTGGCGA CGCCGGTGCT CGGGCGGCTC
GGGTCCGGGG CGCGCCGGCG TCCGGTCGTG CTGGGCACGC TCGGTGTCAT CGTGGTCGGC
AGTGTCCTGA CGGTGCTGCC GCTGCCGCTC GGGCCGCTGC TGGTGGGGCG GGCGGCGCAG
GGGTGCGGGC TGGCGCTGGG ACCGCTGATG ATGGCCTCGG CGCGCGAGCA TCTGGACAAG
GCGCGGGCCG CCTCGACGAT CGCGATGATC TCGGTCGCCT CGACGGCGGG GGTCGGGTTC
GGGTATCCGC TCGCGGGCTA CCTCACCGAT GTCGGTGGCA TTCGGCTCGC CTATGCGGCC
GCGCTGGGAC TGTCGGCGGT GGCGTTCCTG GTGGCGTTCC GCTTCTTCCC CGCCTCGCAG
CAGCCTGTCG CTCCCGCGCC CGACGCGGCG GCGAGTCTGC TGCTCACCGC CGGGGTGCTG
GTGCTGTTGC TCGTCCTCGG TGAGGCGAGT CTGTGGCAGG ACCATCTGGC GGTGGCGGTC
GGCGGGGTCG TCGCGGCGCT GCTGCTCGTG GCGTGGACGC TGCGGGAACG CGCCCTGGTC
AATCCGCTGG TCGATCTGCG TCTGCTGCGA CACCGGGCGG TGGCCGGCGC GAACGCCGCG
ATGCTGGTGG CAGGTGTCGG AATGTATCTG CTGCTGTCGT GCATCACCCG GTACGTGCAG
ACTCCGCGCG TCGCCGGCTA CGGCTTCGGG CTGAGCACGT TCACCGCGGG CCTGTTCCTC
GTCCCGTTCT CGGTGGTCGG CTTCGTGGCG GGCCGCCTGA GCCCGCGGGT GCGCAGGTAT
CTGTCGGCGC CGGCGCTGGT CACGGCCGCG ACCGCGGTGG TGATCGGCGC GTTCGCGCTG
TTCGCCGTGG CGCGCGGGTC CATCGCCGGT CCGCTGGTGT CGATGACGCT GCTGGGCTTG
GGCGTCGGCG CCTTCTCCGC GGCGATGCCC GCGGTGATCC TGGAGGTGAC TCCGGCGCAG
GAGACGGCGA GCGCCATGGG CGTCAACCAG GTGGTGCGCA GCATCGGGTT CTCGCTCGGC
AGTACCCTGA GCGCCCTGCT CTTGGCCGCG CACACACCGG CGGGCGCCGT CTTCCCCACC
GACGAAGGCT TCACGGTGAC GGCGTGGATC GGCGCCGCCG TCACCGCGAT CGCCTTGGGG
ATCAGCTTTA CGCTGCGCCC GGAGTCGGAA CCGGAGCGGT GA
 
Protein sequence
MSGADPIGQR TRAEPQAAAA TPLPRALLLI GVVMAVVGSV GGPLITSVAT TLHVSLAAAQ 
WTLTVSLLTG AVATPVLGRL GSGARRRPVV LGTLGVIVVG SVLTVLPLPL GPLLVGRAAQ
GCGLALGPLM MASAREHLDK ARAASTIAMI SVASTAGVGF GYPLAGYLTD VGGIRLAYAA
ALGLSAVAFL VAFRFFPASQ QPVAPAPDAA ASLLLTAGVL VLLLVLGEAS LWQDHLAVAV
GGVVAALLLV AWTLRERALV NPLVDLRLLR HRAVAGANAA MLVAGVGMYL LLSCITRYVQ
TPRVAGYGFG LSTFTAGLFL VPFSVVGFVA GRLSPRVRRY LSAPALVTAA TAVVIGAFAL
FAVARGSIAG PLVSMTLLGL GVGAFSAAMP AVILEVTPAQ ETASAMGVNQ VVRSIGFSLG
STLSALLLAA HTPAGAVFPT DEGFTVTAWI GAAVTAIALG ISFTLRPESE PER