Gene Caci_5131 details


Gene Information

Locus tag: Caci_5131
Symbol:
ID: 8336485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 5889854
End bp: 5891317
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 70%
IMG OID: 644958229
Product: major facilitator superfamily MFS_1
Protein accession: YP_003115831
Protein GI: 256394267
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
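
The coordinates and lengths listed above are consistent with each other, which a few lines of arithmetic can confirm. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that counts the terminal stop codon (the gene sequence in this record ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(5889854, 5891317)
print(length)                     # 1464
print(protein_length_aa(length))  # 487
```

Both results match the Gene Length and Protein Length fields in the record.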


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.22804
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCG AGTCCACACG GACCACTGAG ACCGCCTCCC TCCGACCCCC TTCCGAACCC 
GCCCTCCCCG ATCCCCGCCG CTGGTGGGTC CTGGCCGCGG TCGCCGCGGC GCAGCTGATG
ATCGGCCTGG ACCTGACGAT CGTGAACATC GCGCTGCCGT CGATGCAGCG CTCGCTGGGC
CTGTCCGATC CCGCGCGCCA ATGGGTGATC ACGATCTTCG CGCTCTGCTA CGGCGGCCTG
CTCCTGCTGG GCGGCCGCAT GTCAGACCTC ATCGGCCGCC GGCGCGCGCT GATGATCGGT
CTGACGGGCT TCGCCTTGGC CTCCGCGCTC GGCGGCGCGG CGACCGACCC GGCGATGCTG
CTCGGCTCAC GCGCCGCTCA GGGCGTGTTC GGGGCGCTGC TGACGCCGGC GGTGCTCGCG
ACACTCGCGA CCACGTTCAC CGTTCCTGCC GAACGCGGCA GGGCTTTCGG TATCTATGGA
ACCGCGATGG GCAGCGCGTC CGGCGTCGGC GTCATGCTGG GCGGCGTGCT CACCCAGTAC
CTCGACTGGC GCTGGTGCAT GTATGTGAAC GTGCCGATCG CGATCGCTGC CGCGGCCGGG
GTTCTGTACG CGGTGCGTCC CGTTCCGCGC AACGCCGGGG TGCGTGTCGA CGTGCTCGGT
GCGCTGCTGG CGACCGGGGG CATCATGGCT CTGGTCTTCG GCTTCGCTCG GGCTCAGACC
GACGGGTGGA GTGCGGCGAT CACCGTTGTG CCGCTGGTGA TCGGGGTGCT GACGCTGGTC
GCCTTCGTGT TCGTGCAGGC TCGGACCGCC AAGCCGCTCC TTCCGCTGCG GGTCGTCCTC
AACCGGCGGC GAGCCGGCTC CTACCTGGCG GTGTTCAGCC TGGCTGTCGG CATGTTCGCC
GCGCTGTTCT TCCTGACGTT CTTCCTGCAG AACGTCCTGG GCTACTCACC GATCCGCGCC
GGTCTGGCCT TCCTGCCTCT GACGGTCGGA CTCATCGCCG GAGTGCGGGC GGTGACCAAG
CTTCTACCGC GCGCGCCGGT GCGCTTGCTG CTGTGCCCGG GCCTGCTGAC CATCGCAACC
GGTCTGGCCC TGCTCGGCGT CGTGAAGGCT GACAGTGGCT ACTGGCTTCA TGTGTTCCCC
GTGTTCTTCC TCGTCGGTAT CGGCACCGGG TGGGTGCTGA TCACCGCGAA CAGCGCGGCG
ACACTGGGCG CCGGTCAGGA CACCGCGGTG GCGGGAGCGA TGGTCATGAC CTCACAACAG
ATCGGCGCGT CCCTCGGCAC GGCGGTGCTC AGCACGATCG CCGGAACGGC TGCCGCCGGA
TACCTCCACG GGCACCCCGG CTCCGGCGCC AGGGCTGTGG TCCACGGCTT CGACGTCGCG
AGCCTCGGCG CCGCCGGGTT CCTCTGCCTG GCCGCGGTGA CGGTCTTCCT CGTCAGCGGC
AGGGGCGAAC CGAAGGTCCG GTAA
 
Protein sequence
MTTESTRTTE TASLRPPSEP ALPDPRRWWV LAAVAAAQLM IGLDLTIVNI ALPSMQRSLG 
LSDPARQWVI TIFALCYGGL LLLGGRMSDL IGRRRALMIG LTGFALASAL GGAATDPAML
LGSRAAQGVF GALLTPAVLA TLATTFTVPA ERGRAFGIYG TAMGSASGVG VMLGGVLTQY
LDWRWCMYVN VPIAIAAAAG VLYAVRPVPR NAGVRVDVLG ALLATGGIMA LVFGFARAQT
DGWSAAITVV PLVIGVLTLV AFVFVQARTA KPLLPLRVVL NRRRAGSYLA VFSLAVGMFA
ALFFLTFFLQ NVLGYSPIRA GLAFLPLTVG LIAGVRAVTK LLPRAPVRLL LCPGLLTIAT
GLALLGVVKA DSGYWLHVFP VFFLVGIGTG WVLITANSAA TLGAGQDTAV AGAMVMTSQQ
IGASLGTAVL STIAGTAAAG YLHGHPGSGA RAVVHGFDVA SLGAAGFLCL AAVTVFLVSG
RGEPKVR
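
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The following is a minimal standard-library sketch, not the database's own method; `translate` and `gc_content` are hypothetical helper names. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used to block the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACGACCG AGTCCACACG GACCACTGAG"))  # MTTESTRTTE
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above. Running `translate` and `gc_content` on the full gene sequence would allow the same comparison against the Protein Length and GC content fields.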