Gene Caci_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2303 
Symbol 
ID8333652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2608602 
End bp2610158 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID644955456 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003113062 
Protein GI256391498 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGG CGGTGAGCGG TACCGAGGAC CTGGTCCTCC CGGTGGCCGA AGCGGTACCC 
GAGGACGAGC GACCGTCTAA GGGCAAGAAG AAGGATCCGA GCCAGTCGGC GATTCCGAAG
GGCGCGTGGG CGGTGATGTT CACCGTGCTC GGCGCCTCGA TGATGGACCT GCTGGACGCG
ACGGTGATGA ACGTCGCCGC GCCGTCCATC CGCAACGGTC TCGGTGCCTC GAACACCGAG
TACCAGTGGA TCAGCACCGG CTACGTGCTC TCCTTCTCCG TGCTGCTGAT CGCCGGCGGG
CGCCTGGGCG ACATAGCCGG GCGGCGCCGG ATGTTCCTGA TCGGCCTGAC CGGCTTCACG
ATCATGTCCG CGGTCTGTGC CATCGCGCAG AACCCCGGCG AGCTGATCGC GGCCCGCCTG
CTGCAGGGCG GCGCCTCGGC GATGATGATC CCGCAGGGCA TCGGCATGAT CCGGGAGAAG
TTCGGCCGGG AGAACAGCCA GAAGGCCTTC GCGATCTTCG GACCCTTCAT GGGTCTGTCC
GCCGCCCTGG GCCCGGTGCT CGGCGGCGCG CTGATCACCT ACTCCTCCTG GCGCTGGGTC
TTCGTCATCA ACCTGCCGGT CGGCGTGATC GCGCTGTACT TCGCCGCCAA GGTGCTGCCG
AAGGTGCACC ACAGCGCGGG CACCCGGCCG AAGCTGGACG TCATCGGCCT GATCTTCTGC
TCGGCCGCGG TCGGTCTGCT GGTCTACCCG GTCATCCAGG GCCGCGAGCA CCACTGGGAC
GCCGGCATCT GGCTGATGCT CGCCGGGTCG GCCGTGGTGA TGGCGCTGTT CGCGGTCTAC
GCGCGGGCCC GGCACAAGCG GAACCTGGAT CCGTTCCTGG AGACCAGCTT GTTCCGTAAG
CGCGCCTTCA CCACCGGGAC CATGACCATC TTCCTGTTCT TCGGCGCGTG CGCGGCGGCC
TTCACCGTCA GCCCGCTGCT GCTGCAGGTC TCCCTGGGCT GGTCGCCGCT GCGGGCCGGG
CTCACCGGCG CGTGGTGGTC GCTGGGCACG ATCATCTCGA TGGGCGCCGG GCAGGCCTTT
GTGAAGAAGA CGCCGCGCCG GGTGCTGCAC GCCGGCCTGC TGACGCTGGC CGCGGGCATG
GCGCTGAGCG CGTACATCAT CAAGCACTAC GCGGGCACCA CCTTCACGCT GAACGCCGAG
CACCAGCCGA TCTGGCACAG CGGGGTGACC AGCTGGAACC TGGCGCCGGC GCTGCTGGTG
TCCGGGATCG GCATGGGCCT GGTGTTCGCG CCGTTCTTCG GTCTGGTGCT GGCCGCCGTG
GACGACCACG AACTGGGCTC GGCCAACGGC GTGATCAGCT CCTTCAACCA GCTGGGCAAC
GCGGTCGCGG CGGCGCTGTT CAGCACGCTG TTCTTCAACA AGGTCGAGAG CGGCGGCTCG
CCGTTCCCCG CCGCCGAGCT GGTGTACTGG CTGGCTGCCG GAATCCTGGT GCTGACCTGG
TTGCTGGCGT TCACGGTGCC GAAGACGGCG CGCAGTGAGG ACGAGATCAT GGTGTGA
 
Protein sequence
MSEAVSGTED LVLPVAEAVP EDERPSKGKK KDPSQSAIPK GAWAVMFTVL GASMMDLLDA 
TVMNVAAPSI RNGLGASNTE YQWISTGYVL SFSVLLIAGG RLGDIAGRRR MFLIGLTGFT
IMSAVCAIAQ NPGELIAARL LQGGASAMMI PQGIGMIREK FGRENSQKAF AIFGPFMGLS
AALGPVLGGA LITYSSWRWV FVINLPVGVI ALYFAAKVLP KVHHSAGTRP KLDVIGLIFC
SAAVGLLVYP VIQGREHHWD AGIWLMLAGS AVVMALFAVY ARARHKRNLD PFLETSLFRK
RAFTTGTMTI FLFFGACAAA FTVSPLLLQV SLGWSPLRAG LTGAWWSLGT IISMGAGQAF
VKKTPRRVLH AGLLTLAAGM ALSAYIIKHY AGTTFTLNAE HQPIWHSGVT SWNLAPALLV
SGIGMGLVFA PFFGLVLAAV DDHELGSANG VISSFNQLGN AVAAALFSTL FFNKVESGGS
PFPAAELVYW LAAGILVLTW LLAFTVPKTA RSEDEIMV