Gene Caci_0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0539 
Symbol 
ID8331866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp623267 
End bp624508 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content73% 
IMG OID644953696 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003111323 
Protein GI256389759 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.557555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.112071 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCGA GCGAGCAGGG GAAACATGTG ACGCAGGTGC AGCCTCCACC AGTGCGGCGG 
CTGTGGTCCG CCGTCTTCTT CGGGTACCTC GCGCTGGGGG CGACGCTCCA GGAACTGCCC
GGCTACATGA CGTCGAAGTT CGGCGACGGC CCCACGATCA TCGGCGTCGC GGTGGGCATC
GCCTACCTGG GCACCGCCGT GACCCGACCG TTCGCCGGCC GGGCCGGGGA CGCGGGGCTG
GCCAGGAACG TCTCCGTAGC CGGCGGCGCG ATCACCACGC TGGCCGCCCT CGGCCAGCTG
ACCGCCCCGT CCGCGCTCGT GCTGATCATT TTCCGGCTGC TGATGGGCAT CGGCGAGGGG
GCGCTGTTCT CCGGCTGCCT GCCCTGGGTG CTCACGGGGA TCGCGGCCGA CCGGCGCGGC
CGGATCGCGG GCTGGTTCGG ACTGTCGATG TGGGGCGGCC TGGCGCTCGG GCCGCTGGCC
GCGGTCGGGG TGAACCACCT CGGCGGGTCG ACCGCGACGT GGTGGACGAT CTTCGGCCTG
CCGCTGGTTT CCAGCGTGCT GATCGCCTCC ACCAGGCCGC AGCCCGCGGT CTCGCCCCGA
CGCGAGATCC GGCCGCAGGG CTGGCGGGAC ATCGTGCCGA TCGGCGTCAG CGTGCCGGGC
ATCGTGCTCG GGCTCGCCGC CTACGGCTAC GGCACCCTGA ACGCACTGCT CGTCCTTTAT
TTGACGCACG ACCACATCGG CGGCCAGGGC ATCGGCCTGA CCGTGTTCGC CGTGGCGTTC
CTGGCCACAC GCGCCGCCGG CAGCCCCCTG ACCGACCAGT ACGGCGGCAT CCGGGTCGCC
CGGGTCACGC TGGTCGTCGA GATCGCCGGG CTCTGCGTTC TGGCCGCCTC CTCCTCCCAG
GGCGGTGCGC TGGCCGGCTG TGTCGTCACC GGCATCGGGC TCGGCGTCAT CTATCCGTCC
ACCAGCAAGA TCACACTCGG CCGCACCGGT CCGCTGCAGG CCGGCGTGTC GATGGGCACG
ATGACCTCGT TCTGGGACCT GGGGATCATG GCGGCCGGGC CGATCAGCGG CGCGGTCGCG
GCGCACCTGG GGTACCGGGA GGGCTTCGGG GTCGCGGCGG CGGTGACCGT CGCGGCGCTG
GTGCTCACGG TGCTGGGGCT GCATACGGAC TCCCCGGCGG AGGCGCCCAC GTCGGTGCCG
CGGTCGGTCC CGGCTGGCGC GCAGGTGCGC CCGCGCGCCT GA
 
Protein sequence
MASSEQGKHV TQVQPPPVRR LWSAVFFGYL ALGATLQELP GYMTSKFGDG PTIIGVAVGI 
AYLGTAVTRP FAGRAGDAGL ARNVSVAGGA ITTLAALGQL TAPSALVLII FRLLMGIGEG
ALFSGCLPWV LTGIAADRRG RIAGWFGLSM WGGLALGPLA AVGVNHLGGS TATWWTIFGL
PLVSSVLIAS TRPQPAVSPR REIRPQGWRD IVPIGVSVPG IVLGLAAYGY GTLNALLVLY
LTHDHIGGQG IGLTVFAVAF LATRAAGSPL TDQYGGIRVA RVTLVVEIAG LCVLAASSSQ
GGALAGCVVT GIGLGVIYPS TSKITLGRTG PLQAGVSMGT MTSFWDLGIM AAGPISGAVA
AHLGYREGFG VAAAVTVAAL VLTVLGLHTD SPAEAPTSVP RSVPAGAQVR PRA