Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0539 |
Symbol | |
ID | 8331866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 623267 |
End bp | 624508 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644953696 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003111323 |
Protein GI | 256389759 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.557555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.112071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCGA GCGAGCAGGG GAAACATGTG ACGCAGGTGC AGCCTCCACC AGTGCGGCGG CTGTGGTCCG CCGTCTTCTT CGGGTACCTC GCGCTGGGGG CGACGCTCCA GGAACTGCCC GGCTACATGA CGTCGAAGTT CGGCGACGGC CCCACGATCA TCGGCGTCGC GGTGGGCATC GCCTACCTGG GCACCGCCGT GACCCGACCG TTCGCCGGCC GGGCCGGGGA CGCGGGGCTG GCCAGGAACG TCTCCGTAGC CGGCGGCGCG ATCACCACGC TGGCCGCCCT CGGCCAGCTG ACCGCCCCGT CCGCGCTCGT GCTGATCATT TTCCGGCTGC TGATGGGCAT CGGCGAGGGG GCGCTGTTCT CCGGCTGCCT GCCCTGGGTG CTCACGGGGA TCGCGGCCGA CCGGCGCGGC CGGATCGCGG GCTGGTTCGG ACTGTCGATG TGGGGCGGCC TGGCGCTCGG GCCGCTGGCC GCGGTCGGGG TGAACCACCT CGGCGGGTCG ACCGCGACGT GGTGGACGAT CTTCGGCCTG CCGCTGGTTT CCAGCGTGCT GATCGCCTCC ACCAGGCCGC AGCCCGCGGT CTCGCCCCGA CGCGAGATCC GGCCGCAGGG CTGGCGGGAC ATCGTGCCGA TCGGCGTCAG CGTGCCGGGC ATCGTGCTCG GGCTCGCCGC CTACGGCTAC GGCACCCTGA ACGCACTGCT CGTCCTTTAT TTGACGCACG ACCACATCGG CGGCCAGGGC ATCGGCCTGA CCGTGTTCGC CGTGGCGTTC CTGGCCACAC GCGCCGCCGG CAGCCCCCTG ACCGACCAGT ACGGCGGCAT CCGGGTCGCC CGGGTCACGC TGGTCGTCGA GATCGCCGGG CTCTGCGTTC TGGCCGCCTC CTCCTCCCAG GGCGGTGCGC TGGCCGGCTG TGTCGTCACC GGCATCGGGC TCGGCGTCAT CTATCCGTCC ACCAGCAAGA TCACACTCGG CCGCACCGGT CCGCTGCAGG CCGGCGTGTC GATGGGCACG ATGACCTCGT TCTGGGACCT GGGGATCATG GCGGCCGGGC CGATCAGCGG CGCGGTCGCG GCGCACCTGG GGTACCGGGA GGGCTTCGGG GTCGCGGCGG CGGTGACCGT CGCGGCGCTG GTGCTCACGG TGCTGGGGCT GCATACGGAC TCCCCGGCGG AGGCGCCCAC GTCGGTGCCG CGGTCGGTCC CGGCTGGCGC GCAGGTGCGC CCGCGCGCCT GA
|
Protein sequence | MASSEQGKHV TQVQPPPVRR LWSAVFFGYL ALGATLQELP GYMTSKFGDG PTIIGVAVGI AYLGTAVTRP FAGRAGDAGL ARNVSVAGGA ITTLAALGQL TAPSALVLII FRLLMGIGEG ALFSGCLPWV LTGIAADRRG RIAGWFGLSM WGGLALGPLA AVGVNHLGGS TATWWTIFGL PLVSSVLIAS TRPQPAVSPR REIRPQGWRD IVPIGVSVPG IVLGLAAYGY GTLNALLVLY LTHDHIGGQG IGLTVFAVAF LATRAAGSPL TDQYGGIRVA RVTLVVEIAG LCVLAASSSQ GGALAGCVVT GIGLGVIYPS TSKITLGRTG PLQAGVSMGT MTSFWDLGIM AAGPISGAVA AHLGYREGFG VAAAVTVAAL VLTVLGLHTD SPAEAPTSVP RSVPAGAQVR PRA
|
| |