Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4121 |
Symbol | |
ID | 8335475 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4662942 |
End bp | 4664357 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957224 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003114826 |
Protein GI | 256393262 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.878146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACA AATCTCACCG CGCCAAAGCA CTGCTGGTCG CCGGGTGTTT CTTCATGGAG ATGCTCGACG GCACCATCGT CACCACCGCC GCGCCGCAGA TGGCCCGGTC CCTGCACGTC AGCTCCTCGG CCATCGGGCT GGTCATCACC TCCTTCCTGC TCACCTTCGC CGCGCTGATA CCGCTCAGCG GCTGGCTGAC CCGCAGGTGG GGCACGCGGC CGGTGTTCCT CGCCGCGATC GCCGTGTTCA CCGCCGCCTC CTTGGGCTGC GCGCTCAGTA CGACGCTGCC GGTGCTGATC GCCATGCGGG TCCTGCAGGG CTTCGGCGGC GCGATGATGG TGCCGGTCGG ACGGCTGATC GTGTTGGCCG GCGCGGAGAA ATCCGACCTG CTGCGGCTCA TGGCGTACAT CGTGTGGCCG GGGTTGATGG CTCCGGTCGT GGCGCCGCTG GCCGGCGGGC TCATCACGAC CTACGCCTCC TGGCCGTGGC TGTTCGGGAT CAACATCCCG CTCGGCGTGG TGGCGTTCGC CATCGCCTGG CGCGTCGTGG AGGCGGCGCC CACCGAGCGG CCGCCGCGGC TGGACCGGCT CGGCGTCGTG CTCACGTGCC TCGGGCTCGG CGGCCTCACG TATGCCGCGC ACCTGTTCTC CGACACCGAC ATCTCCTGGG CCACCGCCAT CGCCACCGCT GTGGTGTCGG TCGTCCTGCT CGCCGCGGCG ACGCGCCACC TGTTGCGGAC CGAGGCGCCG CTGGTGAATC TCAGGGTCCT GCGGATCGCG ACGCTGCGGA CGTCGGTCAG CGGGGGTTCG GTGTTCTGGC TCGTCGTCGG CGCCGGACCG TTCCTGCTGC CGCTGCTGTT CCAGAACGTG TTCGGCTGGA GCGCCGTGAA GTCCGGGGCG GTGGTCCTGT TCATCTTCGT GGGCAACATC GGCATCAAGC CGGCGACCAC ACCGATGCTC AACCGCTTCG GCTTCCGTCC GGTGCTCGTC GCCTCCACGC TGGTGATGGC GGCGGCGATG GCTGCCGCCG GGCTGCTCAC CGCCCACACG CCGATCGTCC TCACCTGCGC GCTGATCCTG CTCAGCGGCA TCGCCCGCTC GGTCGCGCTG ACCGCGTTCA GCACCATCGC CTACAGCGAC GTCGGCCCGG AGGAGATGCG CGACGCCAAC TCCATCGCCG CCACCGCCTT CCAGATGTCC GCGGGACTGG CGATCGCCGT GAGCACCATC GCCCTGCGCG CCGGCGGACC CTTGGGACGG CTGCTGCCGG GAGCGCCGAG CGCCGGGACC GCCTACACCG TCGCGTTCCT CATCCTCGCG CTGTTCTCGC TGAGCGTGAC GGTGACCGCG TTGCGCATGC ATCCCGACGC CGGCGCACGC GTGCGGCGCG TGCGGCCGGT CGCGGCGCGT CCGTGA
|
Protein sequence | MIDKSHRAKA LLVAGCFFME MLDGTIVTTA APQMARSLHV SSSAIGLVIT SFLLTFAALI PLSGWLTRRW GTRPVFLAAI AVFTAASLGC ALSTTLPVLI AMRVLQGFGG AMMVPVGRLI VLAGAEKSDL LRLMAYIVWP GLMAPVVAPL AGGLITTYAS WPWLFGINIP LGVVAFAIAW RVVEAAPTER PPRLDRLGVV LTCLGLGGLT YAAHLFSDTD ISWATAIATA VVSVVLLAAA TRHLLRTEAP LVNLRVLRIA TLRTSVSGGS VFWLVVGAGP FLLPLLFQNV FGWSAVKSGA VVLFIFVGNI GIKPATTPML NRFGFRPVLV ASTLVMAAAM AAAGLLTAHT PIVLTCALIL LSGIARSVAL TAFSTIAYSD VGPEEMRDAN SIAATAFQMS AGLAIAVSTI ALRAGGPLGR LLPGAPSAGT AYTVAFLILA LFSLSVTVTA LRMHPDAGAR VRRVRPVAAR P
|
| |