Gene Information |
Locus tag | Caci_3500 |
Symbol | |
ID | 8334853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3903292 |
End bp | 3904833 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644956644 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003114247 |
Protein GI | 256392683 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
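The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3903292
end_bp = 3904833


def gene_length(start, end):
    """Length in bp, assuming inclusive 1-based coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Length in aa, assuming the final codon is a stop codon (an assumption)."""
    return gene_len_bp // 3 - 1


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the Gene Length (1542 bp) and Protein Length (513 aa) fields.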
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.021178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCT CCGCCGCCGT CGCGGATCCG GCCTCCGCCT TCAGCGAACG CGCCGCCCGG CGCGCCGTCT GGCTGGTGCT GAGCGCGACG TTCGTCGTCT CCGCGGACAT CTCGATCGTC GCCGTCGCCG CCCCGCCGAT CCAGCGCGGC CTGCACGCGA GCTCCGGGGA TATCGAGCTG ACCGTCGCCG CCTACCAGAT CGCCTACGCC GCGCTGCTGA TCACCGGCGG CCGGCTCGGC GACATCTTCG GCCGGCGCGC GCTGTTCACC TGGGCGTTCG CCGGCTTCGT CCTCACCTCC GCAGCCTGCG GCCTGGCGAC CTCCCCGGGC CAGTTGGTGG CGTTCCGCGC GCTGCAAGGC GTGACCGCCG CGATGCTCTC GCCCCAGGTG ATGGCGACCA TCCAGATCAT GCTGCCGCCC GAGAAGCGCG CCGCGGCGTT CGGCGCGCAG GGCGCGATGC TCAGCCTCGC CACAGTCATC GGGCCGGTGT TCGCCGGACT GCTGTACTCC GGGAACATCA TGGGCCTGTC ATGGCGGCCG ATCTTCCTGG TGAACGTGCC CTTCGGGCTG GCGGCGATCT GGCTGGGCCG GCGCTACCTG CCCTCGCTGC GCAATCCCGA GGCCAAAAGC CTCGACCTAC CCGGTACGTG TCTGGTCGTC CTCGCGTTGG TCGCGCTCAT GACGCCGCTG TCACTGGGCG AGCAGTACGG ATGGCCGCTG TGGTGCTGGC TGAGCCTGGC TGCCTCGCCG GTGCTGATCC TGGCGTTCCT GAAGCTGCAG CAGGCTGAGG AGCGGCGCGG CGGGTCTCCG CTGCTGCCGA CCGACCTGTG GCGAGACCGG GCGTTCCGTA CCGGCGTCGT GCTCTTCCTG CTGGCGTTCA GCGGGGTCGT GTCCTTCTTC CTGTACTACT TCACCCTGAT CCAGACCGCG TACAACGTCT CCACGCTGTG GGCCGCGGTG ACCACGATCC CGGTCGGGAT CGGCACGATC GCGCTGTCGG CTGCCTCGGG GCGGCTGGTC CGCGCCTGGG GCGGGCGCCG GGTCGCCTCG GTCGGGGCGA TCGTGTGCTG CTTCGGCGCG CTGTCGATGT TCATCCCGGT GGTCGCGGTC ACGGACTCCT CGCTGGCGCT GTGGTCCATC CCGTCGCAGC TGGTGCTCGG CTCCGGAATC GGGATGCTGT TCGCTCCGCT GCTGTCGGTG GTCCTCGCTG GAATCCGCAG CACGCACGCC GGCGCCGCCG CCGGACTGCT GGTGACGATG CAGATCGCCG GCGGTGCGCT GGGGGTCAGC GCCATGGGAG TGCTCTTCAA CTCGCGGCTG CCCGGAGGCT CCACGGACCA CGCGTCCCAC GGACAGCTCT CCTCGGCGAT GGTCCACGCC ATGCTCTACA ACCCGGTCTC GTTCCTGGCG GCGCTGCTGG TGATCCTCGT TCTGCCGAGG ACGGTGCGCA GTGCCGGAAG GGCTGCCGGG CCCAAGGGGA CGCCGGCCGG CGCGGCGGGT GCTGCGGGAG CGCCGGGAAC TCCGGGAGCT GCTCATGCCT GA
|
Protein sequence | MTTSAAVADP ASAFSERAAR RAVWLVLSAT FVVSADISIV AVAAPPIQRG LHASSGDIEL TVAAYQIAYA ALLITGGRLG DIFGRRALFT WAFAGFVLTS AACGLATSPG QLVAFRALQG VTAAMLSPQV MATIQIMLPP EKRAAAFGAQ GAMLSLATVI GPVFAGLLYS GNIMGLSWRP IFLVNVPFGL AAIWLGRRYL PSLRNPEAKS LDLPGTCLVV LALVALMTPL SLGEQYGWPL WCWLSLAASP VLILAFLKLQ QAEERRGGSP LLPTDLWRDR AFRTGVVLFL LAFSGVVSFF LYYFTLIQTA YNVSTLWAAV TTIPVGIGTI ALSAASGRLV RAWGGRRVAS VGAIVCCFGA LSMFIPVVAV TDSSLALWSI PSQLVLGSGI GMLFAPLLSV VLAGIRSTHA GAAAGLLVTM QIAGGALGVS AMGVLFNSRL PGGSTDHASH GQLSSAMVHA MLYNPVSFLA ALLVILVLPR TVRSAGRAAG PKGTPAGAAG AAGAPGTPGA AHA
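The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand) and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the start codons it permits. The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the record are used.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the last four codons of the gene sequence.
first_blocks = "ATGACCACCT CCGCCGCCGT CGCGGATCCG"
last_codons = "GCTCATGCCTGA"
print(translate(first_blocks))
print(translate(last_codons))
```

The two fragments translate to the first ten residues (MTTSAAVADP) and the last three residues (AHA) of the protein sequence listed above.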