Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_1478 |
Symbol | |
ID | 8332817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 1683165 |
End bp | 1684373 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644954626 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003112242 |
Protein GI | 256390678 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTGA TCAGCGATCC GAGCCGAGCC GAGAGCAGCG CAGAGGCCGG AGAAGCCACC GAGGTGATCT CGCGCGGAGT CCGCGGCCGC ATGGCGGTCT GGGGCGCGGC GCACGCGGTC GACGACATGT ATCAGGGCCT TGTCCCGGCC AGCGTCCCCT ACTTCGTTCT GGAGCGGCAC TACAGCTACC TGGCGGCCTC CGGACTCACG TTGGCTGCGA CGCTGGGCAG CGCGCTTCCG CAGCCGCTTG TCGGGCTCGC CGTGGACCGC TGGCGGCTGC CTCGCTTGGC TGCGGTGGGC GTCGCGGTCG CCGGGACCGG GCTCGGGCTG TCGGGGCTGG CGGGTCCGTA CGCACTGGTC TGGCTCTGCA TTCTGATATC AGGGTTCGGA GTCGCGATGT TCCACCCCGC GGCCGGCCGC TCGGCGCGCG AGGCCGCCGG GGACAGCGCC GCCGCGATGA GCGTCTTCGC CGCCGGCGGG AGCGTCGGCT TCTTTCTGGC ACCGGTCCTC GCGACTCCGG CTTTGAACGC GTGGGGCGTC GGCGCGACGG CGCTGTTCAT CCCGCCGGCA TGGCTCATCG CCTTGGTACT GCTCAAAAAG CGCGCCGGCA CCAGGCCATC TCACGCGGCG GCGAGTGGCG GCCAGGACCG TTGGGGACCG TTCGCGGCGC TGACCGGCAT CGAGGTGGTC CGCTCGGCAG CCTTCTTCGG CATCAACACC TTCATCGAGC TGTACTGGAT CAAGCACCTG CACGCCTCCC GCACCATGGC CGGCGCCGCC CTCGCCTGCT TCCTCATCGG CGGCGTCGCC GGCACCCTCC TTGGCGGCCG CATCGCCGAC CGCGTCGGCA TGGTCCGCAC CGTCCAACTC GGCGGCGCCC TCACCATCCC CATGCTGCTG GCCCTCCGCT TCAGCCCGGG CGCCGTCGCA CCGCTGGCCT TCGCAGTCCT CACCGGACTA GCCCTGAACA TCCCGTTCGC TGTCCTGGTG AAACTCGGCC AGGACTACCT CCCCGGCCGC CCCGGCACAG CCGCAGGCGT CACGCTCGGG CTGGGCGTCA GCGCGGGAGG GCTGATGGCA CCCGTCTTCG GCCTGATCGC CGAACACCGC GGCCCGCAGG GAGTGCTGAC GGCGCTGTGC GCGGTACCGA TCGCGGGGAT AGTGCTGGGG TTTCTGCTGC GGGAGCCTCA GAAGACTGAT GGATCGTGA
|
Protein sequence | MSLISDPSRA ESSAEAGEAT EVISRGVRGR MAVWGAAHAV DDMYQGLVPA SVPYFVLERH YSYLAASGLT LAATLGSALP QPLVGLAVDR WRLPRLAAVG VAVAGTGLGL SGLAGPYALV WLCILISGFG VAMFHPAAGR SAREAAGDSA AAMSVFAAGG SVGFFLAPVL ATPALNAWGV GATALFIPPA WLIALVLLKK RAGTRPSHAA ASGGQDRWGP FAALTGIEVV RSAAFFGINT FIELYWIKHL HASRTMAGAA LACFLIGGVA GTLLGGRIAD RVGMVRTVQL GGALTIPMLL ALRFSPGAVA PLAFAVLTGL ALNIPFAVLV KLGQDYLPGR PGTAAGVTLG LGVSAGGLMA PVFGLIAEHR GPQGVLTALC AVPIAGIVLG FLLREPQKTD GS
|
| |