Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7524 |
Symbol | |
ID | 8338894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 8716333 |
End bp | 8717844 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644960603 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003118190 |
Protein GI | 256396626 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.47136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGATG TCCTCGACCG GTCCACGCCG GCGGCACCCG AAGGTCCGGG CGGCACGGCT ACCACCGCGA ACATCACCGA CTTCTCGACG ACTTCCGGCT TGTCCGCCCG GGCCAAGCTT GTTCTCTTCC TGCTCTGCGC GGCGAACTTC ATGGTCGCCG TCGACTTCTC CATCCTGAAC ATCGCCGTCC CCAGCATCGG CAAGGACCTG CACATCGCCG ACGCGAACCT GCAGTGGATC GCCACCGCCT TCGCGCTGCC CTCCGGCGGC CTGCTGCTGT TGTCCGGCCG CGTCGGCGAC CTCGTCGGAC GCAAGAAGGT CTTCATCACC GGCACCATCT TGTTCACCTC GGCCAGCGTG ATCGCCGCCA TCGCGTGGGT CCCCGCGGTC CTGCTCGCCG GCCGCGCCCT GCAAGGCATC GGCGCGGCGA TGATCGTGCC GACCGGCATG GCGCTGCTGA CCACCTCCTT CCCCGAGGGC CCGCAGCGAG AGCGCGCCCT GGGCATCAAC GGCACGCTGA TGACCGTCGG CTTCACCGCC GGCATGGTCC TCGGCGGCGT GCTGACCCAG GCGCTGTCCT GGCGCTCCAC GATGGTGCTG AACACCGTGA TGGGAGCCGT CGTCCTGCTC GGCGCGCCGC GGCTGCTCAC CGAGAGCCGC AACCCGCACG CCTCCCGCCT TGACGTCCCC GGCGCCGCGA CCGTCACCAC CGCGCTGCTG GCGCTGATCT ACGCGCTGTC GACCGCCGCG CAGGCCGGCT TCGGACGCCC CGACGTCGTC ATCGGCCTGG TCGCCGGCGT GGCACTGCTC GCGGCCTTCG TCTTCATCGA GTCCCGAGCC GCCGAACCGC TGGTGTCGCT GCGCATCCTG CGCCGCCGCA ACGTCGCGAT CGGCAACATC GGCGGGCTGA TCACCTTCGC CTGCATGAGC GCGGTCGTCT TCCTCGGCAC GCTGTTCCTG CAACAAGTGG AGGGCATGTC CCCGACCCTG ACCGGTCTGG TCTTCGCGGT CATGGGGGTC GCGGCAGCGC TCGGCGGCAT GATCGCCCCA CGGCTCATCG GCCGCTACGG CGCCCGCACC ACCCTCGTCG GCGGACTGAT CTTCCAGGGC GCGCTGATCC TCCCGCTCGC CCTGATCGCC CCCGGCAACG GAACCGTCCT GCTGTTGACG ATCGGCGCCG TCGCCGCCTT CGGCCACCTC GCCGCGGTCG TCTCGTACGG CGTCACCGCG ACCTCCGGCC TCGGCGACAC CGAGCAAGGT CTGGCGACCG GTCTGGTCAC CACCTCGCAG CAGGTCGGTC TGACGCTCGG CATCCCGCTG CTGTCCGCCG TGGCCTCGGC GCGCAGCGAC AGCCTGCGCA CCGCCGGGCA CAGCGCCAAG GACGCGCTCA CCAGCGGCAT CCAGCTCAGC ATGGGCGCCG ACGGCGTGGT CGTGCTCGTC GCCGCCGCGC TGGTCTGGTT CGGCCTGCGC GCCAAGACTG TGCGCGCTGA AACCGTGCGC AGCCAAGGCT GA
|
Protein sequence | MTDVLDRSTP AAPEGPGGTA TTANITDFST TSGLSARAKL VLFLLCAANF MVAVDFSILN IAVPSIGKDL HIADANLQWI ATAFALPSGG LLLLSGRVGD LVGRKKVFIT GTILFTSASV IAAIAWVPAV LLAGRALQGI GAAMIVPTGM ALLTTSFPEG PQRERALGIN GTLMTVGFTA GMVLGGVLTQ ALSWRSTMVL NTVMGAVVLL GAPRLLTESR NPHASRLDVP GAATVTTALL ALIYALSTAA QAGFGRPDVV IGLVAGVALL AAFVFIESRA AEPLVSLRIL RRRNVAIGNI GGLITFACMS AVVFLGTLFL QQVEGMSPTL TGLVFAVMGV AAALGGMIAP RLIGRYGART TLVGGLIFQG ALILPLALIA PGNGTVLLLT IGAVAAFGHL AAVVSYGVTA TSGLGDTEQG LATGLVTTSQ QVGLTLGIPL LSAVASARSD SLRTAGHSAK DALTSGIQLS MGADGVVVLV AAALVWFGLR AKTVRAETVR SQG
|
| |