Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_2223 |
Symbol | |
ID | 8333572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 2523091 |
End bp | 2524428 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644955377 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003112983 |
Protein GI | 256391419 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00013617 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.000156735 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGGGTCA GGGTGCTGCG GGTGCTGGAT GTTGGGGGGT TGCCGCGGGT TTTTTGGTGG GTGTGGGTTA GTACGTTGGT GGCGCGGACG GGGGCTTTTG TTGCGCCGTT TCTTTCCTAC TACCTGACGC GGTCGCTGGG GCATTCGGCG GCTTTTGCGG GGTTCGTTGC CGCGTTGAAT GCGGGCGGGG CGGCTGTTTC GGCGGTGGTG GGGGGTGTGC TGGCTGATCG GGTGGGGCGG CGGGGGACGT TGCTGGGGGC GCTGGTGGCG TCGGCGGTGA CGTTGGTGGC GCTTGGGTCG GTGCACTCGG TGGGGTTGAT TGCGGTGTTG GCGTTCCTTG CGGGGTTGGC CAACAATGCG ACGCGGCCGG CGACGGGGGC GATCATCGCG GACATCGTGC CGTCGGGGGA TCGGGGGCGG GCGTATGCGC TGAACTACTG GGCGATCAAC TTGGGGTTCG CGGTCGCGAT GCTGTCTGCG GGGGCGGTGG CGTCGCACGG GTATTCGCTG CTGTTCATGG GTGATGCGAT CGCGAATGTG GGGTGCGCGG TCGTCGTCTT CTTCACGGTG CCGGAGACGC GGCCGACTTC TGTTGGTGTC GCCGGCGCAG GGCATGCCGG TCAGCCCGAG CGTGCGGGCA CTCTCGTGGA TGTGCTGCGC GACCGGATCT TCTTGGGGTT CCTGGGGGCG GTGCTGGTCG GGGCGGTCAT CTATTCGCAG GCTCAGACCG TGCAGCCGAT CATGATGGGT CAGGACGGTC TTGGGCCTGG TGCATATGGC GCGGTCGCGG CGCTCAATGG GATCCTCATC GGTGTTCTGC AGTTGCCGAT GACGTCGTGG ATGCGGCGAT ACACCCATGG GTCGGTGCTG GCGGCCTCGT CGTTTCTGAT GGGCGCCGGG TTCGCTGTGC CTTTGCTGAT CTCGGCGGTG GGGCACCCGA TGGGGGTCTA TGCCGGCTCG GTGGTGGTGT GGACGATCGC GGAGATCGGG AGCACGCCGC CGCAGATGGC GCTGGGTGCG GATCTGGCGC CGGCGCATCT GCGCGGGAGG TATCAGGGGA TGTCGACGCT GGCGTGGAGT GTGGCCGGCA TCGTCGGTCC GTTGGTGGGC GGCTGGGCAC TGACGGCGAT CGGTGCTTCG GCCGTGTTGT GGGCGAGCCT GCTGCTCGGC GCGGCAGGTG TGCCGGCGTG GGTGATGCTC GACCGCCGAT CGAGAACACG AGTGGCCACG TTGCGCGCCG CCGAAGCGCA CTGGGAACCG GTGTTGTCCG CCATCGCGTC ACCCGAAGTG GTGTCGGCCG GTGTGCCGAC GTCCGAGCCG GAGCCGGAAC CGGTGTGA
|
Protein sequence | MRVRVLRVLD VGGLPRVFWW VWVSTLVART GAFVAPFLSY YLTRSLGHSA AFAGFVAALN AGGAAVSAVV GGVLADRVGR RGTLLGALVA SAVTLVALGS VHSVGLIAVL AFLAGLANNA TRPATGAIIA DIVPSGDRGR AYALNYWAIN LGFAVAMLSA GAVASHGYSL LFMGDAIANV GCAVVVFFTV PETRPTSVGV AGAGHAGQPE RAGTLVDVLR DRIFLGFLGA VLVGAVIYSQ AQTVQPIMMG QDGLGPGAYG AVAALNGILI GVLQLPMTSW MRRYTHGSVL AASSFLMGAG FAVPLLISAV GHPMGVYAGS VVVWTIAEIG STPPQMALGA DLAPAHLRGR YQGMSTLAWS VAGIVGPLVG GWALTAIGAS AVLWASLLLG AAGVPAWVML DRRSRTRVAT LRAAEAHWEP VLSAIASPEV VSAGVPTSEP EPEPV
|
| |