Gene Information |
Locus tag | Caci_8543 |
Symbol | |
ID | 8339923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 9915279 |
End bp | 9916556 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644961627 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_003119204 |
Protein GI | 256397640 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
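The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Consistency check for the coordinate and length fields of Caci_8543.
# Assumption: Start bp and End bp are 1-based and inclusive.
start_bp = 9915279
end_bp = 9916556

# An inclusive interval contains (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1278
print(remainder)       # 0
print(protein_length)  # 425
```

Both derived values agree with the Gene Length (1278 bp) and Protein Length (425 aa) rows of the record.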
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00367387 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000144931 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCGAAACA AGAGCTTATG GGATGCGTTG CTCGGTTTGG CGGATGCTGT GATCGAGTCC GTGGATGTCG ATGAGACCTC GGGGCAGATC GTGGTCCAGG TCCGGATCAA AGCGGCGGCT TCCCGGCGGT GCGGGCGCTG TCTGCGTCGC TGCCCGCGGT ATGACCAGGG TACGAGCCGG CGGCGGTGGC GGCACATGGA TGCCGGGGTG TTGATGGTGT GGATCGTGGC CGAGGCGCCC CGGGTGTCCT GCCGCGATTG TGGTGTGGTG ACCTGCCACA TTCCGTGGGC GCGGGCCGGG GCCGGACACA CCGCCGACTT CGATCATCAC ATCGCGTGGC TGGCAACTAA CGCCTCCAAA ACCACGGTGG CCACTTTGGC ACGGATCGCC TGGCGCACGG TGGGGGCGAT CATCACCCGG GTGTGGGCCG ACATCGAGGC CTGCACCGAC CGGTTCGCGA ACCTGTCCCG GATCGGCATC GATGAGATCT CCTACAAGAA GGGCCAGCGC TACCTCACCA TGGTGGTGGA CCACGACACT GGGCGGCTGT TGTGGGCCGG GGTCGGCCGG GACACCGAGA CGGTGGATGA GTTCTTCGAT GCCTTGGGTC CTGAACGCTG CGTGGCGATC ACGCACGTGT CCGCCGACGC CGCCCCATGG ATCGCCAAAT CGGTCCGCAC CCACTGCCCC GGCGCGATCC GCTGCGCCGA TGCCTTCCAT GTGGTGGCCT GGGCCACCAA AGCCCTAGAC ACGGTGCGCC GCCAGGCCTG GAACACAGCC GCCGGCCGCG CCCGCGACAC CAACCGCACC GGCCGCAACT CCACCGGTGC AGCCCGCGTC TTGAAAGGCG CCCGCTGGGC CTTGTGGAAG AACCCTGAAG ACCTCACCAC GAGTCAACAC CACAAACTCA CCTGGATCGC CAAAACCGAC CCGGCCCTGT GGCGCGCCTA CCTGCTCAAA GAAGGACTAC GCCACGTCTT CAAAATCAAA GGCGCCGCCG GCAAAACCGC CCTGGACCGA TGGCTGTCCT GGGCAGTGCG CTGCCGCATC CCTGCGTTCG TGGAGCTGGC CTCCACCATC AAACGCAACC GCACCGAGAT CGACAACGCC CTCGACCACA ACCTATCCAA CGCATTGATC GAATCCATAA ACACCAAAAT CCGACGCATC ACCCGCACCG CCTACGGCTT CACCAACCCC GAAGCCCTCA TCGCCCTCGC CCTGCTCGCC CACAGCGGCC ACCGACCCCA ACTACCCGGC CGAACAACCC ACGGATAA
|
Protein sequence | MRNKSLWDAL LGLADAVIES VDVDETSGQI VVQVRIKAAA SRRCGRCLRR CPRYDQGTSR RRWRHMDAGV LMVWIVAEAP RVSCRDCGVV TCHIPWARAG AGHTADFDHH IAWLATNASK TTVATLARIA WRTVGAIITR VWADIEACTD RFANLSRIGI DEISYKKGQR YLTMVVDHDT GRLLWAGVGR DTETVDEFFD ALGPERCVAI THVSADAAPW IAKSVRTHCP GAIRCADAFH VVAWATKALD TVRRQAWNTA AGRARDTNRT GRNSTGAARV LKGARWALWK NPEDLTTSQH HKLTWIAKTD PALWRAYLLK EGLRHVFKIK GAAGKTALDR WLSWAVRCRI PAFVELASTI KRNRTEIDNA LDHNLSNALI ESINTKIRRI TRTAYGFTNP EALIALALLA HSGHRPQLPG RTTHG
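The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes that the gene sequence is listed in coding orientation (it begins with GTG and ends with TAA, although the gene lies on the minus strand), that translation table 11 assigns the same amino acids to codons as the standard code, and that an alternative initiation codon such as GTG is read as methionine at the first position. The function name `translate` and the start-codon set are illustrative choices, not part of the record.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering. Translation table 11
# shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIM"
               "TTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons for table 11; each one is read as
# methionine when it occupies the first codon position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("GTGCGAAACA AGAGCTTATG GGATGCGTTG"))  # MRNKSLWDAL
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function is one way to compare the whole translation with the Protein sequence row.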