Gene Information |
Locus tag | Caci_3907 |
Symbol | |
ID | 8335260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 4430839 |
End bp | 4432038 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644957033 |
Product | CRISPR-associated protein, Cse4 family |
Protein accession | YP_003114636 |
Protein GI | 256393072 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 |
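
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive (consistent with the values in this record) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caci_3907.
length = gene_length(4430839, 4432038)
print(length, protein_length(length))  # prints: 1200 399
```

Both results agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields listed in the record.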
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000018703 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.192043 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGCG TCATCCTCGA TATCCACATC CTGCAGACCG TCCCGCCGAG CAACCTGAAC CGAGACGACA CCGGCTCCCC GAAAACCGCC GTCTATGGCG GTGTGCGCCG GGCACGTGTC TCCAGCCAGG CCTGGAAGCG CGCCACCCGT CAAGCATTCG GAGATCTGCT GGACCCGTCC GAACTGGGCG TGCGGACCAA GCGCGTCGCC GAACAGATCG CCAACCGGAT GACCGCGCTG GAGCCGTCCC TGTCCCCCGG CGATGCAGTG GCTGTCGCAG TCGAGGTGAT CAAGGCTGCG ACGGGGGCCA AAAGCGAGGT CCCAAAGCGG AAGTCAGCAG CAGTCAAAAG CGATCAGGAT GCTACGGCCG CACTCCCGGA GACCGGCTAC CTGATGTTCC TCAGCGAAAG CCAGCTCAAC AACCTGGCAC GCCTCGGGGT GGAAGGCTCC AAGGACATCA CGGCCTTCCT GAAAGACAAA GACTTCAAGA ACCGGGTCCG GCAAGCCGCC GACACGCGCC ACTCGGTCGA CATCGCGCTG TTCGGCCGCA TGGTCGCCGA CGCCACGGAC ATCAACGTCG ACGCCGCAGC ACAAGTCGCA CACGCAATCA GCGTGCACGC CGTGGAGAAC GAGTCGGACT ACTTCACCGC CGTCGACGAC CGTAGCACCG AGGCCGAGCC TGGGGCCGGC ATGATTGGGA TCGTCGACTT CAACGCGGCA ACGCTTTACC GATATGCGGC AGTCGATGTG AACCGGCTGG CCGACAACCT CGGTGCCGGG CTACTTGAAG GTGAGTCTCA GACCGAGCCC GTGCGGCGTG CTGTCGAGGC CTTCATCCGG GGATTCGCAC TGTCGATGCC GACCGGGAAA GTCAACACGT TCGGCAACCA CACAGTCCCC GACGTGGTCC TGGTCAAGCT ACGCGCCTCA CGCCCAATCA GCTTCGCCGC CGCATTCGAG GAAGCCATCA GCGCCGGCGA ACACCAGGGC GGGTATCTCA AAGGGGCATG CGAGCGTTTG GCCAGCTACA TCCCGAAGCT CGAGCAGGCC TACGACCTGC AGGAGGGTAC TGATTCCTGG GTCGTCTGCG CGGGTTCGGC AACAGAAGCC CTTGAGCAGG CCGGCGATCC GGTGTCGATC AGCCAGCTTG TCGCCGCTGT CGGGGCTGCA GTCACGACTC GGCTGGCATC TGACGCATGA
|
Protein sequence | MTRVILDIHI LQTVPPSNLN RDDTGSPKTA VYGGVRRARV SSQAWKRATR QAFGDLLDPS ELGVRTKRVA EQIANRMTAL EPSLSPGDAV AVAVEVIKAA TGAKSEVPKR KSAAVKSDQD ATAALPETGY LMFLSESQLN NLARLGVEGS KDITAFLKDK DFKNRVRQAA DTRHSVDIAL FGRMVADATD INVDAAAQVA HAISVHAVEN ESDYFTAVDD RSTEAEPGAG MIGIVDFNAA TLYRYAAVDV NRLADNLGAG LLEGESQTEP VRRAVEAFIR GFALSMPTGK VNTFGNHTVP DVVLVKLRAS RPISFAAAFE EAISAGEHQG GYLKGACERL ASYIPKLEQA YDLQEGTDSW VVCAGSATEA LEQAGDPVSI SQLVAAVGAA VTTRLASDA
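
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the pipeline used by the database: it assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGACTCGCG TCATCCTCGA TATCCACATC"))  # prints: MTRVILDIHI
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow the GC content and Protein Length fields to be checked.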