Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0344 |
Symbol | |
ID | 6973738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 387014 |
End bp | 387907 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643389876 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002274755 |
Protein GI | 209542526 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0826483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0182764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATGGC GAGGCGTGCA TATCTCCCAC CCCTCCCGGT TGACGCATCG GAATCGGCAG CTCGTTGTTG CTCAGGATGG TGGCGAGGTA TCATTGGCGG TGGAGGACAT CGCGTGCCTT ATCCTCGATA CGCGACAAGT GAGCATCACC GGGTCTCTTC TCTCTGCGCT TGCAGAAAAT GGCGTTGCCA TGATCGTGCC CGATGCCAGG CATCATCCTG CCGGTATCCT GCTGCCTTTT CACCAGCATC ATGCCCAGGC GCACATAGCA CATGCCCAGA TCTCGATCAG CCAACCATTG AAGAAGCGCC TGTGGCAGAC ATTGGTCGTC GCCAAGATAC GTAATCAGGC TGCACTACTG GACCAACTCG GCCGGCCGCA AGGACAAACG ATTGCAGCAA TGGCTGGACG GGTCGCTTCC GGCGATCCGG GCAATGTGGA AGCACAGGCG GCCCGAGCTT ACTGGGCGAG CCTGTTTTCG GATTTTACAC GCGCAAACGA GAATGATCGT CGTAATGCGT TGCTTAACTA TGGTTATGCG ATCATGCGAG CCGCGATTGC ACGCGCATGC GTGGCGCTGG GATTGCTCCC AGCTTTCGGG GTACATCACG CATCGAAAAC CAATGCGTTC AATCTCGTCG ACGATCTGAT CGAGCCGTTC CGCCCCTTTG TGGACCGCAT GGCGCATGAC CGGGCTTTGG AACATGTAGG GGACACGCTG TCTATCGAGG ATCGCCGTCA AATGTCGACG ATCCTCAATG ACAATGCGGC CATCGGTCGC GAGCGAATGA CCGTCCTGGC CGCAACCGAA GCGGTAGCCA TGTCCGTGGT GCGCGCCATC GAGCATGGCA GTGCCGCGCT TCTCTCGACT CCAACTCTGA AAGCCCGGGA TTGA
|
Protein sequence | MAWRGVHISH PSRLTHRNRQ LVVAQDGGEV SLAVEDIACL ILDTRQVSIT GSLLSALAEN GVAMIVPDAR HHPAGILLPF HQHHAQAHIA HAQISISQPL KKRLWQTLVV AKIRNQAALL DQLGRPQGQT IAAMAGRVAS GDPGNVEAQA ARAYWASLFS DFTRANENDR RNALLNYGYA IMRAAIARAC VALGLLPAFG VHHASKTNAF NLVDDLIEPF RPFVDRMAHD RALEHVGDTL SIEDRRQMST ILNDNAAIGR ERMTVLAATE AVAMSVVRAI EHGSAALLST PTLKARD
|
| |