Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1633 |
Symbol | |
ID | 6975049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 1813275 |
End bp | 1814315 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643391169 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002276026 |
Protein GI | 209543797 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.599733 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGCC ACCTCAATAC CATCTATGTG ACGACCGAGG ATGCCTGGCT GCGCAAGGAT GGCGCCAATG TGGTGGTCGA AGTCGAGGGC CAGGAGCGCG GTCGGGCCCC CATGCATCTG CTGGACGGGA TCATGTGCTT TGGCCGGATC GGTGCGTCAC CGGCGCTGAT GGGGGCCTGC GCGGAATCGG GCCTCGCGCT GTCATGGCTC TCCCCACAGG GGCGCTTCCT CGCCCGGGTC GAGGGACCAA GAAGCGGCAA CGTGTTGCTG CGACGAACCC AGTATCGCAC GGCAGACAGT ACCGAACGCG CCTTGCCGAT CGTGCGAAAC ATGGTCGCCG CCAAGGCCGC CAATCAACGC ACCGTCATCC GCCGGGCCCT GCGCGACCAC GGCCCTGGAC TGAATGCGCA GGAGAGGACC CGACTGGACG AAGCCGAACG GCGCCTGACC ATTATCGTGC GCCATATCAC CCAGGCCCCG GATGTGGCCA CCGCACGGGG TATGGAGGGC GAGGCGGCGC AAGTCTATTT CGGAATCTTC AATCTTCTGA TTCGGGTCGA AGGCGATGCA TTCACCTTCG GTGGCCGGTC GCGGCGGCCG CCGCTCGACC GCGTGAATGC CCTGCTCTCG TTTCTCTATG CCGTTCTGGG ACATGATTGC CGCTCCGCGC TCGAAAGCGT CGGTCTGGAC CCGCAGGTCG GTTTCCTGCA TGCGGATCGG CCGGGACGAA ATAGCCTGGC ACTGGACATG ATGGAGGAAT TGCGCCCCGT TCTGGCTGAC CGACTGGCCG TGAGCCTGAT CAATCGACGG CAGGTCGCGG CAGGTGATTT TACCGTCAGC GATGGAGGCG CCGTCCTGCT GAACGACGAC GCCCGCAAGA CGGTTCTCGT AGCCTGGCAG GACCGCAAGA AGAAGGAATT GCGGCACCCA TTCCTTAATG AGTCCCTGAC GTTCGGCCTT GTCGCACATG TCCAAGCGCT GCTTCTGTCC CGACACCTTC GTGGCGAACT GGACGGTTAC CCAGCATTCG TGTGGAAATA G
|
Protein sequence | MRRHLNTIYV TTEDAWLRKD GANVVVEVEG QERGRAPMHL LDGIMCFGRI GASPALMGAC AESGLALSWL SPQGRFLARV EGPRSGNVLL RRTQYRTADS TERALPIVRN MVAAKAANQR TVIRRALRDH GPGLNAQERT RLDEAERRLT IIVRHITQAP DVATARGMEG EAAQVYFGIF NLLIRVEGDA FTFGGRSRRP PLDRVNALLS FLYAVLGHDC RSALESVGLD PQVGFLHADR PGRNSLALDM MEELRPVLAD RLAVSLINRR QVAAGDFTVS DGGAVLLNDD ARKTVLVAWQ DRKKKELRHP FLNESLTFGL VAHVQALLLS RHLRGELDGY PAFVWK
|
| |