Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_31580 |
Symbol | cas1 |
ID | 7762058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3266829 |
End bp | 3267869 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643806032 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002800296 |
Protein GI | 226945223 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGGC AACTCAACAC CCTGTATGTC ACCTCGGAAG GTGCCTGGCT GCGTAAGGAT GGCGCAAACA TCGTCATGGA AATCGCCGGT GAAATTCGCG GCCGGATTCC GGTACACATG CTGGAAAGCC TGGTCTGCAT CGGCCGGGTG CTAGTTTCCC CACCTTTGCT GGGCTATTGC GCCGAGCAGG GCATCCGCAT CAGTTACCTG ACCCCCAATG GCAAATTCCT AGCCCGCGTC GAAGGGCCGG TATCCGGTAA TGTGTTGCTG CGTCGCGAGC AGTACCGGCG CAGTGACGAT CCGGCAGGCA GTGCCGCCAT CGTCGCCAAT CTGCTGGTCG GCAAGGTGTA CAACCAGCGC GCCGTGATCG GCCGCGCCCT GCGTGATCAT GGCGACAAGC TGGCAGAGGA GGCGCGGATT GCCCTGAACA TCACCCACAA GCGGTTGATG CGGATTGCCG ACCGCTTGTT GCTGGAGCAG GACGTGGATG TGCTGCGTGG GTTGGAAGGG GAGGCAGCCC AGGCCTATTT CGGCGTGTTC GATCACCTGA TCCGTGTGCC GGATGCCTCC TTGCGCTTCA GTGGCCGCAG CCGCCGGCCA CCGCTGGATG CGGTCAATGC CCTGCTGTCG TTTCTCTACA CCCTGCTGAC CCACGACTGC CGTTCGGCAC TGGAAACGGT CGGCCTCGAT CCGGCGGTCG GCTACCTGCA CCGCGACCGG CCAGGCCGGC CGAGTCTGGC GCTGGATCTG GTCGAGGAGT TCCGTCCGGT ACTGGCCGAC CGGCTGGCGC TGTCGCTGCT CAACCGGAGG CAACTGTCCG CTGCCGACTT CCAGACGCTG GATAACGGTG CCGTGCTGCT GCGCGACGAA GCGCGCAAGC AGGTACTGAC AGCGTATCAG GAGCGCAAAC GCGAAGAGCT GCGGCATGCC TTTCTGGAAG AGCAGGCTCC GCTCGGTTTG TTCATGTTCA TCCAGGCGCA ACTGCTGGCC CGCCACCTGC GCGGCGATCT GGATGCCTAC CCACCCTTCA TCTGGAAGTG A
|
Protein sequence | MRRQLNTLYV TSEGAWLRKD GANIVMEIAG EIRGRIPVHM LESLVCIGRV LVSPPLLGYC AEQGIRISYL TPNGKFLARV EGPVSGNVLL RREQYRRSDD PAGSAAIVAN LLVGKVYNQR AVIGRALRDH GDKLAEEARI ALNITHKRLM RIADRLLLEQ DVDVLRGLEG EAAQAYFGVF DHLIRVPDAS LRFSGRSRRP PLDAVNALLS FLYTLLTHDC RSALETVGLD PAVGYLHRDR PGRPSLALDL VEEFRPVLAD RLALSLLNRR QLSAADFQTL DNGAVLLRDE ARKQVLTAYQ ERKREELRHA FLEEQAPLGL FMFIQAQLLA RHLRGDLDAY PPFIWK
|
| |