Gene Information |
Locus tag | Veis_1231 |
Symbol | |
ID | 4691855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Verminephrobacter eiseniae EF01-2 |
Kingdom | Bacteria |
Replicon accession | NC_008786 |
Strand | + |
Start bp | 1369191 |
End bp | 1370120 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639849005 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_996019 |
Protein GI | 121608212 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
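The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not appear in the protein; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the untranslated stop codon


# Values copied from the record for Veis_1231.
length_bp = gene_length_bp(1369191, 1370120)
print(length_bp)                     # 930
print(protein_length_aa(length_bp))  # 309
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.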
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGGCC GCACCGTGGA AATCGCAGAC GACCGGCGGC ACCTGTTTGT CAATCGCGGT TTCATGGTGA TCAAGGACAC CGAGGGCGAG CGCAAGGAAC TGGGGCAAGT GCCGCTGGAC GACATTGCGG CGGTCATCGC CAATGCGCAT GGCCTCACCT ACACGAACAA TCTGCTGGTG GCCCTGGCCG AACGCGGCGC CCCCTTCGTG CTGTGCGGCC CCAACCACAA CGCCGTGGGC ATGCTGTTGC CACTCGACGG CCACCATGTG CAGGCCAAGC GCATAGAGGC GCAGATCGCA GCCGGCCTGC CCATGCACAA GCGCCTGTGG GCAGCCGTGG TCAAGTCCAA ACTGGAACAA CAGGCCGCAG CACTGGAGGC TGTGGCCGCA CCCACGGCGC CGCTACAGGC CCTGGTCGCC AAAGTGAGAA GCGGCGACCC GGAGAATATC GAAGGACAAG GTGCGCGCCG CTACTGGGGG CTGCTGTTTG GCGCGGAGTT TCGGCGCGAC CAGAGCGGCG ACGGCTTGAA TGCGCTGCTG AACTACGGCT ACACCATCGT GCGCTCTGCC TGCGCCCGCG CGGTGGTGGC TGCCGGCCTG CACCCCAGCA TTGGGCTGCA CCACAGCAAC GACGCCAACG CCATGCGGCT GGTGGACGAT TTGATGGAGC CTTTTCGCCC CATCGTGGAC CTGAAGGTAT GGCAGTTGCA CAAGGCGGGC GAATCGCACG TCACGCCAGA CACCAAGCGC GCCCTGGTGC GCGTGCTGTA TGACGACATG CAGACAGCGG CAGGGGTGAC GCCGGTGATG GTGTGCATGC AGCGCCTGGC GACATCGCTG GCGCAGGTGT ACCTGGGCGA ACGCGCCAAA CTGGATTTGC CCTTGCCGGG GCTGCCGCTG GCGCTGGCTG GCAGCTTCGA GCCAGATTGA
|
Protein sequence | MIGRTVEIAD DRRHLFVNRG FMVIKDTEGE RKELGQVPLD DIAAVIANAH GLTYTNNLLV ALAERGAPFV LCGPNHNAVG MLLPLDGHHV QAKRIEAQIA AGLPMHKRLW AAVVKSKLEQ QAAALEAVAA PTAPLQALVA KVRSGDPENI EGQGARRYWG LLFGAEFRRD QSGDGLNALL NYGYTIVRSA CARAVVAAGL HPSIGLHHSN DANAMRLVDD LMEPFRPIVD LKVWQLHKAG ESHVTPDTKR ALVRVLYDDM QTAAGVTPVM VCMQRLATSL AQVYLGERAK LDLPLPGLPL ALAGSFEPD
|
| |
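The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any analysis. The following is a minimal sketch, using only the Python standard library, of how the blocks could be joined, the GC fraction computed, and the DNA translated. It uses the standard amino-acid assignments, which NCBI translation table 11 shares with table 1; the alternative start codons that table 11 permits are ignored here, which does not matter for this gene because it begins with ATG. All function names are hypothetical.

```python
# Standard codon assignments in TCAG order (shared by NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGATAGGCC GCACCGTGGA AATCGCAGAC")
print(translate(prefix))  # MIGRTVEIAD
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the "GC content" and "Protein sequence" rows to be checked in the same way.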