Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_3377 |
Symbol | |
ID | 5755181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 3992373 |
End bp | 3993350 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641289710 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001555799 |
Protein GI | 160876483 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000538903 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACGATC TTTCTCCCTC TGATTTAAAA ACGATACTTC ATTCAAAGCG AGCTAATGTT TATTACCTCG AATACTGTCG AGTCATGCAG AAAGACGGTC GTGTGCTTTA CTTAACCGAA GCAAAAAACG AAAACCAATA TTTCAATATC CCCATCGCCA ATACCACAGT CCTGTTATTG GGCAATGGCA CGTCCATCAC CCAAGCGGCC ATGCGAATGC TGGCGCAGGC AGGTGTATTA GTCGGCTTTT GTGGCGGTGG CGGCACGCCA CTTTATATGA CCTGCGAAGT GGAATGGCTA ACGCCTCAGA GTGAATACCG GCCTACTGAA TATTTGCACG GTTGGATGCA GTTTTGGTTC GATGATGAAA AACGATTACT CGCCGCGAAA ACTTTTCAGC AAGCCCGCAT CCAATTTATC GAGCAAGTGT GGCAGAGGGA TCGCGAACTC AAAACGGAAG GCTTTATTTT TAAGGATCCG GCAATCCAAG CCGCGCTTGA GACTTTTCAT GCCCGCACAG AGGTGGCAAC CAAACAGTCG GACCTGCTAC TGACCGAAGC TCAGTTAACC AAAGTGCTGT ACAAGCACGC CGCCAATAAT ACCCAACTCA AAGATTTTAC CCGTCAACAC CAAAGCACAG ATATCGCTAA TGACTTTTTA AACCACGGCA ATTATTTAGC CTATGGACTT GCAGCTAGTT GCCTCTGGGT GTTGGGCATT CCACATGGTT TTGCCGTGAT GCACGGGAAA ACTCGCCGCG GAGCATTAGT GTTTGATGTC GCTGATCTCA TCAAAGACGC CATAGTGCTA CCGTGGGCTT TTGTTTGCGC CAAAGAGAAC GCCACTGAAC AAGAATTTCG TCAGCAAGTA TTGCAGGCAT TTACCGATCA TAAAGCATTA GATTTTATGT TTAATACGGT GAAAGACGTT GCACTTCAGG ACTACAGTGC TGAGCAAATT GCAGCCCAAG GATTATAA
|
Protein sequence | MDDLSPSDLK TILHSKRANV YYLEYCRVMQ KDGRVLYLTE AKNENQYFNI PIANTTVLLL GNGTSITQAA MRMLAQAGVL VGFCGGGGTP LYMTCEVEWL TPQSEYRPTE YLHGWMQFWF DDEKRLLAAK TFQQARIQFI EQVWQRDREL KTEGFIFKDP AIQAALETFH ARTEVATKQS DLLLTEAQLT KVLYKHAANN TQLKDFTRQH QSTDIANDFL NHGNYLAYGL AASCLWVLGI PHGFAVMHGK TRRGALVFDV ADLIKDAIVL PWAFVCAKEN ATEQEFRQQV LQAFTDHKAL DFMFNTVKDV ALQDYSAEQI AAQGL
|
| |