Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sputcn32_1819 |
Symbol | |
ID | 5079186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella putrefaciens CN-32 |
Kingdom | Bacteria |
Replicon accession | NC_009438 |
Strand | + |
Start bp | 2070103 |
End bp | 2071080 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640498969 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001183341 |
Protein GI | 146292917 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGATT TTAGCCCATC GGATCTAAAA ACCATTTTGC ACTCAAAACG CGCGAACATG TATTACCTCG AATACTGTCG AGTGATGCAA AAAGATGGCA GGGTGCTGTA TTTGACCGAG GCAAAAAACG AAAACCAATA TTTTAATATC CCTATCGCCA ACACCACAGT CCTATTATTA GGCAATGGCA CGTCCATCAC CCAAGCGGCC ATGCGAATGC TAGCGCAGGC GGGGGTGTTA GTCGGTTTTT GTGGCGGTGG TGGCACGCCA CTTTATATGA CCTGTGAAGT GGAATGGCTG ACACCACAGA GTGAATATCG GCCCACTGAA TATTTACACG GCTGGATGCA ATTTTGGTTT GATGATGAAA AACGATTACT CGCCGCGAAA ACCTTCCAGC AAGCCCGCAT TCAATTTATC GAGCAAGTGT GGCAGAGGGA TCGCGAGCTC AAAACGGAAG GCTTTATCTT TAAGGATCCG GCAATCCAAG CCGCGCTCGA GACGTTTCAT GCCCGCACTG AGGCGGCAAC CAAACAGTCG GATCTGCTAC TGACCGAAGC CCAGTTAACC AAAGTGCTGT ACAAGCACGC CGCCAATAAT ACCCAACTCA AAGATTTTAC CCGCCAACAC CAAAGCGCAG ATATCGCGAA CGACTTTTTA AACCACGGCA ACTATTTAGC CTATGGACTC GCCGCCAGTT GTCTCTGGGT TCTGGGCATT CCCCACGGTT TTGCCGTGAT GCACGGGAAA ACTCGTCGCG GAGCATTAGT GTTTGATGTC GCTGATCTCA TCAAAGACGC CATCGTATTA CCCTGGGCAT TCGTCTGCGC CAAAGAAAAG GCCAGCGAAC AAGAGTTTCG CCAGCAAGTG TTACAAGCCT TTACCGACCA TAACGCTTTA GATTTTATGT TCAATACGGT GAAAGGCATT GCACTGCAGG ACTACAGTGC CGAGCAAATT GCAGCCCAAG GATTATAA
|
Protein sequence | MDDFSPSDLK TILHSKRANM YYLEYCRVMQ KDGRVLYLTE AKNENQYFNI PIANTTVLLL GNGTSITQAA MRMLAQAGVL VGFCGGGGTP LYMTCEVEWL TPQSEYRPTE YLHGWMQFWF DDEKRLLAAK TFQQARIQFI EQVWQRDREL KTEGFIFKDP AIQAALETFH ARTEAATKQS DLLLTEAQLT KVLYKHAANN TQLKDFTRQH QSADIANDFL NHGNYLAYGL AASCLWVLGI PHGFAVMHGK TRRGALVFDV ADLIKDAIVL PWAFVCAKEK ASEQEFRQQV LQAFTDHNAL DFMFNTVKGI ALQDYSAEQI AAQGL
|
| |