Gene Sputcn32_1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSputcn32_1819 
Symbol 
ID5079186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella putrefaciens CN-32 
KingdomBacteria 
Replicon accessionNC_009438 
Strand
Start bp2070103 
End bp2071080 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content49% 
IMG OID640498969 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001183341 
Protein GI146292917 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGATT TTAGCCCATC GGATCTAAAA ACCATTTTGC ACTCAAAACG CGCGAACATG 
TATTACCTCG AATACTGTCG AGTGATGCAA AAAGATGGCA GGGTGCTGTA TTTGACCGAG
GCAAAAAACG AAAACCAATA TTTTAATATC CCTATCGCCA ACACCACAGT CCTATTATTA
GGCAATGGCA CGTCCATCAC CCAAGCGGCC ATGCGAATGC TAGCGCAGGC GGGGGTGTTA
GTCGGTTTTT GTGGCGGTGG TGGCACGCCA CTTTATATGA CCTGTGAAGT GGAATGGCTG
ACACCACAGA GTGAATATCG GCCCACTGAA TATTTACACG GCTGGATGCA ATTTTGGTTT
GATGATGAAA AACGATTACT CGCCGCGAAA ACCTTCCAGC AAGCCCGCAT TCAATTTATC
GAGCAAGTGT GGCAGAGGGA TCGCGAGCTC AAAACGGAAG GCTTTATCTT TAAGGATCCG
GCAATCCAAG CCGCGCTCGA GACGTTTCAT GCCCGCACTG AGGCGGCAAC CAAACAGTCG
GATCTGCTAC TGACCGAAGC CCAGTTAACC AAAGTGCTGT ACAAGCACGC CGCCAATAAT
ACCCAACTCA AAGATTTTAC CCGCCAACAC CAAAGCGCAG ATATCGCGAA CGACTTTTTA
AACCACGGCA ACTATTTAGC CTATGGACTC GCCGCCAGTT GTCTCTGGGT TCTGGGCATT
CCCCACGGTT TTGCCGTGAT GCACGGGAAA ACTCGTCGCG GAGCATTAGT GTTTGATGTC
GCTGATCTCA TCAAAGACGC CATCGTATTA CCCTGGGCAT TCGTCTGCGC CAAAGAAAAG
GCCAGCGAAC AAGAGTTTCG CCAGCAAGTG TTACAAGCCT TTACCGACCA TAACGCTTTA
GATTTTATGT TCAATACGGT GAAAGGCATT GCACTGCAGG ACTACAGTGC CGAGCAAATT
GCAGCCCAAG GATTATAA
 
Protein sequence
MDDFSPSDLK TILHSKRANM YYLEYCRVMQ KDGRVLYLTE AKNENQYFNI PIANTTVLLL 
GNGTSITQAA MRMLAQAGVL VGFCGGGGTP LYMTCEVEWL TPQSEYRPTE YLHGWMQFWF
DDEKRLLAAK TFQQARIQFI EQVWQRDREL KTEGFIFKDP AIQAALETFH ARTEAATKQS
DLLLTEAQLT KVLYKHAANN TQLKDFTRQH QSADIANDFL NHGNYLAYGL AASCLWVLGI
PHGFAVMHGK TRRGALVFDV ADLIKDAIVL PWAFVCAKEK ASEQEFRQQV LQAFTDHNAL
DFMFNTVKGI ALQDYSAEQI AAQGL