Gene Sbal195_3377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal195_3377 
Symbol 
ID5755181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS195 
KingdomBacteria 
Replicon accessionNC_009997 
Strand
Start bp3992373 
End bp3993350 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content47% 
IMG OID641289710 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001555799 
Protein GI160876483 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000538903 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGATC TTTCTCCCTC TGATTTAAAA ACGATACTTC ATTCAAAGCG AGCTAATGTT 
TATTACCTCG AATACTGTCG AGTCATGCAG AAAGACGGTC GTGTGCTTTA CTTAACCGAA
GCAAAAAACG AAAACCAATA TTTCAATATC CCCATCGCCA ATACCACAGT CCTGTTATTG
GGCAATGGCA CGTCCATCAC CCAAGCGGCC ATGCGAATGC TGGCGCAGGC AGGTGTATTA
GTCGGCTTTT GTGGCGGTGG CGGCACGCCA CTTTATATGA CCTGCGAAGT GGAATGGCTA
ACGCCTCAGA GTGAATACCG GCCTACTGAA TATTTGCACG GTTGGATGCA GTTTTGGTTC
GATGATGAAA AACGATTACT CGCCGCGAAA ACTTTTCAGC AAGCCCGCAT CCAATTTATC
GAGCAAGTGT GGCAGAGGGA TCGCGAACTC AAAACGGAAG GCTTTATTTT TAAGGATCCG
GCAATCCAAG CCGCGCTTGA GACTTTTCAT GCCCGCACAG AGGTGGCAAC CAAACAGTCG
GACCTGCTAC TGACCGAAGC TCAGTTAACC AAAGTGCTGT ACAAGCACGC CGCCAATAAT
ACCCAACTCA AAGATTTTAC CCGTCAACAC CAAAGCACAG ATATCGCTAA TGACTTTTTA
AACCACGGCA ATTATTTAGC CTATGGACTT GCAGCTAGTT GCCTCTGGGT GTTGGGCATT
CCACATGGTT TTGCCGTGAT GCACGGGAAA ACTCGCCGCG GAGCATTAGT GTTTGATGTC
GCTGATCTCA TCAAAGACGC CATAGTGCTA CCGTGGGCTT TTGTTTGCGC CAAAGAGAAC
GCCACTGAAC AAGAATTTCG TCAGCAAGTA TTGCAGGCAT TTACCGATCA TAAAGCATTA
GATTTTATGT TTAATACGGT GAAAGACGTT GCACTTCAGG ACTACAGTGC TGAGCAAATT
GCAGCCCAAG GATTATAA
 
Protein sequence
MDDLSPSDLK TILHSKRANV YYLEYCRVMQ KDGRVLYLTE AKNENQYFNI PIANTTVLLL 
GNGTSITQAA MRMLAQAGVL VGFCGGGGTP LYMTCEVEWL TPQSEYRPTE YLHGWMQFWF
DDEKRLLAAK TFQQARIQFI EQVWQRDREL KTEGFIFKDP AIQAALETFH ARTEVATKQS
DLLLTEAQLT KVLYKHAANN TQLKDFTRQH QSTDIANDFL NHGNYLAYGL AASCLWVLGI
PHGFAVMHGK TRRGALVFDV ADLIKDAIVL PWAFVCAKEN ATEQEFRQQV LQAFTDHKAL
DFMFNTVKDV ALQDYSAEQI AAQGL