Gene Shew185_3241 details


Gene Information

Locus tag: Shew185_3241
Symbol:
ID: 5370123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS185
Kingdom: Bacteria
Replicon accession: NC_009665
Strand:
Start bp: 3873297
End bp: 3874274
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 47%
IMG OID: 640831491
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001367431
Protein GI: 153001750
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype
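
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. Neither assumption is stated in the record, but both agree with the reported values.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3873297, 3874274)
print(length)                     # 978
print(protein_length_aa(length))  # 325
```

The length calculation does not depend on the strand, which the record leaves blank.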


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGATC TTTCTCCCTC TGATTTAAAA ACGATACTTC ATTCAAAGCG AGCTAATGTT 
TATTACCTCG AATACTGTCG AGTCATGCAG AAAGACGGTC GTGTGCTTTA CTTAACCGAA
GCAAAAAACG AAAACCAATA TTTCAATATC CCCATCGCCA ATACCACAGT CCTGTTATTG
GGCAATGGCA CGTCCATCAC CCAAGCGGCC ATGCGAATGC TGGCGCAGGC AGGTGTATTA
GTCGGCTTTT GTGGCGGTGG CGGCACGCCA CTTTATATGA CCTGCGAAGT GGAATGGCTA
ACGCCTCAGA GTGAATACCG GCCTACTGAA TATTTGCACG GTTGGATGCA GTTTTGGTTC
GATGATGAAA AACGATTACT CGCCGCGAAA ACTTTTCAGC AAGCCCGCAT CCAATTTATC
GAGCAAGTGT GGCAGAGGGA TCGCGAACTC AAAACGGAAG GCTTTATTTT TAAGGATCCG
GCAATCCAAG CCGCGCTTGA GACTTTTCAT GCCCGCACAG AGGTGGCAAC CAAACAGTCG
GACCTGCTAC TGACCGAAGC TCAGTTAACC AAAGTGCTGT ACAAGCACGC CGCCAATAAT
ACCCAACTCA AAGATTTTAC CCGTCAACAC CAAAGCACAG ATATCGCTAA TGACTTTTTA
AACCACGGCA ATTATTTAGC CTATGGACTT GCAGCTAGTT GCCTCTGGGT GTTGGGCATT
CCACATGGTT TTGCCGTGAT GCACGGGAAA ACTCGCCGCG GAGCATTAGT GTTTGATGTC
GCTGATCTCA TCAAAGACGC CATAGTGCTA CCGTGGGCTT TTGTTTGCGC CAAAGAGAAC
GCCACTGAAC AAGAATTTCG TCAGCAAGTA TTGCAGGCAT TTACCGATCA TAAAGCATTA
GATTTTATGT TTAATACGGT TAAAGCCGTT GCACTCCAAG ACTACAGTGC TGAGCAAATT
GCAGCCCAAG GCTTATAA
 
Protein sequence
MDDLSPSDLK TILHSKRANV YYLEYCRVMQ KDGRVLYLTE AKNENQYFNI PIANTTVLLL 
GNGTSITQAA MRMLAQAGVL VGFCGGGGTP LYMTCEVEWL TPQSEYRPTE YLHGWMQFWF
DDEKRLLAAK TFQQARIQFI EQVWQRDREL KTEGFIFKDP AIQAALETFH ARTEVATKQS
DLLLTEAQLT KVLYKHAANN TQLKDFTRQH QSTDIANDFL NHGNYLAYGL AASCLWVLGI
PHGFAVMHGK TRRGALVFDV ADLIKDAIVL PWAFVCAKEN ATEQEFRQQV LQAFTDHKAL
DFMFNTVKAV ALQDYSAEQI AAQGL
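
As a rough illustration of how the two sequences relate, the following sketch strips the 10-residue block formatting, computes a GC fraction, and translates DNA using the codon assignments of NCBI translation table 11 (which match the standard code for amino acids and stop codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool. Only the first three blocks of the gene sequence are used in the demo; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order (first base varies
# slowest), paired with the amino-acid string used for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGGACGATC TTTCTCCCTC TGATTTAAAA"
print(translate(first_blocks))  # MDDLSPSDLK
```

This sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because the gene begins with ATG.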