Gene Sbal223_4186 details

Gene Information

Locus tag: Sbal223_4186
Symbol:
ID: 7088509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 4970201
End bp: 4972306
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 41%
IMG OID: 643463061
Product: hypothetical protein
Protein accession: YP_002360076
Protein GI: 217975325
COG category:
COG ID:
TIGRFAM ID: [TIGR02565] CRISPR-associated protein, Csy2 family
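
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the stated 2106 bp) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = cds_length_bp(4970201, 4972306)
print(length, protein_length_aa(length))  # prints: 2106 701
```

Both values match the Gene Length and Protein Length fields reported for this gene.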


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAAGA ATTTACATTT GCGAGCACTG CTAGATATTG AAGACGTTCA AGTACGTACG 
GCAGCGTTGA GGCGTGCATT TGCTGCTTAT ACAGAACCGT TAGATGTGAC TGGCGAGGAA
AGTTCTACCC TAATTATTTT GCTTAATCTG ACCTATCCGA AAAAGTTGGT GGATGACTTA
TTAGATAAAA GGTTAGCCAG AAAAACTGTT AACAACCAAA CCCATATTGA TGTTTGCATT
GACGAAGTAC AATGGCTGCA TACCCATAAT CTCAAGTACC CAGATATTCG GGTCAGTAAA
CAAAGGCTCG TTGCCCCCAG TCCTCAATTA CATCCCAGCA TATTGAGTAG CGTTAATTGC
CAGCGTACTT TAGGGTGGTC TCATGACAGT GCTAAAGTCA ATTTTACCAA ACTGTTTGTT
TGTCATTTTA TTTGGCAAGG TAAAGTAACT TGCCTTGCAA AGTTAATGTG TGAAGCACCA
AAATACTGGA AAGAGGCTTT TCAAGAACTT GGCATGCCTG TTAAACAGTT TATGAATATT
TGCGGTAGGG TGAAGCATTT TTTACCTGAG CAGTTAAGCC CAGATAAAGT CGATAAATAC
TCCGTACAGG TTCGAATGCC ATACCGTGAT GGATATATAG CGGTTACCCC CGTTGTAAGC
CATGCACTTC AGTCTGAACT CCAGCAAGCA GCAATCAAAA AACAGGGCCG ATATACTGGT
CTTGAGTTTA TTCGTTCCGC GGCAGTCAGT GAGCTAGTCG CATCTCTTGG CGGTAATGTT
AAAGCTCTGA ATTATCCCCC TCGTATTAAT AAAAAAACTC ACGGATTAGG CGATAATTTA
CTGGCTAGGG TTTTATCTGG TCAGGATGTA TTGAATCAAA ATGTATTGTC ACAGCCTCGA
TTTGTGAGGG TATTGGATGA ACTGTTATCG AGTGAATCAG CGTTGGCGTT AAAGCAACGC
AGGCAACAAA AAGTCGCTAA CCTTAGACAG ATAAGAGCGA TGTTATCTGA GTGGCTTGCG
CCAATGTTGG AATGGCGATT AGAGGTTAAA GAGGGGCCGA TTCTTCTTAG CGAGCTTGAG
CCTATTAAGG GCTCATTAGA GTATCAGTTT TTAACGTTTG AGAATGAGCT ATTACCTGAA
TTGCTAGCGA ATCTGTTCGG CAGATTAAAT TCGGTGATGT CGCATTCTAC AATGATAAGT
CAGTATGCAT TTCACCAGAG ACTGATGCCC CTTCTTCGAA ATAGCTTGAA ATGGTTGTTA
ACTCATTTGG CTTATAAAGA ACCTTTGCTA CAAAAAGATG CTGAGGATGA GCATTTCCAG
CGATATTTAC ATTTGAAAGG TATTCGGGTG TTTGATGCCC AAGCTTTATC TAATCCCTAT
TGTGTTGGCA TTCCATCTCT CACAGCTGTT TGGGGAATGA TGCATAACTA CCAGCGGCGA
TTAAATGAAG GATTGGGTAC TCAGCTTAGA TTGACCTCCT TTTCGTGGTT TATTCGTCAG
TACTCATCTG TCTCGGGCAA AAAACTGCCT GAGTTCGGCA TGCAAGGAGC GAAAGAGAAT
CAATTTAGGC GAGCTGGAAT AATTGATAAT CGACATTGTG ATTTAGTGTT TGATCTTGTC
ATTCACATTG ATGGTTATGA AGACGATTTA GCTATCTTAG ATAATCAAAC TGAAATGTTG
AAAGCAAGTT TTCCGGCGAA CTTTGCCGGT GGAGTAATGC ATCCTCCCGA GTTAGATGCT
GTTGATGCGT GGTGCCAGCT TTATCAATCT GAAGCTGAAC TTTTCTCAAC ATTGAAAAGA
CTGCCTATGT CAGGAAAATG GATTATGCCG ACTAGATATT CAATGGGTAA TATTAGTGAT
CTTCTGTTTT TGTTGGATAA AAATTCATCG CTGTGTCCCA CTATGGTTGG CTATTTACCA
TTAACTGAAC CCGTTAGCCG AGTCGGTAGC CTAGAGCCTT TACATTGCTA TGCTGAACCA
GCGCTTGGCG TAGTTGAGTG TGCATCGGCA ATTGAGATAA GGTTACAGGG GCAAATAAAC
TATTTTAAGA GAGCGTTTTG GATGTTAGAG GCAAAAGAGC ATTCAATGCT GATGAAACGG
ATTTAA
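
The GC content field can be recomputed from the gene sequence as displayed. This is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above. The helper name `gc_percent` is hypothetical. The example call uses only the first three blocks of the sequence, so its result is not the whole-gene value.

```python
def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence above (not the full gene).
first_blocks = "ATGGTAAAGA ATTTACATTT GCGAGCACTG"
print(round(gc_percent(first_blocks), 1))
```

Passing the full 2106 bp sequence to the same function gives the whole-gene figure to compare with the reported 41%.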
 
Protein sequence
MVKNLHLRAL LDIEDVQVRT AALRRAFAAY TEPLDVTGEE SSTLIILLNL TYPKKLVDDL 
LDKRLARKTV NNQTHIDVCI DEVQWLHTHN LKYPDIRVSK QRLVAPSPQL HPSILSSVNC
QRTLGWSHDS AKVNFTKLFV CHFIWQGKVT CLAKLMCEAP KYWKEAFQEL GMPVKQFMNI
CGRVKHFLPE QLSPDKVDKY SVQVRMPYRD GYIAVTPVVS HALQSELQQA AIKKQGRYTG
LEFIRSAAVS ELVASLGGNV KALNYPPRIN KKTHGLGDNL LARVLSGQDV LNQNVLSQPR
FVRVLDELLS SESALALKQR RQQKVANLRQ IRAMLSEWLA PMLEWRLEVK EGPILLSELE
PIKGSLEYQF LTFENELLPE LLANLFGRLN SVMSHSTMIS QYAFHQRLMP LLRNSLKWLL
THLAYKEPLL QKDAEDEHFQ RYLHLKGIRV FDAQALSNPY CVGIPSLTAV WGMMHNYQRR
LNEGLGTQLR LTSFSWFIRQ YSSVSGKKLP EFGMQGAKEN QFRRAGIIDN RHCDLVFDLV
IHIDGYEDDL AILDNQTEML KASFPANFAG GVMHPPELDA VDAWCQLYQS EAELFSTLKR
LPMSGKWIMP TRYSMGNISD LLFLLDKNSS LCPTMVGYLP LTEPVSRVGS LEPLHCYAEP
ALGVVECASA IEIRLQGQIN YFKRAFWMLE AKEHSMLMKR I
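
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below uses the standard codon assignments, which translation table 11 shares for internal codons. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG. The function name `translate_cds` is illustrative only.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq_text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGGTAAAGA ATTTACATTT GCGAGCACTG CTAGATATTG AAGACGTTCA AGTACGTACG"
print(translate_cds(first_line))  # prints: MVKNLHLRALLDIEDVQVRT
```

The output matches the first 20 residues of the protein sequence listed above.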