Gene Shewana3_3854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_3854 
Symbol 
ID4480064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp4628251 
End bp4630356 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content42% 
IMG OID639728467 
Producthypothetical protein 
Protein accessionYP_871478 
Protein GI117922286 
COG category 
COG ID 
TIGRFAM ID[TIGR02565] CRISPR-associated protein, Csy2 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.753761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0836726 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAAGA ATTTACATTT GCGAGAACTG CTAGATATTG AAGATGTTCA AGCACGTTCG 
ACAGCGTTGA GGCGTGCATT TGCTGCTTAT ACAGAACCGT TAGATGTGAC TGGCGAGGAA
AGCTCTGCCC TAATTATTTT GCTTAATCTT ACCTATCCGA AAAAGTTGGT GGATGACTTA
CTCGATAAAA GGTTAGCTAG AACAACTGTT AACAACCAAA CCCATATTGA TGTTTGCATT
GACGAAGTAC AATGGCTGCA TACCCATAAT CTCAAGTACC CAGATATTCG GGTCAGTAAA
CAAAGGCTCG TTGCCTCCAG TCCTCAATTA CATCCCAATA TGTTGAGTAG TGCTAATTGC
CAACGTACTT TAGGGTGGTC TCATGACAGT GCGAAAGTCA ATTTTGCCAA GCTGTTTGGC
TGCCAGTTTA TTTGGCAAGA TAAAGTAACT TGCCTTGCAA CATTAATGTG TGAAGCACCA
AAACACTGGA AAGAGGCTTT TCAAGAACTT GGCATGCCTG TTAAACAGTT TATGAATATT
TGCGGCAGGG TGAAGCATGT TTTACCTGCG CAGTTAAGCC CAGATAAAGT CGATAAATAC
TCCGTACAGG TTCGAATGCC ATTCCGTGAT GGATATATAG CGATTACCCC CGTTGTAAGC
CATGCACTTC AGTCTGAACT CCAGCAAGCA GCAATCAAAA AACAGGGCCG ATATACCAGT
CTTGAGTTTA TTCGTCCTGC GGCAGTCAGT GAGCTAGTCG CATCTCTTGG CGGTAATGCT
AAAGCCCTGA ATTATCCGCC CCGTATTAAT AAAAAAACTC ACGGATTAGG CAACAATTTA
CTGGCTAGGG TTTTATCTGG TCACGATGTA TTGAATCAAA ATGTATTGTC ACAGCCTCGA
TTTATGAGGG TATTGGATGA ACTGTTATCG AGTGAGTCAG CATTGGCGTT AAAGCAACGC
AGGCAGCAAA AAGTCGCCAA TCTTAGACAG ATAAGAACGA TGTTATCTGA GTGGCTTGCG
CCAATGTTGG AATGGCGATT AGAGGTTAAA GAGGGGCCGA TTCTTCTTAG TGAGCTTGAG
CCTATTAATG GCTCATTAGA GTATCAGTTT TTAACCTTTG AGAATGAGCT ATTACCTGAA
TTGCTGCTGC CTCTGTTCGA CAGATTGAAT TCGGTGATGT CGCATTCAGC AATGATAAGT
CAGTATGCCT TTCATCAGAG ACTGATGCCC CTTCTTCGAA ATAGCTTGAA ATGGTTGTTA
ACTCATTTGG CTTATAAAGA ACCTTTGCTA CAAAATGATG CTGCAGATGA GCATTTCCAG
CGATATTTAC ATTTGAAAGG TATTCGAGTG TTTGATGCCC AAGCTTTATC TAATCCGTAT
TGTGTTGGCA TTCCTTCTCT CACAGCTGTT TGGGGAATGA TGCATAACTA CCAGCGGCGA
TTAAACGAAG GCTTGGGTAC TCAGCTTAGA TTCACCTCAT TTTCGTGGTT TATTCGTCAG
TACTCATCTG TCTCGGGCAA AAAACTGCCT GAGTACGGCA TGCAAGGATC GAAAGAGAAT
CAATTTAGAC GAGCTGGAAT AATTGATAAT CGACATTGTG ATTTAGTGTT TGATCTTGTC
ATTCATATCG ATGGCTATGA AGACGCTTTA GCGATCTTAG ATAATCGAAC TGAAATGTTG
AAAGCAAGTT TTCCGGCGAA CTTTGCCGGT GGAGTAATGC ATCCTCCCGA GTTAGATGCT
GTTGGTGCGT GGTGCCAGCT TTATCAATGT GAAGCTGAAC TTTTCTCAAC ATTGAAAAGA
CTGCCTATGT CAGGAAAATG GATTATGCCG ACTAGATATT CAATGGGTAA TGTTAATGAT
CTTCTGTTTT TGCTGGATAA AAATTCTTCG CTATGCCCCA CTATGTTTGG CTATTTACCA
TTGACTGAAC CCGTTAGCCG AGTCGGTAGC CTAGAGCCTT TACATTGCTA TGCTGAACCA
GCGCTTGGCG TAGTTGAGTG TGCATCGGCG ATTGAGATAA GGTTACAGGG GCAAATAAAC
TATTTTAAGA GAGCGTTTTG GATGTTAGAG GCAAAAGAGC ATTCAATGCT GATGAAACGG
ATTTAA
 
Protein sequence
MVKNLHLREL LDIEDVQARS TALRRAFAAY TEPLDVTGEE SSALIILLNL TYPKKLVDDL 
LDKRLARTTV NNQTHIDVCI DEVQWLHTHN LKYPDIRVSK QRLVASSPQL HPNMLSSANC
QRTLGWSHDS AKVNFAKLFG CQFIWQDKVT CLATLMCEAP KHWKEAFQEL GMPVKQFMNI
CGRVKHVLPA QLSPDKVDKY SVQVRMPFRD GYIAITPVVS HALQSELQQA AIKKQGRYTS
LEFIRPAAVS ELVASLGGNA KALNYPPRIN KKTHGLGNNL LARVLSGHDV LNQNVLSQPR
FMRVLDELLS SESALALKQR RQQKVANLRQ IRTMLSEWLA PMLEWRLEVK EGPILLSELE
PINGSLEYQF LTFENELLPE LLLPLFDRLN SVMSHSAMIS QYAFHQRLMP LLRNSLKWLL
THLAYKEPLL QNDAADEHFQ RYLHLKGIRV FDAQALSNPY CVGIPSLTAV WGMMHNYQRR
LNEGLGTQLR FTSFSWFIRQ YSSVSGKKLP EYGMQGSKEN QFRRAGIIDN RHCDLVFDLV
IHIDGYEDAL AILDNRTEML KASFPANFAG GVMHPPELDA VGAWCQLYQC EAELFSTLKR
LPMSGKWIMP TRYSMGNVND LLFLLDKNSS LCPTMFGYLP LTEPVSRVGS LEPLHCYAEP
ALGVVECASA IEIRLQGQIN YFKRAFWMLE AKEHSMLMKR I