Gene Nwi_1941 details


Gene Information

Locus tag: Nwi_1941
Symbol:
ID: 3674785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2129211
End bp: 2131445
Gene Length: 2235 bp
Protein Length: 744 aa
Translation table: 11
GC content: 62%
IMG OID: 637713506
Product: CRISPR-associated helicase Cas3 family protein protein
Protein accession: YP_318553
Protein GI: 75676132
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3; [TIGR01596] CRISPR-associated endonuclease Cas3-HD
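
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions the record does not state: that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the gene record.
start_bp = 2129211
end_bp = 2131445
gene_length_bp = 2235
protein_length_aa = 744

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks agree with the values listed in the record.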


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.557598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTATC ACGCTCATTC GGGCAAGCTG ACGGATGAGA GCGACTGGCA GATCCTGTCG 
CATCACTTAA CACGGGTTGC GGCTCGCGCT GGTATTTACG GCTCCCCCAT AGGCTTGGAA
GGGCTGGCGA AGATCGCCGG CCTTTTTCAC GACTTGGGTA AGTACACAGC GGACTTCCAA
AGACGGCTGC ACGGCATAAA TTTGCGGGTC GATCATTCGA CGGCAGGCGC CGCCGTGCTG
ATGAAGATGG TGCCGAGGCC GGTTCGGGAA ATCGCGGAAC TTGTTGCCTA CACGATACTG
GGACATCACG CCGGCCTGCC CGACAAGTTC AACGAGTTCG GCCATTGCTT CCTGCGTCGG
GTTAGGGAGT TCGAGGACCG CCTCGACCCG GTGTGGAAGG ACCAGCTGTC GTTCGATCTC
GGCGACCTGC AGATGCGCGA GTTGATGGGC AAGTTGTCGC CGGAGAAGAG GATCGCCGAG
TTTGAGCTTT CGGTCGTGAC GCGCATGCTC TTCTCCTGCC TGGTGGATGC CGATTTCAAG
GACACGGAAG CGTTCTACGA CGCGCTCGAA GGTCGGCAGT CGAACCGGGA ATGGCCGCTA
CTGCAGGACG TCCTGCCGGC CTTCCTCGCC GCATTCGATG CTCACATGGC GGCGAAGTCG
AAGGATGGCG AGGTCAACCG GTTGCGAGGC GACATCCTGG CGCATGTGCG GGCGGGAGCA
TTGAACGAGC CGGGACTGTT CACGCTCAAC GTTCCGACCG GCGGTGGCAA GACGCTGGCC
TCGCTCGGCT TTGCTTTGGC TCACGCTAGG AAGTGGGATC ACCGCCGCAT CATCTATGCG
ATTCCGTTTA CTTCGATCGT CGACCAGACG GCCGCAATAT TCCGGGACAT TCTCGGCGAG
GATAACGTGC TCGAGCACCA TTCCGCGATC GACGAAGAAC ACATCGAGGA GCGGTGGGGC
CGTGACAAGC TGAAGCTCGC CATGCAGGAC TGGGCCGCGC CAGTGGTCGT GACCACCAAT
GTCCAGTTCT TCGAAAGCCT GTTCGCTGCG AGAACTTCGC GGGCGCGCAA GCTGCACAAC
ATCGCCGGCT CGATCATCAT CCTGGACGAG GCGCAGACCA TCCCGCGGCC GCTGCTGAAG
CCGTGCGTAC GGATGCTCGA TGCGCTGGCG AGGCTGTTCG GCTGCACCAT CGTGCTGTGC
ACGGCCACGC AGCCGGCCCT CGACGCCTCG AACTTCCCCG ATGGGCTGAA ACTTGACGGC
CGCGAGCTGG CGCCCGATCC TGGGAAGCTT TCAGCCAGGC TGAAGCGAGC GCGGATCGTG
CGCGTCGGAG CGATGAACAA TCCGGAGCTA ATTGAGGCGA TCCGCGCCGA GCCGCAGGCG
CTGTTCATCG TCAATAGCCG CAAGCATGCG CTGGACCTCT ACAAGGAAGG GAAGAACGCC
GGAGTTGACG GACTTGTCCA TCTCACCACC CGCCAGTGCG CCGCTCACCG GCGTCTGATC
CTCGGCGACG TCAAGGCGCG GCTGAAGAAC GGGGAGACGT GCCGGCTGGT CGCGACCAGC
CTCATCGAAG CTGGCGTCGA CGTGGATTTT CCAGGAGTCT GGCGGGCCGA GGCAGGGCTC
GATCAGATCG TCCAGGCGGC CGGCCGATGC AACCGTGAGG GCAGGCATCC GGTGGAGGAC
AGCATCGTCA GCGTCTTCTC GGCGCCCGAC TACCCGCCAC CGCGCGAGAT TGCCGGCCTG
ATCGGCGACA TGGGCCGGGT GATCCCCAAG CATGAGGACC TGCTGTCGCT CGGGGCGATC
GCCGATTATT TCGGCGAGGT CTATTGGCGG GCAGGCCCCG AACTGGATGC GAAGAAGATA
TTGGAGGGCT TCAAGATCAA CCGCGACGGC ACCGATTTCG CTTTCCGCTG CGTGGCGGAA
AAGTTCCGGA TGATCGAGAG CGGCATGGAG CCGGTCATCG TGGAGTTCGA TAACGACGCC
GAGGAAACCG TAACGGAGCT GGAATTCGAG CGGATATCTT CTGGCGTGCT TGCGCGAAAG
CTGCAATCAT ACATTGTGCA GGTGCCGCCC AAGGCACGTC GATTGCTGAT CGACAATGGC
CACGTGGCTT TCGTACGGCC GGATATAAGG GGGGATCAGT TCGCGGTGCT CAAGAACGCA
TCACTCTATC GCCAGGAGGT TGGCCTGATG TGGGAGGACG CGGCGTACCT TGCGGCCGAG
AGTTGGCAGA TTTGA
 
Protein sequence
MTYHAHSGKL TDESDWQILS HHLTRVAARA GIYGSPIGLE GLAKIAGLFH DLGKYTADFQ 
RRLHGINLRV DHSTAGAAVL MKMVPRPVRE IAELVAYTIL GHHAGLPDKF NEFGHCFLRR
VREFEDRLDP VWKDQLSFDL GDLQMRELMG KLSPEKRIAE FELSVVTRML FSCLVDADFK
DTEAFYDALE GRQSNREWPL LQDVLPAFLA AFDAHMAAKS KDGEVNRLRG DILAHVRAGA
LNEPGLFTLN VPTGGGKTLA SLGFALAHAR KWDHRRIIYA IPFTSIVDQT AAIFRDILGE
DNVLEHHSAI DEEHIEERWG RDKLKLAMQD WAAPVVVTTN VQFFESLFAA RTSRARKLHN
IAGSIIILDE AQTIPRPLLK PCVRMLDALA RLFGCTIVLC TATQPALDAS NFPDGLKLDG
RELAPDPGKL SARLKRARIV RVGAMNNPEL IEAIRAEPQA LFIVNSRKHA LDLYKEGKNA
GVDGLVHLTT RQCAAHRRLI LGDVKARLKN GETCRLVATS LIEAGVDVDF PGVWRAEAGL
DQIVQAAGRC NREGRHPVED SIVSVFSAPD YPPPREIAGL IGDMGRVIPK HEDLLSLGAI
ADYFGEVYWR AGPELDAKKI LEGFKINRDG TDFAFRCVAE KFRMIESGME PVIVEFDNDA
EETVTELEFE RISSGVLARK LQSYIVQVPP KARRLLIDNG HVAFVRPDIR GDQFAVLKNA
SLYRQEVGLM WEDAAYLAAE SWQI
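
The gene and protein sequences above are printed in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the spacing, compute GC content, and translate the DNA. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the codon string below follows the usual TCAG ordering); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool. Only the first line of the gene sequence is embedded here to keep the example short.

```python
# Codon assignments in TCAG order; assumed here to be valid for table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGACTTATC ACGCTCATTC GGGCAAGCTG ACGGATGAGA GCGACTGGCA GATCCTGTCG"
dna = clean(first_line)

print(translate(dna))
print(round(gc_fraction(dna), 2))
```

Translating this first line gives MTYHAHSGKLTDESDWQILS, which matches the first twenty residues of the protein sequence. To compare against the GC content and protein length given in the record, paste the full gene sequence in place of `first_line`.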