Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2901 |
Symbol | cas3 |
ID | 5595096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2901786 |
End bp | 2904452 |
Gene Length | 2667 bp |
Protein Length | 888 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640922018 |
Product | hypothetical protein |
Protein accession | YP_001459529 |
Protein GI | 157162211 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCTT TTAAATATAT ATGTCATTAC TGGGGGAAAT CCTCAAAAAG CCTGACGAAA GGAAATGATA TTCATCTGTT GATTTATCAT TGCCTGGATG TTGCAGCTGT TGCAGATTGC TGGTGGGATC AATCAGTCGT ACTACAAAAT GCTTTTTGCC GAAATGAAAT GCTATCAAAA CAGAGGGTGA AGGCCTGGCT GTTATTTTTC ATTGCTCTTC ATGATATTGG AAAGTTTGAT ATACGATTCC AATATAAATC AGCAGAAAGT TGGCTGAAAT TAAATCCTGC AACGCCATCA CTTAATGGTC CATCAACACA AATGTGCCGT AAATTTAATC ATGGTGCAGC CGGTCTGTAT TGGTTTAACC AGGATTCACT TTCAGAGCAA TCTCCCGGGG ATTTTTTCAG TTTTTTTGAT GCCGCGCCTC ATCCTTATGA GTCCTGGTTT CCATGGGTAG AGGCCGTTAC AGGACATCAT GGTTTTATAT TACATTCCCA GGATCAAGAT AAGTCGCGTT GGGAAATGCC AGCTTCTTTG GCATCTTATG CTGCGCAAGA TAAACAGGCT CGTGAGGAGT GGATATCTGT ACTCGAAGCA TTATTTTTAA CGCCAGCCGG GTTATCTATA AACGATATAC CACCTGATTG TTCATCATTG TTAGCAGGTT TTTGCTCGCT TGCTGACTGG TTAGGTTCCT GGACCACAAC GGATACCTTT CTATTCAAAG AAGATGCACC TTCAGGCATA CAAGCGCTAA GAACATATTT TCAGGATAGA CAGCAGGATG CGTGTCGGGT ACTGGCGCTG AGTGGCCTTG TATCAAATAA GCGTCGTTAT GACGGCGTAC ATGCCTTACT GGACAATGGC TATCAACCAA GACAATTACA GGTGTTGGTT GATGCTCTTC CTGCAGCTCC CGGGCTGACG GTGATTGAGG CACCTACGGG ATCAGGTAAA ACCGAAACAG CGCTGGCCTA TGCCTGGAAA CTTATTGACC AACAACTTGC GGATAGTGTT ATTTTTGCCC TCCCTACACA AGCTACTGCA AATGCAATGC TCAGCAGAAT GGAAGCGAAC GCGAGCCGTT TATTTACCTC CCCAAATCTT ATTCTTGCTC ATGGTAATTC ACGTTTTAAC CACCTTTTTC AATCAATAAA ATCACGCGCA TTTACTGAGC AGGGGCAAGA AGAAGCATGG GTTCAGTGTT GTCAGTGGTT GTCACAAAGC AATAAGAAAG TGTTTCTTGG GCAAATCGGC GTTTGCACGA TTGATCAGGT GTTGATATCG GTATTGCCAG TTAAACACCG CTTTATCCGT GGTTTGGGAA TTGGTCGAAG TGTTTTAATT GTTGATGAAG TTCATGCTTA CGACACGTAT ATGAACGGCT TGCTGGAGGC TGTGCTCAAG GCTCAGGCTG ATGTGGGGGG GAGTGTTATT CTTCTTTCCG CAACTCTACC AATGAAACAA AAACAGAAAC TTCTGGATAC TTATGGTCTG CATACAGATC CAGAGGAAAA TAACTCCGCA TATCCACTCA TTAACTGGCG AGGTGTGAAT GGTGCGCAAC GTTTTGATCT GTTAGCTCAT CCAGAACAAC TCCCGCCCCG CTTTTCAATT CAGCCAGAAC CTATTTATTT AGCTGACATG TTACCTGACC TTACGATGTT AGAGCGAATG ATCGCAGCGG CAAACGCGGG TGCACAAGTC TGTCTTATTT GCAATTTAGT TGACGTTGCA CAAGTATGCT ATCAACGGCT AAAGGAGCTA AATAACACGC AAGTAGATAT AGATTTGTTT CATGCACGCT TTACGCTGAA CTCTCGTCGA GAAAAAGAGA ATCGAGTTAT TAGCAATTTC GGCAAAAATG GGAAGCGAAA TGTTGGACGG ATACTTGTCG CCACCCAGGT CGTGGAACAA TCACTCGACG TTGATTTTGA TTGGTTAATT ACTCAGCATT GTCCTGCAGA TTTGCTTTTC CAACGATTGG GTCGTTTACA TCGCCATCAT CGCAAATATC GTCCCGCTGG TTTTGAGATT CCTCTGGCAA CCGTTTTGCT GCCTGATGGC GAAGGTTACG GACGACATGA GCACATTTAT AGCAACGTTA GAGTCATGTG GCGGACGCAG CAACATATTG AGGAACTTAA TGGAGCATCC TTATTTTTCC CTGATGCTTA CCGGCAATGG CTGGATAGCA TTTACGATGA TGCTGAAATG GATGAGCCAG AATGGGTCAT CAAAGGCATG GATAAGTTTG AAAGCGCTGA GTGTGAAAAG AGGTTCAAGG CTCGCAAGGT TCTGCAGTGG GCTGAAGAAT ATAGTTTGCA GGATAACGAT GAAACCATTC TTGCGGTAAC GAGGGATGGG GAAATGAGCC TGCCATTATT GCCTTATGTA CAAACGTCTT CAGGTAAACA ACTGCTCGAT GGCCAGGTCT ACGAGGACCT AAGTCATGAA CAGCAGTATG AGGCGTTAGC TCTTAATCGC GTCAATGTAC CCTTCACCTG GAAACGTAGT TTTTCTGAAG TAGTAGATGA AGATGGGTTA CTTTGGCTGG AAGGGAAACA GAATCAGGAT GGATGGATCT GGCAGGGTAA TAATATTGTT ATTACCTACA CTCGGGATGA AGGGATGACC AGAGTCATCC CTGCAAATCC CAAATAA
|
Protein sequence | MEPFKYICHY WGKSSKSLTK GNDIHLLIYH CLDVAAVADC WWDQSVVLQN AFCRNEMLSK QRVKAWLLFF IALHDIGKFD IRFQYKSAES WLKLNPATPS LNGPSTQMCR KFNHGAAGLY WFNQDSLSEQ SPGDFFSFFD AAPHPYESWF PWVEAVTGHH GFILHSQDQD KSRWEMPASL ASYAAQDKQA REEWISVLEA LFLTPAGLSI NDIPPDCSSL LAGFCSLADW LGSWTTTDTF LFKEDAPSGI QALRTYFQDR QQDACRVLAL SGLVSNKRRY DGVHALLDNG YQPRQLQVLV DALPAAPGLT VIEAPTGSGK TETALAYAWK LIDQQLADSV IFALPTQATA NAMLSRMEAN ASRLFTSPNL ILAHGNSRFN HLFQSIKSRA FTEQGQEEAW VQCCQWLSQS NKKVFLGQIG VCTIDQVLIS VLPVKHRFIR GLGIGRSVLI VDEVHAYDTY MNGLLEAVLK AQADVGGSVI LLSATLPMKQ KQKLLDTYGL HTDPEENNSA YPLINWRGVN GAQRFDLLAH PEQLPPRFSI QPEPIYLADM LPDLTMLERM IAAANAGAQV CLICNLVDVA QVCYQRLKEL NNTQVDIDLF HARFTLNSRR EKENRVISNF GKNGKRNVGR ILVATQVVEQ SLDVDFDWLI TQHCPADLLF QRLGRLHRHH RKYRPAGFEI PLATVLLPDG EGYGRHEHIY SNVRVMWRTQ QHIEELNGAS LFFPDAYRQW LDSIYDDAEM DEPEWVIKGM DKFESAECEK RFKARKVLQW AEEYSLQDND ETILAVTRDG EMSLPLLPYV QTSSGKQLLD GQVYEDLSHE QQYEALALNR VNVPFTWKRS FSEVVDEDGL LWLEGKQNQD GWIWQGNNIV ITYTRDEGMT RVIPANPK
|
| |