Gene EcHS_A2901 details


Gene Information

Locus tag: EcHS_A2901
Symbol: cas3
ID: 5595096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2901786
End bp: 2904452
Gene Length: 2667 bp
Protein Length: 888 aa
Translation table: 11
GC content: 44%
IMG OID: 640922018
Product: hypothetical protein
Protein accession: YP_001459529
Protein GI: 157162211
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
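
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2901786, 2904452)
print(length)                              # 2667
print(expected_protein_length_aa(length))  # 888
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.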


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACCTT TTAAATATAT ATGTCATTAC TGGGGGAAAT CCTCAAAAAG CCTGACGAAA 
GGAAATGATA TTCATCTGTT GATTTATCAT TGCCTGGATG TTGCAGCTGT TGCAGATTGC
TGGTGGGATC AATCAGTCGT ACTACAAAAT GCTTTTTGCC GAAATGAAAT GCTATCAAAA
CAGAGGGTGA AGGCCTGGCT GTTATTTTTC ATTGCTCTTC ATGATATTGG AAAGTTTGAT
ATACGATTCC AATATAAATC AGCAGAAAGT TGGCTGAAAT TAAATCCTGC AACGCCATCA
CTTAATGGTC CATCAACACA AATGTGCCGT AAATTTAATC ATGGTGCAGC CGGTCTGTAT
TGGTTTAACC AGGATTCACT TTCAGAGCAA TCTCCCGGGG ATTTTTTCAG TTTTTTTGAT
GCCGCGCCTC ATCCTTATGA GTCCTGGTTT CCATGGGTAG AGGCCGTTAC AGGACATCAT
GGTTTTATAT TACATTCCCA GGATCAAGAT AAGTCGCGTT GGGAAATGCC AGCTTCTTTG
GCATCTTATG CTGCGCAAGA TAAACAGGCT CGTGAGGAGT GGATATCTGT ACTCGAAGCA
TTATTTTTAA CGCCAGCCGG GTTATCTATA AACGATATAC CACCTGATTG TTCATCATTG
TTAGCAGGTT TTTGCTCGCT TGCTGACTGG TTAGGTTCCT GGACCACAAC GGATACCTTT
CTATTCAAAG AAGATGCACC TTCAGGCATA CAAGCGCTAA GAACATATTT TCAGGATAGA
CAGCAGGATG CGTGTCGGGT ACTGGCGCTG AGTGGCCTTG TATCAAATAA GCGTCGTTAT
GACGGCGTAC ATGCCTTACT GGACAATGGC TATCAACCAA GACAATTACA GGTGTTGGTT
GATGCTCTTC CTGCAGCTCC CGGGCTGACG GTGATTGAGG CACCTACGGG ATCAGGTAAA
ACCGAAACAG CGCTGGCCTA TGCCTGGAAA CTTATTGACC AACAACTTGC GGATAGTGTT
ATTTTTGCCC TCCCTACACA AGCTACTGCA AATGCAATGC TCAGCAGAAT GGAAGCGAAC
GCGAGCCGTT TATTTACCTC CCCAAATCTT ATTCTTGCTC ATGGTAATTC ACGTTTTAAC
CACCTTTTTC AATCAATAAA ATCACGCGCA TTTACTGAGC AGGGGCAAGA AGAAGCATGG
GTTCAGTGTT GTCAGTGGTT GTCACAAAGC AATAAGAAAG TGTTTCTTGG GCAAATCGGC
GTTTGCACGA TTGATCAGGT GTTGATATCG GTATTGCCAG TTAAACACCG CTTTATCCGT
GGTTTGGGAA TTGGTCGAAG TGTTTTAATT GTTGATGAAG TTCATGCTTA CGACACGTAT
ATGAACGGCT TGCTGGAGGC TGTGCTCAAG GCTCAGGCTG ATGTGGGGGG GAGTGTTATT
CTTCTTTCCG CAACTCTACC AATGAAACAA AAACAGAAAC TTCTGGATAC TTATGGTCTG
CATACAGATC CAGAGGAAAA TAACTCCGCA TATCCACTCA TTAACTGGCG AGGTGTGAAT
GGTGCGCAAC GTTTTGATCT GTTAGCTCAT CCAGAACAAC TCCCGCCCCG CTTTTCAATT
CAGCCAGAAC CTATTTATTT AGCTGACATG TTACCTGACC TTACGATGTT AGAGCGAATG
ATCGCAGCGG CAAACGCGGG TGCACAAGTC TGTCTTATTT GCAATTTAGT TGACGTTGCA
CAAGTATGCT ATCAACGGCT AAAGGAGCTA AATAACACGC AAGTAGATAT AGATTTGTTT
CATGCACGCT TTACGCTGAA CTCTCGTCGA GAAAAAGAGA ATCGAGTTAT TAGCAATTTC
GGCAAAAATG GGAAGCGAAA TGTTGGACGG ATACTTGTCG CCACCCAGGT CGTGGAACAA
TCACTCGACG TTGATTTTGA TTGGTTAATT ACTCAGCATT GTCCTGCAGA TTTGCTTTTC
CAACGATTGG GTCGTTTACA TCGCCATCAT CGCAAATATC GTCCCGCTGG TTTTGAGATT
CCTCTGGCAA CCGTTTTGCT GCCTGATGGC GAAGGTTACG GACGACATGA GCACATTTAT
AGCAACGTTA GAGTCATGTG GCGGACGCAG CAACATATTG AGGAACTTAA TGGAGCATCC
TTATTTTTCC CTGATGCTTA CCGGCAATGG CTGGATAGCA TTTACGATGA TGCTGAAATG
GATGAGCCAG AATGGGTCAT CAAAGGCATG GATAAGTTTG AAAGCGCTGA GTGTGAAAAG
AGGTTCAAGG CTCGCAAGGT TCTGCAGTGG GCTGAAGAAT ATAGTTTGCA GGATAACGAT
GAAACCATTC TTGCGGTAAC GAGGGATGGG GAAATGAGCC TGCCATTATT GCCTTATGTA
CAAACGTCTT CAGGTAAACA ACTGCTCGAT GGCCAGGTCT ACGAGGACCT AAGTCATGAA
CAGCAGTATG AGGCGTTAGC TCTTAATCGC GTCAATGTAC CCTTCACCTG GAAACGTAGT
TTTTCTGAAG TAGTAGATGA AGATGGGTTA CTTTGGCTGG AAGGGAAACA GAATCAGGAT
GGATGGATCT GGCAGGGTAA TAATATTGTT ATTACCTACA CTCGGGATGA AGGGATGACC
AGAGTCATCC CTGCAAATCC CAAATAA
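
The gene sequence is laid out in blocks of 10 bases, so the spaces and line breaks have to be stripped before any computation. As an illustrative sketch (the helper names `clean_sequence` and `gc_content_percent` are hypothetical), the GC content field of a record like this could be recomputed as follows:

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence shown above.
print(gc_content_percent("ATGGAACCTT"))  # 40.0
```

Passing the complete gene sequence text to `gc_content_percent` gives a value that can be compared with the GC content field in the Gene Information block.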
 
Protein sequence
MEPFKYICHY WGKSSKSLTK GNDIHLLIYH CLDVAAVADC WWDQSVVLQN AFCRNEMLSK 
QRVKAWLLFF IALHDIGKFD IRFQYKSAES WLKLNPATPS LNGPSTQMCR KFNHGAAGLY
WFNQDSLSEQ SPGDFFSFFD AAPHPYESWF PWVEAVTGHH GFILHSQDQD KSRWEMPASL
ASYAAQDKQA REEWISVLEA LFLTPAGLSI NDIPPDCSSL LAGFCSLADW LGSWTTTDTF
LFKEDAPSGI QALRTYFQDR QQDACRVLAL SGLVSNKRRY DGVHALLDNG YQPRQLQVLV
DALPAAPGLT VIEAPTGSGK TETALAYAWK LIDQQLADSV IFALPTQATA NAMLSRMEAN
ASRLFTSPNL ILAHGNSRFN HLFQSIKSRA FTEQGQEEAW VQCCQWLSQS NKKVFLGQIG
VCTIDQVLIS VLPVKHRFIR GLGIGRSVLI VDEVHAYDTY MNGLLEAVLK AQADVGGSVI
LLSATLPMKQ KQKLLDTYGL HTDPEENNSA YPLINWRGVN GAQRFDLLAH PEQLPPRFSI
QPEPIYLADM LPDLTMLERM IAAANAGAQV CLICNLVDVA QVCYQRLKEL NNTQVDIDLF
HARFTLNSRR EKENRVISNF GKNGKRNVGR ILVATQVVEQ SLDVDFDWLI TQHCPADLLF
QRLGRLHRHH RKYRPAGFEI PLATVLLPDG EGYGRHEHIY SNVRVMWRTQ QHIEELNGAS
LFFPDAYRQW LDSIYDDAEM DEPEWVIKGM DKFESAECEK RFKARKVLQW AEEYSLQDND
ETILAVTRDG EMSLPLLPYV QTSSGKQLLD GQVYEDLSHE QQYEALALNR VNVPFTWKRS
FSEVVDEDGL LWLEGKQNQD GWIWQGNNIV ITYTRDEGMT RVIPANPK
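
The protein sequence corresponds to a translation of the gene sequence using translation table 11. The sketch below is a plain-Python illustration, not the database's own pipeline. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence contains only A, C, G and T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGAACCTT TTAAATATAT ATGTCATTAC"))  # MEPFKYICHY
# Final line of the gene sequence, ending in the TAA stop codon.
print(translate("AGAGTCATCC CTGCAAATCC CAAATAA"))     # RVIPANPK
```

Both fragments match the start and end of the protein sequence listed above.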