Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2899 |
Symbol | cse1 |
ID | 5592515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2899863 |
End bp | 2901371 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640922016 |
Product | hypothetical protein |
Protein accession | YP_001459527 |
Protein GI | 157162209 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 0.550415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTGC TTATTGATAA CTGGATCCCT GTACGCCCGC AAAGTGGGGG GAAAGTCCAA ATCATAAATC TGCAATCGCT ATTCTGCAGT AAAAATCAGT GGCGGTTAAG TTTGCCCCGT GACGATATGG AACTGGCCGC TTTAGCTCTG CTGGTTTGCA TCGGGCAAAT TATCGCCCCT GCAAAAGATG ACGTTGAATT TCGGCATCGC ATTATGAATC CGCTCAGTGA AGATGAGTTT CAACGACTCA TCGCGCCGTG GATAGATATG TTCTACCTTA ATCACGCAGA ACATCCCTTT ATGCAGACCA AAGGTGTTAA AGCAAATGAT GTGACTCCAA TGGAAAAACT GTTGGCTGGG GTAAGCGGCG CGACGAATTG TGCATTTGTC AATCAACCGG GTCAGGGTGA AGCATTATGT GGTGGATGCA CTGCGATTGC GTTATTCAAC CAGGCGAATC AGGCACCTGG TTTTGGTGGT GGCTTTAAAA GCGGTTTACG TGGAGGAACA CCTATAACCA CCTTCGTACG TGGGATCGAT CTTCGCTCGA CGGTGTTACT CAATGTCCTC ACAATACCTC GTCTCCAAAA ACAATTTCCC AATGAATCAC ATACGGAAAA CCAACCTACC TGGATTAAGC CTGTCAAGCC CAATGAATCT GTGCCAGCTT CGTCAATTGG CTTTGTCCGT GGCTTATTCT GGCAACCAGC GCATATTGAA TTATGCGATC CCATTGGGAT TGGTAAATGT TCTTGCTGTG GACAGGAAAG CAATTTGCGT TATACCGGTT TTCTTAAGGA AAAATTTACC TTTACAGTTA ATGGGCTATG GCCCCATCCG CATTCCCCTT GTCTGGTAAC AGTCAAGAAA GGGGAGGTTG AGGAAAAATT TCTTGCTTTC ACCACCTCCG CACCATCATG GACACAAATC AGCCGAGTTG TGGTAGATAA AATTATTCAA AATGAAAATG GAAATCGCGT GGCGGCGGTT GTGAATCAAT TCAGAAATAT TGCGTCGCAA AGTCCTCTTG AATTGATTAT GGGGGGATAT CGTAATAATC AAGCATCTAT TCTTGAACGG CGTCATGATG TGCTGATGTT TAATCAGGGA TGGCAACAAT ACGGCAATGT GATAAACGAA ATAGTGACTG TTGGTTTGGG ATATAAAACA GCCTTACGCA AGGCGTTATA TACCTTTGCA GAAGGGTTTA AAAATAAAGA CTTCAAAGGG GCCGGAGTCT CTGTTCATGA GACTGCAGAA AGGCATTTCT ATCGCCAGAG TGAATTATTA ATTCTCGATG TACTGGCGAA TATTAATTTT TCCCAGGCTG ATGAGGTAAT AGCTGATTTA CGAGACAAAC TTCATCAATT GTGTGAAATG CTATTTAATC AATCTGTAGC GCCCTATGCA CATCATCCTA AATTAATAAG CACATTAGCG CTTGCCCGCG CCACGCTATA CAAACATTTA CGGGAGTTAA AACCGCAAGG AGGGCCATCA AATGGCTGA
|
Protein sequence | MNLLIDNWIP VRPQSGGKVQ IINLQSLFCS KNQWRLSLPR DDMELAALAL LVCIGQIIAP AKDDVEFRHR IMNPLSEDEF QRLIAPWIDM FYLNHAEHPF MQTKGVKAND VTPMEKLLAG VSGATNCAFV NQPGQGEALC GGCTAIALFN QANQAPGFGG GFKSGLRGGT PITTFVRGID LRSTVLLNVL TIPRLQKQFP NESHTENQPT WIKPVKPNES VPASSIGFVR GLFWQPAHIE LCDPIGIGKC SCCGQESNLR YTGFLKEKFT FTVNGLWPHP HSPCLVTVKK GEVEEKFLAF TTSAPSWTQI SRVVVDKIIQ NENGNRVAAV VNQFRNIASQ SPLELIMGGY RNNQASILER RHDVLMFNQG WQQYGNVINE IVTVGLGYKT ALRKALYTFA EGFKNKDFKG AGVSVHETAE RHFYRQSELL ILDVLANINF SQADEVIADL RDKLHQLCEM LFNQSVAPYA HHPKLISTLA LARATLYKHL RELKPQGGPS NG
|
| |