Gene Information |
Locus tag | EcDH1_0928 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 999625 |
End bp | 1001133 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | |
Product | CRISPR-associated protein, Cse1 family |
Protein accession | ACX38611 |
Protein GI | 260448189 |
COG category | |
COG ID | |
TIGRFAM ID | |
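The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 999625
end_bp = 1001133

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1509 bp) and Protein Length (502 aa).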
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTGC TTATTGATAA CTGGATCCCT GTACGCCCGC GAAACGGGGG GAAAGTCCAA ATCATAAATC TGCAATCGCT ATACTGCAGT AGAGATCAGT GGCGATTAAG TTTGCCCCGT GACGATATGG AACTGGCCGC TTTAGCACTG CTGGTTTGCA TTGGGCAAAT TATCGCCCCG GCAAAAGATG ACGTTGAATT TCGACATCGC ATAATGAATC CGCTCACTGA AGATGAGTTT CAACAACTCA TCGCGCCGTG GATAGATATG TTCTACCTTA ATCACGCAGA ACATCCCTTT ATGCAGACCA AAGGTGTCAA AGCAAATGAT GTGACTCCAA TGGAAAAACT GTTGGCTGGG GTAAGCGGCG CGACGAATTG TGCATTTGTC AATCAACCGG GGCAGGGTGA AGCATTATGT GGTGGATGCA CTGCGATTGC GTTATTCAAC CAGGCGAATC AGGCACCAGG TTTTGGTGGT GGTTTTAAAA GCGGTTTACG TGGAGGAACA CCTGTAACAA CGTTCGTACG TGGGATCGAT CTTCGTTCAA CGGTGTTACT CAATGTCCTC ACATTACCTC GTCTTCAAAA ACAATTTCCT AATGAATCAC ATACGGAAAA CCAACCTACC TGGATTAAAC CTATCAAGTC CAATGAGTCT ATACCTGCTT CGTCAATTGG GTTTGTCCGT GGTCTATTCT GGCAACCAGC GCATATTGAA TTATGCGATC CCATTGGGAT TGGTAAATGT TCTTGCTGTG GACAGGAAAG CAATTTGCGT TATACCGGTT TTCTTAAGGA AAAATTTACC TTTACAGTTA ATGGGCTATG GCCCCATCCG CATTCCCCTT GTCTGGTAAC AGTCAAGAAA GGGGAGGTTG AGGAAAAATT TCTTGCTTTC ACCACCTCCG CACCATCATG GACACAAATC AGCCGAGTTG TGGTAGATAA GATTATTCAA AATGAAAATG GAAATCGCGT GGCGGCGGTT GTGAATCAAT TCAGAAATAT TGCGCCGCAA AGTCCTCTTG AATTGATTAT GGGGGGATAT CGTAATAATC AAGCATCTAT TCTTGAACGG CGTCATGATG TGTTGATGTT TAATCAGGGG TGGCAACAAT ACGGCAATGT GATAAACGAA ATAGTGACTG TTGGTTTGGG ATATAAAACA GCCTTACGCA AGGCGTTATA TACCTTTGCA GAAGGGTTTA AAAATAAAGA CTTCAAAGGG GCCGGAGTCT CTGTTCATGA GACTGCAGAA AGGCATTTCT ATCGACAGAG TGAATTATTA ATTCCCGATG TACTGGCGAA TGTTAATTTT TCCCAGGCTG ATGAGGTAAT AGCTGATTTA CGAGACAAAC TTCATCAATT GTGTGAAATG CTATTTAATC AATCTGTAGC TCCCTATGCA CATCATCCTA AATTAATAAG CACATTAGCG CTTGCCCGCG CCACGCTATA CAAACATTTA CGGGAGTTAA AACCGCAAGG AGGGCCATCA AATGGCTGA
|
Protein sequence | MNLLIDNWIP VRPRNGGKVQ IINLQSLYCS RDQWRLSLPR DDMELAALAL LVCIGQIIAP AKDDVEFRHR IMNPLTEDEF QQLIAPWIDM FYLNHAEHPF MQTKGVKAND VTPMEKLLAG VSGATNCAFV NQPGQGEALC GGCTAIALFN QANQAPGFGG GFKSGLRGGT PVTTFVRGID LRSTVLLNVL TLPRLQKQFP NESHTENQPT WIKPIKSNES IPASSIGFVR GLFWQPAHIE LCDPIGIGKC SCCGQESNLR YTGFLKEKFT FTVNGLWPHP HSPCLVTVKK GEVEEKFLAF TTSAPSWTQI SRVVVDKIIQ NENGNRVAAV VNQFRNIAPQ SPLELIMGGY RNNQASILER RHDVLMFNQG WQQYGNVINE IVTVGLGYKT ALRKALYTFA EGFKNKDFKG AGVSVHETAE RHFYRQSELL IPDVLANVNF SQADEVIADL RDKLHQLCEM LFNQSVAPYA HHPKLISTLA LARATLYKHL RELKPQGGPS NG
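As a rough consistency check between the two sequences above, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above (30 bases, 10 codons).
first_blocks = "ATGAATTTGC TTATTGATAA CTGGATCCCT"
print(translate(first_blocks))  # MNLLIDNWIP
```

The ten residues produced match the first block of the protein sequence. Passing the full gene sequence to `translate` gives the complete protein for comparison.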