Gene Information |
Locus tag | EcolC_0952 |
Symbol | |
ID | 6068332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1039261 |
End bp | 1040769 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641600360 |
Product | hypothetical protein |
Protein accession | YP_001723948 |
Protein GI | 170018994 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
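The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table for EcolC_0952.
START_BP = 1039261
END_BP = 1040769


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1509, matches "Gene Length"
print(expected_protein_length(length_bp))  # 502, matches "Protein Length"
```

Both results agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields listed in the record.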
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.557794 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGC TTATTGATAA CTGGATCCCT GTACGCCCGC GAAACGGGGG GAAAGTCCAA ATCATAAATC TGCAATCGCT ATACTGCAGT AGAGATCAGT GGCGATTAAG TTTGCCCCGT GACGATATGG AACTGGCCGC TTTAGCACTG CTGGTTTGCA TTGGGCAAAT TATCGACCCG GCAAAAGATG ACGTTGAATT TCGACATCGC ATAATGAATC CGCTCACTGA AGATGAGTTT CAACAACTCA TCGCGCCGTG GATAGATATG TTCTACCTTA ATCACGCAGA ACATCCCTTT ATGCAGACCA AAGGTGTCAA AGCAAATGAT GTGACTCCAA TGGAAAAACT GTTGGCAGGG GTAAGCGGCG CGACGAATTG TGCTTTTGTC AATCAACCGG GTCAGGGTGA AGCATTATGT GGTGGATGCA CTGCGATTGC GTTATTCAAC CAGGCGAATC AGGCACCTGG TTTTGGTGGT GGCTTTAAAA GCGGTTTACG TGGAGGAACA CCTATAACCA CCTTCGTACG TGGGATCGAT CTTCGCTCGA CGGTGTTACT CAATGTCCTC ACAATACCTC GTCTTCAAAA ACAATTTCCC AATGAATCAC ATACGGAAAA CCAACCTACC TGGGTTAAGC CTGTCAAACC CAATGAATCT GTGCCAGCTT CGTCAATTGG CTTTGTCCGC GGCTTATTCT GGCAACCTGC GCATATTGAA TTATGCGATC CCATTGGGAT TGGAAAATGT TCTTGCTGTG GACAGGAAAG CAATTTGCGT TATACCGGTT TTCTTAAGGA AAAATTTACC TTTACAGTTA ATGGGCTATG GCCCCATCCG CATTCCCCTT GTCTGGTAAC AGTCAAGAAA GGGGAGGTTG AGGAAAAATT TCTTGCTTTC ACCACCTCCG CACCATCATG GACACAAATC AGCCGAGTTG TGGTAGATAA GATTATTCAA AATGAAAATG GAAATCGCGT GGCGGCGGTT GTGAATCAAT TCAGAAATAT TGCGCCGCAA AGTCCTCTTG AATTGATTAT GGGGGGATAT CGTAATAATC AAGCATCTAT TCTTGAACGG CGTCATGATG TGTTGATGTT TAATCAGGGG TGGCAACAAT ACGGCAATGT GATAAACGAA ATAGTTACTG TTGGTTTGGG ATATAAAACA GCCTTACGCA AGGCGTTATA TACCTTTGCA GAAGGGTTTA AAAATAAAGA CTTTAAAGGG GCTGGAGTCT CTGTTCATGA GACTGCAGAA AGGCATTTCT ATCGACAGAG TGAATTATTA ATTCCCGATG TACTGGCGAA TGTTAATTTT TCCCAGGCTG ATGAGGTCAT AGCTGATTTA CGAGACAAAC TTCATCAATT GTGTGAAATG TTATTTAATC AATCTGTAGC GCCCTATGCA CATCATCCTA AATTAATAAG CACATTAGCG CTCGCCCGCG CCACGCTATA CAAACATTTA CGGGAGTTAA AACCGCAAGG AGGGCCATCA AATGGCTGA
|
Protein sequence | MNLLIDNWIP VRPRNGGKVQ IINLQSLYCS RDQWRLSLPR DDMELAALAL LVCIGQIIDP AKDDVEFRHR IMNPLTEDEF QQLIAPWIDM FYLNHAEHPF MQTKGVKAND VTPMEKLLAG VSGATNCAFV NQPGQGEALC GGCTAIALFN QANQAPGFGG GFKSGLRGGT PITTFVRGID LRSTVLLNVL TIPRLQKQFP NESHTENQPT WVKPVKPNES VPASSIGFVR GLFWQPAHIE LCDPIGIGKC SCCGQESNLR YTGFLKEKFT FTVNGLWPHP HSPCLVTVKK GEVEEKFLAF TTSAPSWTQI SRVVVDKIIQ NENGNRVAAV VNQFRNIAPQ SPLELIMGGY RNNQASILER RHDVLMFNQG WQQYGNVINE IVTVGLGYKT ALRKALYTFA EGFKNKDFKG AGVSVHETAE RHFYRQSELL IPDVLANVNF SQADEVIADL RDKLHQLCEM LFNQSVAPYA HHPKLISTLA LARATLYKHL RELKPQGGPS NG