Gene EcolC_0952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0952 
Symbol 
ID6068332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1039261 
End bp1040769 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content44% 
IMG OID641600360 
Producthypothetical protein 
Protein accessionYP_001723948 
Protein GI170018994 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.557794 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTGC TTATTGATAA CTGGATCCCT GTACGCCCGC GAAACGGGGG GAAAGTCCAA 
ATCATAAATC TGCAATCGCT ATACTGCAGT AGAGATCAGT GGCGATTAAG TTTGCCCCGT
GACGATATGG AACTGGCCGC TTTAGCACTG CTGGTTTGCA TTGGGCAAAT TATCGACCCG
GCAAAAGATG ACGTTGAATT TCGACATCGC ATAATGAATC CGCTCACTGA AGATGAGTTT
CAACAACTCA TCGCGCCGTG GATAGATATG TTCTACCTTA ATCACGCAGA ACATCCCTTT
ATGCAGACCA AAGGTGTCAA AGCAAATGAT GTGACTCCAA TGGAAAAACT GTTGGCAGGG
GTAAGCGGCG CGACGAATTG TGCTTTTGTC AATCAACCGG GTCAGGGTGA AGCATTATGT
GGTGGATGCA CTGCGATTGC GTTATTCAAC CAGGCGAATC AGGCACCTGG TTTTGGTGGT
GGCTTTAAAA GCGGTTTACG TGGAGGAACA CCTATAACCA CCTTCGTACG TGGGATCGAT
CTTCGCTCGA CGGTGTTACT CAATGTCCTC ACAATACCTC GTCTTCAAAA ACAATTTCCC
AATGAATCAC ATACGGAAAA CCAACCTACC TGGGTTAAGC CTGTCAAACC CAATGAATCT
GTGCCAGCTT CGTCAATTGG CTTTGTCCGC GGCTTATTCT GGCAACCTGC GCATATTGAA
TTATGCGATC CCATTGGGAT TGGAAAATGT TCTTGCTGTG GACAGGAAAG CAATTTGCGT
TATACCGGTT TTCTTAAGGA AAAATTTACC TTTACAGTTA ATGGGCTATG GCCCCATCCG
CATTCCCCTT GTCTGGTAAC AGTCAAGAAA GGGGAGGTTG AGGAAAAATT TCTTGCTTTC
ACCACCTCCG CACCATCATG GACACAAATC AGCCGAGTTG TGGTAGATAA GATTATTCAA
AATGAAAATG GAAATCGCGT GGCGGCGGTT GTGAATCAAT TCAGAAATAT TGCGCCGCAA
AGTCCTCTTG AATTGATTAT GGGGGGATAT CGTAATAATC AAGCATCTAT TCTTGAACGG
CGTCATGATG TGTTGATGTT TAATCAGGGG TGGCAACAAT ACGGCAATGT GATAAACGAA
ATAGTTACTG TTGGTTTGGG ATATAAAACA GCCTTACGCA AGGCGTTATA TACCTTTGCA
GAAGGGTTTA AAAATAAAGA CTTTAAAGGG GCTGGAGTCT CTGTTCATGA GACTGCAGAA
AGGCATTTCT ATCGACAGAG TGAATTATTA ATTCCCGATG TACTGGCGAA TGTTAATTTT
TCCCAGGCTG ATGAGGTCAT AGCTGATTTA CGAGACAAAC TTCATCAATT GTGTGAAATG
TTATTTAATC AATCTGTAGC GCCCTATGCA CATCATCCTA AATTAATAAG CACATTAGCG
CTCGCCCGCG CCACGCTATA CAAACATTTA CGGGAGTTAA AACCGCAAGG AGGGCCATCA
AATGGCTGA
 
Protein sequence
MNLLIDNWIP VRPRNGGKVQ IINLQSLYCS RDQWRLSLPR DDMELAALAL LVCIGQIIDP 
AKDDVEFRHR IMNPLTEDEF QQLIAPWIDM FYLNHAEHPF MQTKGVKAND VTPMEKLLAG
VSGATNCAFV NQPGQGEALC GGCTAIALFN QANQAPGFGG GFKSGLRGGT PITTFVRGID
LRSTVLLNVL TIPRLQKQFP NESHTENQPT WVKPVKPNES VPASSIGFVR GLFWQPAHIE
LCDPIGIGKC SCCGQESNLR YTGFLKEKFT FTVNGLWPHP HSPCLVTVKK GEVEEKFLAF
TTSAPSWTQI SRVVVDKIIQ NENGNRVAAV VNQFRNIAPQ SPLELIMGGY RNNQASILER
RHDVLMFNQG WQQYGNVINE IVTVGLGYKT ALRKALYTFA EGFKNKDFKG AGVSVHETAE
RHFYRQSELL IPDVLANVNF SQADEVIADL RDKLHQLCEM LFNQSVAPYA HHPKLISTLA
LARATLYKHL RELKPQGGPS NG