Gene EcHS_A2899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2899 
Symbolcse1 
ID5592515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2899863 
End bp2901371 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content44% 
IMG OID640922016 
Producthypothetical protein 
Protein accessionYP_001459527 
Protein GI157162209 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.550415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTGC TTATTGATAA CTGGATCCCT GTACGCCCGC AAAGTGGGGG GAAAGTCCAA 
ATCATAAATC TGCAATCGCT ATTCTGCAGT AAAAATCAGT GGCGGTTAAG TTTGCCCCGT
GACGATATGG AACTGGCCGC TTTAGCTCTG CTGGTTTGCA TCGGGCAAAT TATCGCCCCT
GCAAAAGATG ACGTTGAATT TCGGCATCGC ATTATGAATC CGCTCAGTGA AGATGAGTTT
CAACGACTCA TCGCGCCGTG GATAGATATG TTCTACCTTA ATCACGCAGA ACATCCCTTT
ATGCAGACCA AAGGTGTTAA AGCAAATGAT GTGACTCCAA TGGAAAAACT GTTGGCTGGG
GTAAGCGGCG CGACGAATTG TGCATTTGTC AATCAACCGG GTCAGGGTGA AGCATTATGT
GGTGGATGCA CTGCGATTGC GTTATTCAAC CAGGCGAATC AGGCACCTGG TTTTGGTGGT
GGCTTTAAAA GCGGTTTACG TGGAGGAACA CCTATAACCA CCTTCGTACG TGGGATCGAT
CTTCGCTCGA CGGTGTTACT CAATGTCCTC ACAATACCTC GTCTCCAAAA ACAATTTCCC
AATGAATCAC ATACGGAAAA CCAACCTACC TGGATTAAGC CTGTCAAGCC CAATGAATCT
GTGCCAGCTT CGTCAATTGG CTTTGTCCGT GGCTTATTCT GGCAACCAGC GCATATTGAA
TTATGCGATC CCATTGGGAT TGGTAAATGT TCTTGCTGTG GACAGGAAAG CAATTTGCGT
TATACCGGTT TTCTTAAGGA AAAATTTACC TTTACAGTTA ATGGGCTATG GCCCCATCCG
CATTCCCCTT GTCTGGTAAC AGTCAAGAAA GGGGAGGTTG AGGAAAAATT TCTTGCTTTC
ACCACCTCCG CACCATCATG GACACAAATC AGCCGAGTTG TGGTAGATAA AATTATTCAA
AATGAAAATG GAAATCGCGT GGCGGCGGTT GTGAATCAAT TCAGAAATAT TGCGTCGCAA
AGTCCTCTTG AATTGATTAT GGGGGGATAT CGTAATAATC AAGCATCTAT TCTTGAACGG
CGTCATGATG TGCTGATGTT TAATCAGGGA TGGCAACAAT ACGGCAATGT GATAAACGAA
ATAGTGACTG TTGGTTTGGG ATATAAAACA GCCTTACGCA AGGCGTTATA TACCTTTGCA
GAAGGGTTTA AAAATAAAGA CTTCAAAGGG GCCGGAGTCT CTGTTCATGA GACTGCAGAA
AGGCATTTCT ATCGCCAGAG TGAATTATTA ATTCTCGATG TACTGGCGAA TATTAATTTT
TCCCAGGCTG ATGAGGTAAT AGCTGATTTA CGAGACAAAC TTCATCAATT GTGTGAAATG
CTATTTAATC AATCTGTAGC GCCCTATGCA CATCATCCTA AATTAATAAG CACATTAGCG
CTTGCCCGCG CCACGCTATA CAAACATTTA CGGGAGTTAA AACCGCAAGG AGGGCCATCA
AATGGCTGA
 
Protein sequence
MNLLIDNWIP VRPQSGGKVQ IINLQSLFCS KNQWRLSLPR DDMELAALAL LVCIGQIIAP 
AKDDVEFRHR IMNPLSEDEF QRLIAPWIDM FYLNHAEHPF MQTKGVKAND VTPMEKLLAG
VSGATNCAFV NQPGQGEALC GGCTAIALFN QANQAPGFGG GFKSGLRGGT PITTFVRGID
LRSTVLLNVL TIPRLQKQFP NESHTENQPT WIKPVKPNES VPASSIGFVR GLFWQPAHIE
LCDPIGIGKC SCCGQESNLR YTGFLKEKFT FTVNGLWPHP HSPCLVTVKK GEVEEKFLAF
TTSAPSWTQI SRVVVDKIIQ NENGNRVAAV VNQFRNIASQ SPLELIMGGY RNNQASILER
RHDVLMFNQG WQQYGNVINE IVTVGLGYKT ALRKALYTFA EGFKNKDFKG AGVSVHETAE
RHFYRQSELL ILDVLANINF SQADEVIADL RDKLHQLCEM LFNQSVAPYA HHPKLISTLA
LARATLYKHL RELKPQGGPS NG