Gene EcDH1_0928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0928 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp999625 
End bp1001133 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content44% 
IMG OID 
ProductCRISPR-associated protein, Cse1 family 
Protein accessionACX38611 
Protein GI260448189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTGC TTATTGATAA CTGGATCCCT GTACGCCCGC GAAACGGGGG GAAAGTCCAA 
ATCATAAATC TGCAATCGCT ATACTGCAGT AGAGATCAGT GGCGATTAAG TTTGCCCCGT
GACGATATGG AACTGGCCGC TTTAGCACTG CTGGTTTGCA TTGGGCAAAT TATCGCCCCG
GCAAAAGATG ACGTTGAATT TCGACATCGC ATAATGAATC CGCTCACTGA AGATGAGTTT
CAACAACTCA TCGCGCCGTG GATAGATATG TTCTACCTTA ATCACGCAGA ACATCCCTTT
ATGCAGACCA AAGGTGTCAA AGCAAATGAT GTGACTCCAA TGGAAAAACT GTTGGCTGGG
GTAAGCGGCG CGACGAATTG TGCATTTGTC AATCAACCGG GGCAGGGTGA AGCATTATGT
GGTGGATGCA CTGCGATTGC GTTATTCAAC CAGGCGAATC AGGCACCAGG TTTTGGTGGT
GGTTTTAAAA GCGGTTTACG TGGAGGAACA CCTGTAACAA CGTTCGTACG TGGGATCGAT
CTTCGTTCAA CGGTGTTACT CAATGTCCTC ACATTACCTC GTCTTCAAAA ACAATTTCCT
AATGAATCAC ATACGGAAAA CCAACCTACC TGGATTAAAC CTATCAAGTC CAATGAGTCT
ATACCTGCTT CGTCAATTGG GTTTGTCCGT GGTCTATTCT GGCAACCAGC GCATATTGAA
TTATGCGATC CCATTGGGAT TGGTAAATGT TCTTGCTGTG GACAGGAAAG CAATTTGCGT
TATACCGGTT TTCTTAAGGA AAAATTTACC TTTACAGTTA ATGGGCTATG GCCCCATCCG
CATTCCCCTT GTCTGGTAAC AGTCAAGAAA GGGGAGGTTG AGGAAAAATT TCTTGCTTTC
ACCACCTCCG CACCATCATG GACACAAATC AGCCGAGTTG TGGTAGATAA GATTATTCAA
AATGAAAATG GAAATCGCGT GGCGGCGGTT GTGAATCAAT TCAGAAATAT TGCGCCGCAA
AGTCCTCTTG AATTGATTAT GGGGGGATAT CGTAATAATC AAGCATCTAT TCTTGAACGG
CGTCATGATG TGTTGATGTT TAATCAGGGG TGGCAACAAT ACGGCAATGT GATAAACGAA
ATAGTGACTG TTGGTTTGGG ATATAAAACA GCCTTACGCA AGGCGTTATA TACCTTTGCA
GAAGGGTTTA AAAATAAAGA CTTCAAAGGG GCCGGAGTCT CTGTTCATGA GACTGCAGAA
AGGCATTTCT ATCGACAGAG TGAATTATTA ATTCCCGATG TACTGGCGAA TGTTAATTTT
TCCCAGGCTG ATGAGGTAAT AGCTGATTTA CGAGACAAAC TTCATCAATT GTGTGAAATG
CTATTTAATC AATCTGTAGC TCCCTATGCA CATCATCCTA AATTAATAAG CACATTAGCG
CTTGCCCGCG CCACGCTATA CAAACATTTA CGGGAGTTAA AACCGCAAGG AGGGCCATCA
AATGGCTGA
 
Protein sequence
MNLLIDNWIP VRPRNGGKVQ IINLQSLYCS RDQWRLSLPR DDMELAALAL LVCIGQIIAP 
AKDDVEFRHR IMNPLTEDEF QQLIAPWIDM FYLNHAEHPF MQTKGVKAND VTPMEKLLAG
VSGATNCAFV NQPGQGEALC GGCTAIALFN QANQAPGFGG GFKSGLRGGT PVTTFVRGID
LRSTVLLNVL TLPRLQKQFP NESHTENQPT WIKPIKSNES IPASSIGFVR GLFWQPAHIE
LCDPIGIGKC SCCGQESNLR YTGFLKEKFT FTVNGLWPHP HSPCLVTVKK GEVEEKFLAF
TTSAPSWTQI SRVVVDKIIQ NENGNRVAAV VNQFRNIAPQ SPLELIMGGY RNNQASILER
RHDVLMFNQG WQQYGNVINE IVTVGLGYKT ALRKALYTFA EGFKNKDFKG AGVSVHETAE
RHFYRQSELL IPDVLANVNF SQADEVIADL RDKLHQLCEM LFNQSVAPYA HHPKLISTLA
LARATLYKHL RELKPQGGPS NG