Gene EcDH1_0930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0930 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1001621 
End bp1002712 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content43% 
IMG OID 
ProductCRISPR-associated protein, Cse4 family 
Protein accessionACX38613 
Protein GI260448191 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAACT TTATCAATAT TCATGTTCTG ATCTCTCACA GCCCTTCATG TCTGAACCGC 
GACGATATGA ACATGCAGAA AGACGCTATT TTCGGCGGCA AAAGACGAGT AAGAATTTCA
AGTCAAAGCC TTAAACGTGC GATGCGTAAA AGTGGTTATT ACGCACAAAA TATTGGTGAA
TCCAGTCTCA GAACCATTCA TCTTGCACAA TTACGTGATG TTCTTCGGCA AAAACTTGGT
GAACGTTTTG ACCAAAAAAT CATCGATAAG ACATTAGCGC TGCTCTCCGG TAAATCAGTT
GATGAAGCCG AAAAGATTTC TGCCGATGCG GTTACTCCCT GGGTTGTGGG AGAAATAGCC
TGGTTCTGTG AGCAGGTTGC AAAAGCAGAG GCTGATAATC TGGATGATAA AAAGCTGCTC
AAAGTTCTTA AGGAAGATAT TGCCGCCATA CGTGTGAATT TACAGCAGGG TGTTGATATT
GCGCTTAGTG GAAGAATGGC AACCAGCGGC ATGATGACTG AGTTGGGAAA AGTTGATGGT
GCAATGTCCA TTGCGCATGC GATCACTACT CATCAGGTTG ATTCTGATAT TGACTGGTTC
ACCGCTGTAG ATGATTTACA GGAACAAGGT TCTGCACATC TGGGAACTCA GGAATTTTCA
TCGGGTGTTT TTTATCGTTA TGCCAACATT AACCTCGCTC AACTTCAGGA AAATTTAGGT
GGTGCCTCCA GGGAGCAGGC TCTGGAAATT GCAACCCATG TTGTTCATAT GCTGGCAACA
GAGGTCCCTG GAGCAAAACA GCGTACTTAT GCCGCTTTTA ACCCTGCGGA TATGGTAATG
GTTAATTTCT CCGATATGCC ACTTTCTATG GCAAATGCTT TTGAAAAAGC GGTTAAAGCG
AAAGATGGCT TTTTGCAACC GTCTATACAG GCGTTTAATC AATATTGGGA TCGCGTTGCC
AATGGATATG GTCTGAACGG AGCTGCTGCG CAATTCAGCT TATCTGATGT AGACCCAATT
ACTGCTCAAG TTAAACAAAT GCCTACTTTA GAACAGTTAA AATCCTGGGT TCGTAATAAT
GGCGAGGCGT GA
 
Protein sequence
MSNFINIHVL ISHSPSCLNR DDMNMQKDAI FGGKRRVRIS SQSLKRAMRK SGYYAQNIGE 
SSLRTIHLAQ LRDVLRQKLG ERFDQKIIDK TLALLSGKSV DEAEKISADA VTPWVVGEIA
WFCEQVAKAE ADNLDDKKLL KVLKEDIAAI RVNLQQGVDI ALSGRMATSG MMTELGKVDG
AMSIAHAITT HQVDSDIDWF TAVDDLQEQG SAHLGTQEFS SGVFYRYANI NLAQLQENLG
GASREQALEI ATHVVHMLAT EVPGAKQRTY AAFNPADMVM VNFSDMPLSM ANAFEKAVKA
KDGFLQPSIQ AFNQYWDRVA NGYGLNGAAA QFSLSDVDPI TAQVKQMPTL EQLKSWVRNN
GEA