Gene ECH74115_3066 details


Gene Information

Locus tag: ECH74115_3066
Symbol:
ID: 6969892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2838310
End bp: 2839323
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 44%
IMG OID: 643386898
Product: integrase
Protein accession: YP_002271366
Protein GI: 209400896
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
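
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates, and assumes the stop codon is counted in the gene length but not in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2838310
end_bp = 2839323

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints 1014 0 337
```

Both results agree with the Gene Length (1014 bp) and Protein Length (337 aa) fields.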


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0315896
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 20
Fosmid unclonability p-value: 0.000000103666
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCAATCA AAAAACTCGA TGATGGTCGA TATGAAGTGG ACATCCGCCC TACTGGACGT 
AACGGAAAAC GCATCCGTAG GAAGTTTGAT AAGAAAAGCG AAGCTGTCGC TTTCGAAAAA
TACACGTTGT ACAACCACCA CAATAAAGAA TGGCTATCAA AACCAACAGA CAAACGACGT
CTGTCGGAAC TGACACAGAT CTGGTGGGAT TTAAAGGGTA AACACGAAGA GCATGGGAAA
TCTAATCTTG GAAAAATTGA AATCTTCACA AAAATAACGA ATGACCCATG CGCATTTCAA
ATCACGAAAT CCCTTATCAG CCAGTACTGC GCCACCCGAA GAAGTCAGGG TATTAAACCT
TCGAGTATCA ATCGTGATTT AACATGTATT AGCGGCATGT TTACAGCCCT GATTGAAGCG
GAGTTATTCT TTGGTGAGCA TCCTATCAGA GGGACAAAGA GGCTTAAGGA GGAAAAACCA
GAAACAGGCT ATCTCACACA GGAAGAAATA GCCTTACTGC TTGCAGCACT TGACGGCGAC
AATAAAAAGA TTGCGATTCT TTGCCTAAGT ACAGGAGCAC GTTGGGGAGA AGCAGCTCGT
TTGAAAGCAG AAAATATCAT CCATAACCGC GTCACGTTTG TTAAAACGAA AACAAACAAA
CCACGCACCG TCCCGATCTC AGAGGCTGTT GCCAAAATGA TCGCGGATAA CAAACGAGGT
TTTTTATTCC CTGATGCTGA TTACCCTCGC TTCAGACGAA CAATGAAAGC AATAAAACCG
GATTTGCCAA TGGGGCAAGC CACACATGCA CTAAGGCACA GCTTTGCCAC TCATTTCATG
ATTAATGGAG GAAGTATTAT CACGCTACAA CGGATACTAG GTCACACGCG GATTGAGCAA
ACTATGGTTT ACGCTCATTT TGCGCCAGAG TACCTTCAGG ACGCCATTTC TCTTAATCCG
CTAAGAGGTG GTACTGAGGC CGAGAGTGTC CACACAGTGT CCACAGTAGA GTAA
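
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text with the space-separated 10-base blocks shown above. `gc_content` is a hypothetical helper, not something this page provides, and only the first line of the sequence is used here for brevity.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCAATCA AAAAACTCGA TGATGGTCGA TATGAAGTGG ACATCCGCCC TACTGGACGT"
print(f"{gc_content(first_line):.1f}% GC over the first 60 bases")
```

Passing the full 1014-base sequence instead of `first_line` would give the value to compare against the GC content field.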
 
Protein sequence
MAIKKLDDGR YEVDIRPTGR NGKRIRRKFD KKSEAVAFEK YTLYNHHNKE WLSKPTDKRR 
LSELTQIWWD LKGKHEEHGK SNLGKIEIFT KITNDPCAFQ ITKSLISQYC ATRRSQGIKP
SSINRDLTCI SGMFTALIEA ELFFGEHPIR GTKRLKEEKP ETGYLTQEEI ALLLAALDGD
NKKIAILCLS TGARWGEAAR LKAENIIHNR VTFVKTKTNK PRTVPISEAV AKMIADNKRG
FLFPDADYPR FRRTMKAIKP DLPMGQATHA LRHSFATHFM INGGSIITLQ RILGHTRIEQ
TMVYAHFAPE YLQDAISLNP LRGGTEAESV HTVSTVE
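
The protein sequence follows from the gene sequence under translation table 11. The sketch below is one way to reproduce it, assuming the gene sequence is given in coding orientation and starts in frame. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it allows; since this gene starts with ATG, no special start handling is included. `translate` and the constant names are illustrative.

```python
# Codon table laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGCAATCA AAAAACTCGA TGATGGTCGA"))  # prints MAIKKLDDGR
```

The output matches the first 10-residue block of the protein sequence above.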