Gene ECH74115_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1503 
Symbol 
ID6968325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1485217 
End bp1486335 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content45% 
IMG OID643385474 
Productphage integrase family protein 
Protein accessionYP_002269968 
Protein GI209398624 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.279125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCGA GGCCACGAAA AAACAGCACT GACGTAGCCG GTCTTTACGA AAAGTTTGAT 
CGCAGAACTG GCAGAGTTTA CTACCAGTAT AAAAATCCTG TGACTGGAAA ATTTCACGGA
CTCGGAACAG ACAAAGGTAA GGCAGAAAAA ATCGCTTCCA CAGCCAATCA GCGAATAGCT
GCAGCAGAAG CTGAATATTT CATGCGCAAA ATTGATGAAA GTCCGTCAGC AACAAAACGT
CGGGGTATCA GATTAAAGGC ATGGGTTGAT CGATATCTGA AAATACAGGA CACGCGACTG
AAAAATGGAG ATATTGCAGC TACAACTCAC AAAGAAAAAA CTCGAATGGC TGCATACCTG
GTTTCCCGTC TGGGAAACCA CCCATTGAAA GAACTGGAAG TAAGAGACTT TGCATTAATA
CTGGATGAGT GGCTGGATAA AGACATGGTC AGTACAGCGA GAGTAAATCG TGGATTATGG
GTTGATATTT ATAAAGAAGC ACAGCATGCA GGGGAAGTTC CTCCTGGATG GAATCCTCCG
GAGGCTACCC GTAAACCGAT CCCTAAAGTA ACCAGAGCCA GGCTCACCAT GGAAGACTGG
CAAAAAATTT ACAATGCAAC GCCTGAAAAA CACTTTATCC GTAACGCAAT GCTTCTTGCG
ATTGTTACTG GTCAGCGCCG TGATGACATT TGCCACATGC GTTTTTCAGA TGTGTGGAAC
GAACACTTGC ATATCACCCA GGGAAAAACC GGAATGCGTC TGGCGTTACC GCTTACACTA
CGCTGTGATG CCATTGGGAT AACGTTAAAA GAAGTTATTG ATGGGTGCCG AGACAGAATA
TTAAGTCCAT ATCTAATCCA TAGTCGGCAC CAGAAACAAC CGAAGCCGAT GAGTAAAGAC
AACCTGAGCG ACTACTTTGC CAAAGCACGG GATCTGGCTG GGGTAATTCC ACCAGCAGGA
AAAACTCCGC CAACATTTCA TGAACAACGC TCTCTATCAG AACGGCTGTA CCGTGCACAG
GGTATCGATA CAAAAACATT ACTAGGACAT AAAGTCCAGG CAACCACCGA TCGCTATAAC
GATACTCGAG GTCAGGAATG GGTTAAGTTG GTTATTTGA
 
Protein sequence
MSPRPRKNST DVAGLYEKFD RRTGRVYYQY KNPVTGKFHG LGTDKGKAEK IASTANQRIA 
AAEAEYFMRK IDESPSATKR RGIRLKAWVD RYLKIQDTRL KNGDIAATTH KEKTRMAAYL
VSRLGNHPLK ELEVRDFALI LDEWLDKDMV STARVNRGLW VDIYKEAQHA GEVPPGWNPP
EATRKPIPKV TRARLTMEDW QKIYNATPEK HFIRNAMLLA IVTGQRRDDI CHMRFSDVWN
EHLHITQGKT GMRLALPLTL RCDAIGITLK EVIDGCRDRI LSPYLIHSRH QKQPKPMSKD
NLSDYFAKAR DLAGVIPPAG KTPPTFHEQR SLSERLYRAQ GIDTKTLLGH KVQATTDRYN
DTRGQEWVKL VI