Gene ECH74115_1571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1571 
Symbol 
ID6968162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1535679 
End bp1536908 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content52% 
IMG OID643385536 
Productsite-specific recombinase, phage integrase family 
Protein accessionYP_002270030 
Protein GI209397964 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0359454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.000000698249 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCAGAG CACTTAACAA ACTGAGCGAT ACACAGCTGA GGAAAATCAA CGGCACACCC 
GCCCAAAAAA CAGCCTTTCT TAATGACGGT GGAAACCTGA GCGTCAGGCA TTCAACCAGT
GGCCTTTTAA CCTGGTATTT CACTTACAGG GCCGGAACGG GAAGGGGGGC ACCACCGGAA
CGCATTAAGC TGGGAAATTA TCCTGATCTG AGCCTGAAAT CAGCCAGGGA AAAAGCCGCC
CAGTGTCGCG CATGGCTGGC AGAGGGGAAA AATCCACGTC ATGAGCTTAA TTACACCGTA
CAGGAAGCGT TAAAGCCGGT AACGGTTGGC GATGCGCTCA CCTACTGGCT TGAGTCGTAC
GCAAAGGAAA ACCGCGTGGA TTATGCCGCC CTGAAAAAGC GCCTTAATAA TCACGTAATA
CAGCACATTG GTGCTATGCC GCTGGATAAA TGCGAGCTAC GGCACTGGCT GGCCTGTTTT
GACCAGGTGG CAAAGCGAAC GCCTGTTACT GCCGGATTCT TGCTACAGAC GTGCAAACAG
GCGCTTAAGT TCTGCCGGAG GCGGCGCTAT GCAATCAGCA ACGTTCTTGA TGATATGAGT
GTGGCGGACG TTGGGAAAAA ACCGGATATA AGCGAGCGTG TCTTAAGCAC CAAAGAACTG
GGCGAATTAT TGCAGGCACT GGACAAAAAA ATATTCTCCC CCTACTACAT CGCGTTAATC
CGCCTCCTGA TTGTGTTCGG ATGCCGGACG GTAGAACTGA GGTTATCGGA GATCAGCGAG
TGGGATTTTA CCGAAATGCT CTGGACCGTT CCGAAGGAGC ACAGCAAAAC GAAGGTAGCC
ATATTCCGGC CTATACCGGA AGCAATACTG CCGTTCATCA CGCAGCTGGT GGAGCAGAAC
AGGCACACGG GCTTATTGCT GGGGGAAGTG AAACAGGAGG CCAGCGTATC GCAGTATGGA
AGATTAGCGC ACAGGAGGCT TAATCACCCT CACTGGTCAC TGCATGACAT CCGGCGCACC
TTTACAACCA TGCTGAACGA TTTAGGCGTT GATCCGCATG TCGTGGAGCA GCTTACAGGC
CACCAGATGC CAGGAATGCA GCGAGTTTAT AATCATTCCC GTTATCTGGA TGCGAAACGC
AATGCGCTGG ATATGTGGAC GGAGCGGTTA GGGATACTGG CGGGAACACA TGAAAACGTA
ACCACGCTAC CAGTAGCCAG AAGAAAATAA
 
Protein sequence
MSRALNKLSD TQLRKINGTP AQKTAFLNDG GNLSVRHSTS GLLTWYFTYR AGTGRGAPPE 
RIKLGNYPDL SLKSAREKAA QCRAWLAEGK NPRHELNYTV QEALKPVTVG DALTYWLESY
AKENRVDYAA LKKRLNNHVI QHIGAMPLDK CELRHWLACF DQVAKRTPVT AGFLLQTCKQ
ALKFCRRRRY AISNVLDDMS VADVGKKPDI SERVLSTKEL GELLQALDKK IFSPYYIALI
RLLIVFGCRT VELRLSEISE WDFTEMLWTV PKEHSKTKVA IFRPIPEAIL PFITQLVEQN
RHTGLLLGEV KQEASVSQYG RLAHRRLNHP HWSLHDIRRT FTTMLNDLGV DPHVVEQLTG
HQMPGMQRVY NHSRYLDAKR NALDMWTERL GILAGTHENV TTLPVARRK