Gene ECH74115_2817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2817 
Symbol 
ID6970349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2618574 
End bp2619653 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content43% 
IMG OID643386667 
Productsite-specific recombinase, phage integrase family 
Protein accessionYP_002271143 
Protein GI209399988 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000101654 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGAC GAAGGAAAAA TCCCGAACAC GAAAAATTAC CGCCAAAGGT ATACCCCAAT 
AAATATAGTG TATGGAAACC GACATCCAGA GAATCTGTAA CCTTAACTGC AATCGAGGAT
GGTTTAGCCG CATTATGGAA AAAGTATGAA GAAACGGTTA ACCATCGCGA TCGCGCAATG
ACATTTGGGC GTTTGTGGGA AAAATTTCTC GCCAGCGCCT ATTACAGCGA GCTTAGTCCT
AGAACTCAAA AAGATTATCT GCAACATCAA AAAAAGCTGC TGGCCGTATT CGGTAAGGTG
CTGGCCGATT CTGTAAAACC AGAGCACATC AGACGATACA TGGACAAAAG AGGCGAGCAG
AGTAAAACGC AGGCAAACCA TGAAAAAAGC AGTATGTCGC GCGTTTATAG TTGGGGGTAT
GAGCGAGGAT ACGTGAAGGC TAACCCATGT GCAGGTGTAA GTAAATTCAA GGCCAAAAAC
CGCGAACGAT ATGTAACCGA CAAAGAATAC CAGGCAGTAT TAAGCGTTGC ACCTCTTCCT
GTTTTTATCG CAATGGAAAT TGCCTATCTG TGTGCAGCGA GGGTTTCCGA TGTGTTATCG
CTGAAATGGG AGCAGATTGG AAACGACGGG ATCTTTATCC AGCAAGGGAA AACAGGGAAA
AAACAGATAA AAGCATGGAG TCCACGATTA CAGGCGGCGA TCGAAAAAGC AAAACAGTTA
CCAACATCCG CCTATGTAAT CAGCAATCAA TACGGCAACC GATATATGTA CAAAGGCTTT
AACGAAATGT GGGTAGAAGC AAGAAATCAC GCAGGCAAAA TTTCAGGTAT TTTAACCGAC
TTCACCTTTC ATGATCTGAA GGCGAAAGGA ATTTCAGACT ATGAAGGAAG CAGCCGGGAT
AAGCAACTTT TCTCTGGTCA CAAAACCGAA GGGCAAGTGC TAATCTATGA CAGGAAGGTT
AAAGTTTCAC CAACACTTGA TGTCCCGTTA CCTGAAAATA TTCCAAGAAA ATATTCCAAG
AAAATATTCC AAGTAATTCC AAGTGTGATT TTTGTCACTG ACTTAATGAT GTGTAAGTGA
 
Protein sequence
MGRRRKNPEH EKLPPKVYPN KYSVWKPTSR ESVTLTAIED GLAALWKKYE ETVNHRDRAM 
TFGRLWEKFL ASAYYSELSP RTQKDYLQHQ KKLLAVFGKV LADSVKPEHI RRYMDKRGEQ
SKTQANHEKS SMSRVYSWGY ERGYVKANPC AGVSKFKAKN RERYVTDKEY QAVLSVAPLP
VFIAMEIAYL CAARVSDVLS LKWEQIGNDG IFIQQGKTGK KQIKAWSPRL QAAIEKAKQL
PTSAYVISNQ YGNRYMYKGF NEMWVEARNH AGKISGILTD FTFHDLKAKG ISDYEGSSRD
KQLFSGHKTE GQVLIYDRKV KVSPTLDVPL PENIPRKYSK KIFQVIPSVI FVTDLMMCK