Gene ECH74115_3251 details


Gene Information

Locus tag: ECH74115_3251
Symbol:
ID: 6967490
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2983824
End bp: 2985110
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 43%
IMG OID: 643387064
Product: Int protein
Protein accession: YP_002271528
Protein GI: 209395928
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
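
The start, end, gene length and protein length values above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2983824, 2985110)
print(length, protein_length(length))  # prints: 1287 428
```

Under these assumptions the computed values agree with the listed gene length of 1287 bp and protein length of 428 aa.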


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.383765
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0000530695
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTAACG CATCATACCC GACAGGCGTT GAAAACCATG GCGGATCACT CCGTATATGG 
TTTCACTATA ATGGCAAACG TGTCAGAGAA AACCTCGGTG TTCCTGACAC AGCCAAAAAC
CGGAAGATCG CTGGTGAACT TCGCACTTCC GTTTGTTTTG CAATCAGAAT GGGGAGTTTC
GACTACGCCG CGCAGTTCCC TAATTCCCCT AACCTGAAAC ACTTTGGTCT GGGAAAAAGA
GAGATAACCG TTAAGGCACT TTCGGAAAAA TGGTTGGACC TTAAGAAAAT TGAGATTTGT
GCGAATGCAC TTAACCGTTA CCAGTCAGTA ATTAAAAACA TGTTACCAAT GTTAGGTGAA
AAAAAACTGG TTTCATCCAT AACAAAAGAG GATTTACTTT TCGTAAGGAG AGATTTGTTG
ACCGGTTACC AAAAGCTTTC TAATGGAAAG ACTTCTTCCA TAAAAGGGCG CTCAGTGGTC
ACGGTAAACT ACTATATGAC AACCATAGCT GGAATGTTTC AATTTGCAAC AGATAATGGT
TATACCTCAG GAAACCCATT TAACGGTCTG GCTCCCTTAA AAAAGTCCAA GGTAAAACCA
GATCCTCTCA CCCGTGACGA ATTTATTCGT TTTATTGAGG CTTGCCGTCA TCAACAAACA
AAAAACCTGT GGATTCTCGC TGTATACACG GGTATTCGTC ACGGGGAGTT GGTATCGCTG
GCATGGGAAG ATATAGACCT TAAAGCAAGG ACTATAACCA TCCGTAGAAA TTATACAAAA
CTTGGCGAAT TCACTCCACC AAAAACCGAT GCAGGCACCG GAAGGACAAT TCATCTGGTT
CAACCAGCTA TTGATGCTCT TAAAAGCCAG GCGGAAATGA CCATGCTTGG AAAGCAACAT
TCTGTAGAGG TGAAGCAGAG GGAATATGGG AGAACTGCTG TGCATAAATG CACTTTTGTT
TTTAGTCCTC AGGTAACAAA ACAGCAGCAG TTGTCCGGAC CTCACTACAA GGTTGACTCC
ATCAGGGAGT CATGGACAAG TATCTTAAAA CGCGCAGGTC TGAGACACAG AAAATCGTAC
CAATCCAGGC ATACTTATGC ATGCTGGTCA CTTGCCGCAG GAGCTAATCC TAGTTTTATC
GCAAGCCAGA TGGGCCACAC AAACGCACAA ATGGTATTCA ATGTTTACGG AGCATGGATG
AAAGACAACA ATCACGAACA GATAGAACTC CTTAACAAAA GACTATCTGA AAGTGTCCCA
TGTATGCCCC ATAAGAAAGC AGGGTAA
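
The GC content listed under Gene Information can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases. The helper name `gc_percent` is hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the 10-base
    block layout used on this page can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above; pass the full sequence for the whole-gene value.
first_row = "ATGAGTAACG CATCATACCC GACAGGCGTT GAAAACCATG GCGGATCACT CCGTATATGG"
print(round(gc_percent(first_row), 1))
```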
 
Protein sequence
MSNASYPTGV ENHGGSLRIW FHYNGKRVRE NLGVPDTAKN RKIAGELRTS VCFAIRMGSF 
DYAAQFPNSP NLKHFGLGKR EITVKALSEK WLDLKKIEIC ANALNRYQSV IKNMLPMLGE
KKLVSSITKE DLLFVRRDLL TGYQKLSNGK TSSIKGRSVV TVNYYMTTIA GMFQFATDNG
YTSGNPFNGL APLKKSKVKP DPLTRDEFIR FIEACRHQQT KNLWILAVYT GIRHGELVSL
AWEDIDLKAR TITIRRNYTK LGEFTPPKTD AGTGRTIHLV QPAIDALKSQ AEMTMLGKQH
SVEVKQREYG RTAVHKCTFV FSPQVTKQQQ LSGPHYKVDS IRESWTSILK RAGLRHRKSY
QSRHTYACWS LAAGANPSFI ASQMGHTNAQ MVFNVYGAWM KDNNHEQIEL LNKRLSESVP
CMPHKKAG
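
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, relying on the fact that table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons. Since this gene starts with ATG, no special handling of the first codon is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string up to the first stop codon.

    Whitespace is removed first. A codon containing anything other
    than A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGAGTAACG CATCATACCC GACAGGCGTT"))  # prints: MSNASYPTGV
print(translate("TGTATGCCCC ATAAGAAAGC AGGGTAA"))     # prints: CMPHKKAG
```

Both fragments match the start and end of the listed protein sequence.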