Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3251 |
Symbol | |
ID | 6967490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2983824 |
End bp | 2985110 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643387064 |
Product | Int protein |
Protein accession | YP_002271528 |
Protein GI | 209395928 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.383765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0000530695 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTAACG CATCATACCC GACAGGCGTT GAAAACCATG GCGGATCACT CCGTATATGG TTTCACTATA ATGGCAAACG TGTCAGAGAA AACCTCGGTG TTCCTGACAC AGCCAAAAAC CGGAAGATCG CTGGTGAACT TCGCACTTCC GTTTGTTTTG CAATCAGAAT GGGGAGTTTC GACTACGCCG CGCAGTTCCC TAATTCCCCT AACCTGAAAC ACTTTGGTCT GGGAAAAAGA GAGATAACCG TTAAGGCACT TTCGGAAAAA TGGTTGGACC TTAAGAAAAT TGAGATTTGT GCGAATGCAC TTAACCGTTA CCAGTCAGTA ATTAAAAACA TGTTACCAAT GTTAGGTGAA AAAAAACTGG TTTCATCCAT AACAAAAGAG GATTTACTTT TCGTAAGGAG AGATTTGTTG ACCGGTTACC AAAAGCTTTC TAATGGAAAG ACTTCTTCCA TAAAAGGGCG CTCAGTGGTC ACGGTAAACT ACTATATGAC AACCATAGCT GGAATGTTTC AATTTGCAAC AGATAATGGT TATACCTCAG GAAACCCATT TAACGGTCTG GCTCCCTTAA AAAAGTCCAA GGTAAAACCA GATCCTCTCA CCCGTGACGA ATTTATTCGT TTTATTGAGG CTTGCCGTCA TCAACAAACA AAAAACCTGT GGATTCTCGC TGTATACACG GGTATTCGTC ACGGGGAGTT GGTATCGCTG GCATGGGAAG ATATAGACCT TAAAGCAAGG ACTATAACCA TCCGTAGAAA TTATACAAAA CTTGGCGAAT TCACTCCACC AAAAACCGAT GCAGGCACCG GAAGGACAAT TCATCTGGTT CAACCAGCTA TTGATGCTCT TAAAAGCCAG GCGGAAATGA CCATGCTTGG AAAGCAACAT TCTGTAGAGG TGAAGCAGAG GGAATATGGG AGAACTGCTG TGCATAAATG CACTTTTGTT TTTAGTCCTC AGGTAACAAA ACAGCAGCAG TTGTCCGGAC CTCACTACAA GGTTGACTCC ATCAGGGAGT CATGGACAAG TATCTTAAAA CGCGCAGGTC TGAGACACAG AAAATCGTAC CAATCCAGGC ATACTTATGC ATGCTGGTCA CTTGCCGCAG GAGCTAATCC TAGTTTTATC GCAAGCCAGA TGGGCCACAC AAACGCACAA ATGGTATTCA ATGTTTACGG AGCATGGATG AAAGACAACA ATCACGAACA GATAGAACTC CTTAACAAAA GACTATCTGA AAGTGTCCCA TGTATGCCCC ATAAGAAAGC AGGGTAA
|
Protein sequence | MSNASYPTGV ENHGGSLRIW FHYNGKRVRE NLGVPDTAKN RKIAGELRTS VCFAIRMGSF DYAAQFPNSP NLKHFGLGKR EITVKALSEK WLDLKKIEIC ANALNRYQSV IKNMLPMLGE KKLVSSITKE DLLFVRRDLL TGYQKLSNGK TSSIKGRSVV TVNYYMTTIA GMFQFATDNG YTSGNPFNGL APLKKSKVKP DPLTRDEFIR FIEACRHQQT KNLWILAVYT GIRHGELVSL AWEDIDLKAR TITIRRNYTK LGEFTPPKTD AGTGRTIHLV QPAIDALKSQ AEMTMLGKQH SVEVKQREYG RTAVHKCTFV FSPQVTKQQQ LSGPHYKVDS IRESWTSILK RAGLRHRKSY QSRHTYACWS LAAGANPSFI ASQMGHTNAQ MVFNVYGAWM KDNNHEQIEL LNKRLSESVP CMPHKKAG
|
| |