Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3579 |
Symbol | |
ID | 6968897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3295659 |
End bp | 3296828 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643387378 |
Product | integrase |
Protein accession | YP_002271838 |
Protein GI | 209400578 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.000000000000745226 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGCACTGA GCGACACCAA ACTACGAGGC CTCTACGGAA AACCCTACTC TGGCCCTGCT GAAATCACCG ATGGTGACGG GTTAAGTGTG CGGATTACTC CAGCAGGAAC TATCACGTTT CAGTACCGTT ATCGCTGGAA TGGTAAGCCA GTACGTCTTA CTGTCGGGCG CTATCCGTCT ACTTCACTGA AGGACGCGCG TGTTATCGTC GGTGAGATGC GCGCATTGTA CATGAAGGGA GTTAACCCTA AAAATTATTT TGCGCCCAGT GACGGCGAAC TGACATTAAA AGAATGCCTG GATCAGTGGT GGGATAAGTA TGTTACTGAT CTGAAACCCA ATACACAGAC GCTTTACAGA TCCGTCGTGT ACAACACTAT GTACACACAG TTTGAAGGCT CGCCAGTTGC CAGTATCCCA GTATCGGCAT GGGCTCGTTT CTTTGACAAA CAAGAAAGCC TCAACAAGAA AAAGGCTCGT GTTTTGTTGT TACAACTGCG TTCAGTTATT AACTGGTGTA TCAGCCGCCA GTTAATCCCA TCATGCGAAT TACTCAAGCT TAGTGTTAAG AATATTGGGA AAAAACCAGA TGTAGGGAGT CGTGTACTTA CCTATACGGA ACTGGCAAAA ATTTGGCTGG CGCTGGAGAA CTCAAAAGTA GTTACTTCCA ACAAGGTGTT ACATCAATTG CTATTGCTAT GGGGGGCCAG ACTTTCTGAG CTACGTCTCG CGACAGCCAG TGAGTTCAAC ATGGAAGATC TGGTCTGGAC AACCCCAAAA GAGCATTCAA AAATGGGGAA TATTATCCGG CGTCCGGTAT TCACTCAGGT TAAACCTTAT ATCGAGAGAT TGCTTAATGC GGGTTTTGAT GTGCTATTCC CCGGGCAGGA AATAGATAAA CCTATTGATC GTTCGTCGGC TAATTTGTAC ATGAAAAAGT TAAGGGAGAA AATTGATATT CCTGAATGGA GAACGCATGA TTTTAGGCGT TCTCTGGTAA CGAATTTATC TGGGGAAGGA ATTATGCCCC ACGTCACTGA AAAAATGCTG GGGCATGAAC TTGGGGGAGT TATGGCCGTG TATAATAAAC ACGACTGGTT GTCGGAACAG AAAGATGCGT ATGAGTTGTA TGCTGATAAA ATTTTCTGGC ACGCTAAACA GCTCGGTTAA
|
Protein sequence | MALSDTKLRG LYGKPYSGPA EITDGDGLSV RITPAGTITF QYRYRWNGKP VRLTVGRYPS TSLKDARVIV GEMRALYMKG VNPKNYFAPS DGELTLKECL DQWWDKYVTD LKPNTQTLYR SVVYNTMYTQ FEGSPVASIP VSAWARFFDK QESLNKKKAR VLLLQLRSVI NWCISRQLIP SCELLKLSVK NIGKKPDVGS RVLTYTELAK IWLALENSKV VTSNKVLHQL LLLWGARLSE LRLATASEFN MEDLVWTTPK EHSKMGNIIR RPVFTQVKPY IERLLNAGFD VLFPGQEIDK PIDRSSANLY MKKLREKIDI PEWRTHDFRR SLVTNLSGEG IMPHVTEKML GHELGGVMAV YNKHDWLSEQ KDAYELYADK IFWHAKQLG
|
| |