Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1821 |
Symbol | |
ID | 6969159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1737316 |
End bp | 1738446 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643385759 |
Product | site-specific recombinase, phage integrase family |
Protein accession | YP_002270249 |
Protein GI | 209397356 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0160439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000000000638112 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGCGCC CGCGAAAATA TAAAACCGAT GTTCCGGGAT TATCTCCGTA TTTTGACAAA AGAAATAACA AAGTTTACTG GCGTTACAGG CATCCCATAA CAGGCAAAAA TCACGGTCTC GGCAGTATTG ACCAGAAACT GGCAGAAACT ATTGCAGCAG AAGCGAACAG CCGTCTTGCC CGGCAGCAAA TGGAACAAAT GCTCAGTCTG CAGGAGAAAA TTATTAGTGA TACCGGCGGT TCATCAACCG TTACCATTTT TCTGAATAAT TACAGAAAAA TTCAACAGGA AAGATATGAA AACGGCGAGA TCAAACTCAA CACGCTGAAA CAGAAAGCGG CCCCTCTCAG GGTATTTGAT GAACGTTTTG GCACCAGACC GTTAGATGCC ATAACCGTAA AAGATGTGGT ATCAGTACTG GAAGAGTACA AGGCCAGAGG ACATAACAGA ATGGGACAAA TTTTCAGGAA AGTACTGATC GATGTTTTCC GGGAAGCTCA GCAAACGGGC GATGTCCCGC CAGGCTTTAA CCCTGCAGAA TCTGCAAAAA AACCGCAGGT GCGGATATCA AGACAGCGAC TGACTTTTGA TGAGTGGATG ATGATTTATA ACGCAGCGGA AAAGGATGGT TACTTTTTAC AGCGCGGTAT GCTGCTGGCA CTGATGACAG GCCAGCGCCT TTCAGATATT TGCAAAATGC AATTTTCGGA TATCCGGGAT GGTTATCTTC ATGTCGAACA GCAAAAAACA GGAACCCGGA TTGCCATCCC TCTGGCTCTG CGTTGCGATA AATTAAATCT CACCCTGGAT GATGTGGTGT CATCCTGCCG CGATTGCGTT CTTAGTCCGT GGCTATTGCA CCACCATCAC GCGAAAGGGA CAGCTAAGCG CGGCGGGATG GTTAAGCCAG CAACATTAAC CGTTGCATTT AAAAAAGCCC GGGATTCTGT GGATTACAAC TGGCGTGCTA ATGGCACCCC ACCCTCTTTC CATGAGCAGA GATCTTTATC AGAGCGATTG TTCAGAGAGC AGGGGGTTGA TACCAAAATT TTGCTAGGCC ATTCGAATCA AAAAATGATC GATATTTACA ACGACGCACG CGGTAAGGAA TGGAAAAAAC TGGTCATTTG A
|
Protein sequence | MARPRKYKTD VPGLSPYFDK RNNKVYWRYR HPITGKNHGL GSIDQKLAET IAAEANSRLA RQQMEQMLSL QEKIISDTGG SSTVTIFLNN YRKIQQERYE NGEIKLNTLK QKAAPLRVFD ERFGTRPLDA ITVKDVVSVL EEYKARGHNR MGQIFRKVLI DVFREAQQTG DVPPGFNPAE SAKKPQVRIS RQRLTFDEWM MIYNAAEKDG YFLQRGMLLA LMTGQRLSDI CKMQFSDIRD GYLHVEQQKT GTRIAIPLAL RCDKLNLTLD DVVSSCRDCV LSPWLLHHHH AKGTAKRGGM VKPATLTVAF KKARDSVDYN WRANGTPPSF HEQRSLSERL FREQGVDTKI LLGHSNQKMI DIYNDARGKE WKKLVI
|
| |