Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2663 |
Symbol | |
ID | 6971822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2505696 |
End bp | 2508518 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643386526 |
Product | bacteriophage replication gene A protein |
Protein accession | YP_002271008 |
Protein GI | 209397309 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.978524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.000159961 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGCAG AGTACATCAG GGACTGGCAA CAACCGCGCC ACGCAGTGGG GCGTGAAGGA ACGGGGATCC CCGCTCCTGA ATCCGCGCTT TCCTCCTGGC TGGATGCCTA CCGGGTAGAG AACGAGCGCC GCCAGGAAAT GGCTGATGCG GCGTTCTCCG CAACGCCGCT GGGCAACCTG ATTAATAAAA GCCTGGACGC ACAGGAAAAA CAGGACAAAA CCATCACACT GGCAGGAGAC GCCAGAAAAC AGGCACGCGG TGCGGTGGAT GAAGCCATGG CCTCGCTGCG CCTGCTGCCG TCCTATCTGC GCGATCCGCT TATTCGCCAC CTCTCCTTCC TGCGCAAAAA ACAGGAAGCC GATCGTCAGA AAGGCAAAAA GAGCTGGCAG GCTGAACGCT ACGCGCGCGG AAACCTGCGC AAAATATTCG AACGTCTGGA GCGCACCGAT CACCGCTGGC TGACACAGGG TTATCGCTCC CTTGCCGGAC GCGAACGCCT GGACGATTTG CTTTACCTGC CGCAGCTCAA CAAACACCAG ATACAGACGC TGGCCACCAT GACGGCGGCG ATGTTCAGCA GCACCTTCGA AAAACTCTGC GATGGCTTTG GCGCGACCGA TGGCGAACTG ACCATGGATG TAACGCTGAA GGCGTATCAG ATGCTGGCCC GCATGGCGTT ACACCTGCAC GCCATGCCTC CACATTATGA CGCACTGACA ACAGACAAAG ACCGGAGGAA CGAACCGGAC ACGGAGCTGC TGCCGGGCGC AATCCTTCGC CTGACCTGTG CGGAATGGTG GAAACGCAAA CTGTGGCTGT TACGTTGCGA GTGGAGAGAA GAACAACTCC GCGCCGCCTG TCTGGTTTCC AGAAAAACAT CGCCCTATCT GAGCCAGGAC GCGTTAAGCG AGTTTCGCGC ACAGCGCGAG AAAACACGCG ATTTCCTGAA AAGTTTCATG CTGGAAAACG AAGACGGGTT CACGATTGAT CTCGAGACAG TGTATTACGC GGGAGTAAGT AACCCGGTTC ACCGTAAGGC AGAAATGATG GCCACCATGA AGGGGCTGGA ACTTCTGGCC GAAGCCCGTG GCGACAAAGC GGTGTTTCTG ACTGTCACCT GCCCGTCAAA ATACCACGCT ACAACAGAGA ACGGTCATCC GAATCCCAAA TGGAACGGGG CCACCATGCG CGACTCCAGC GATTACCTGG TTAACACGTT TTTTGCGGCG GTCCGCAAGA AACTGAACCG CGACGGCCTG CGCTGGTATG GCATCCGCAC GGTGGAGCCT CACCATGACG GCACCGTGCA CTGGCATATG ATGGTCTTTG CTCATCCGGA AGAAATCGAC ACCATTGTGT CCCACACCCG CGATATTGCC ATTCAGGAAG ATCGTCACGA GCTGGGCGAT GATATTACTC CGCGCTTTAA GGCGGAGTAT GTCGACGGCT CAAAAGGCAC GCCAACCAGC TACATCGCCA CCTACATCGG AAAAAACCTG GACAGCCGCG CCGTGGATGG CATCGACCCG AAAACAGGCA AACCACGCGT TGACCACGAA ACCGGAAAAT CAATGACCGA GAGCGTGGAA CGCGCCATTG GCTGGGCGCG CCTTCACCGG GTCCGCCAGT TCCAGTTCTT TGGCATCCCC TCCCGTCAGG TGTGGCGTGA ACTGCGTCGC CTTGCCAGCC AGATGGCACG CAACCCGGAA GGCCCGCAAC GGCTGAAGGA TGACGCAATG GATGCGGTTC TTGCTGCCGC TGATGCCGGA TGTTTTACCA CCTACATTGA GAAACAGGGA GGCGTACTTG TTCCACGCAA GGACTACCTG ATTCGCACCG CCTACGACCT CGCAGATGAG CTGAACGATT ACGGCGAACA GAGCGTACAG ATTTACGGGA TCTGGTCACC ACTCATCGGG GAATCCTCCC GTGTGTGCAC GCACCCGGAT AACTGGAAGC TGGTAAGACG TAAACCGGGA GTAGAAGACA GCGCCCGCGA AAATGGTTTT GACCTTCAGG GCGGCCCTGC CGCCCCTTGG ACTCGTGGCA ATAACTGTCC CCGTGTACAG GAAACGGACA ACAACGGGAC AGAACAGCCG GAAGAACGGC CAGCACCGTG GCCGCAGCTT CCTGACGGCG TTGACGTGAA CGAATGGATG CGCTCACTGA AACGGCACGA ACGCCGGGCG CTGATGCGTT CGCTTCGTGA CAAACAGGCA AAAAACAGCA GTGATGAAAT GCAGAGCTGG ACACAGAGCC GCAAACAGCA GCGGCCTTTG CCTGATAACC ACGAATTACT CGCTAAAGAA TGGCGGGAGT CTGCTGAATC TCTCGGCCTG CATATCGGTG AACAACAGAT GCAGCACCTG TTACGGGGCG GCAGTCTGTA CGTTGACGGC AGCATCATTG CACCGCAGGG ATTTGAAATT GTACGCAAAC CGGATACCCG CCCGGACAGC CGAATCACGC AGCTCTGGCA GCGCCTGAGC CGTAATCATG GCGTAAGCAG CACGGAGATC CGCCATAACC CGGTCGCCAG CTATCTGGCA CAGCTGGGGG CATCAGACCC TGAAGCCGCC GCACGCCTGG CATCCACACT TCAGCAGGAC CAGAACACCA TGAAAACACC CGTTACCGTG CTTTCTGACA TGCTGCGCGC CATCCGCGAC GCAGAGCACG CACAGAGAAT CAGTGAAACC ACTGAACGCG CCAGCCGCAA AGCAGACCTG CTGCGGGGTG GCCTGACCAG TGGAAACAAA AAACAGACAG AAACGGGACT CACAAATCCC GTAAATGAGC AAAAAACGCG CCGCGATATA TGA
|
Protein sequence | MTAEYIRDWQ QPRHAVGREG TGIPAPESAL SSWLDAYRVE NERRQEMADA AFSATPLGNL INKSLDAQEK QDKTITLAGD ARKQARGAVD EAMASLRLLP SYLRDPLIRH LSFLRKKQEA DRQKGKKSWQ AERYARGNLR KIFERLERTD HRWLTQGYRS LAGRERLDDL LYLPQLNKHQ IQTLATMTAA MFSSTFEKLC DGFGATDGEL TMDVTLKAYQ MLARMALHLH AMPPHYDALT TDKDRRNEPD TELLPGAILR LTCAEWWKRK LWLLRCEWRE EQLRAACLVS RKTSPYLSQD ALSEFRAQRE KTRDFLKSFM LENEDGFTID LETVYYAGVS NPVHRKAEMM ATMKGLELLA EARGDKAVFL TVTCPSKYHA TTENGHPNPK WNGATMRDSS DYLVNTFFAA VRKKLNRDGL RWYGIRTVEP HHDGTVHWHM MVFAHPEEID TIVSHTRDIA IQEDRHELGD DITPRFKAEY VDGSKGTPTS YIATYIGKNL DSRAVDGIDP KTGKPRVDHE TGKSMTESVE RAIGWARLHR VRQFQFFGIP SRQVWRELRR LASQMARNPE GPQRLKDDAM DAVLAAADAG CFTTYIEKQG GVLVPRKDYL IRTAYDLADE LNDYGEQSVQ IYGIWSPLIG ESSRVCTHPD NWKLVRRKPG VEDSARENGF DLQGGPAAPW TRGNNCPRVQ ETDNNGTEQP EERPAPWPQL PDGVDVNEWM RSLKRHERRA LMRSLRDKQA KNSSDEMQSW TQSRKQQRPL PDNHELLAKE WRESAESLGL HIGEQQMQHL LRGGSLYVDG SIIAPQGFEI VRKPDTRPDS RITQLWQRLS RNHGVSSTEI RHNPVASYLA QLGASDPEAA ARLASTLQQD QNTMKTPVTV LSDMLRAIRD AEHAQRISET TERASRKADL LRGGLTSGNK KQTETGLTNP VNEQKTRRDI
|
| |