Gene Information |
Locus tag | ECH74115_0798 |
Symbol | phrB |
ID | 6966712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 819425 |
End bp | 820843 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384825 |
Product | deoxyribodipyrimidine photolyase |
Protein accession | YP_002269331 |
Protein GI | 209397896 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
|
|
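The coordinate and length fields above are consistent with each other, and the check can be sketched in a few lines. This is a minimal illustration, assuming 1-based inclusive coordinates and a complete CDS that ends in a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in the record above.
length_bp = gene_length(819425, 820843)
print(length_bp, protein_length(length_bp))  # prints "1419 472"
```

Both values match the Gene Length (1419 bp) and Protein Length (472 aa) fields in the record.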
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.825453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACCC ATCTGGTCTG GTTTCGCCAG GATTTACGTC TGCATGATAA TCTCGCGCTG GCTGCCGCCT GCCGCAATTC ATCTGCACGC CTGCTGGCGT TGTATATCGC TACACCACGC CAGTGGGCGG CGCATAATAT GTCGCCGCGT CAGGCTGAAC TTATCAATGC TCAACTGAAC GGGCTACAAA TCGCGCTGGC GGAAAAAGGC ATTCCTTTAT TGTTCCGTGA AGTGGATGAC TTTGCCGCCA GCGTCGAAAT AGTTAAACAG GTGTGCGCGG AAAACAGCGT TACTCATCTT TTTTATAACT ATCAGTATGA GGTGAATGAG CGGGCGCGGG ATGTGCAGGT TGAGAGAACT CTGCGTAACG TGGTGTGTGA AGGATTTGAT GACAGCGTGA TCCTTCCGCC TGGCGCGGTG ATGACCGGCA ATCATGAGAT GTACAAAGTC TTTACGCCTT TTAAGAATGC CTGGCTGAAA CGGCTGCGGG AAGGGATGCC GGAGTGCGTC GCTGCACCAA AAGTTCGTAG TAGCGGATCG ATAAAGCCCG CGCCATCCAT TACGCTGAAT TATCCTCGTC AGTCTTTCGA TACTGCGCAT TTCCCGGTGG AAGAAAAAGC GGCGATTGCG CAATTACGCC AGTTTTGCCA GAACGGTGCC GGAGAATATG AGCAACAACG AGATTTTCCG GCAGTGGAAG GCACCAGTCG TTTGTCCGCC AGCCTGGCAA CGGGCGGGTT ATCGCCTCGC CAGTGTTTGC ATCGCTTGTT GGCGGAACAG CCGCAGGCGC TGGACGGTGG GGCCGGTAGT GTCTGGCTTA ATGAGCTGAT CTGGCGCGAG TTCTACCGTC ATCTGATGAC GTATTACCCC TCGTTGTGTA AACATTGTCC GTTTATTGCC TGGACGGATC GTGTGCAGTG GCAGAGCAAT CCCGCACATT TACAGGCCTG GCAGGAAGGC AAAACGGGAT ACCCGATTGT CGATGCTGCC ATGCGTCAGC TTAACAGCAC TGGCTGGATG CATAACAGGC TACGGATGAT TACAGCCAGT TTTCTGGTGA AAGATTTGTT GATCGACTGG CGCGAAGGCG AGCGATATTT CATGTCGCAG CTGATTGATG GTGATTTGGC AGCCAATAAC GGTGGCTGGC AGTGGGCGGC TTCTACGGGT ACTGATGCAG CGCCGTATTT TCGTATTTTC AACCCGACAA CCCAGGGCGA GAAATTTGAC CGCGAAGGCG AGTTTATCCG CCAGTGGCTA CCGGAACTGC GCAATGTACC GGGGAAATCG GTGCATGAGC CGTGGAAGTG GGCTGAGAAA GCAGGTGTGA AGCTGGATTA TCCGCAACCG ATAGTCGAGC ACAAAGAGGC GAGAGTACAA ACGTTGGCAG CGTATGAGGC GGCGCGGAAG GGGAAATAA
|
Protein sequence | MTTHLVWFRQ DLRLHDNLAL AAACRNSSAR LLALYIATPR QWAAHNMSPR QAELINAQLN GLQIALAEKG IPLLFREVDD FAASVEIVKQ VCAENSVTHL FYNYQYEVNE RARDVQVERT LRNVVCEGFD DSVILPPGAV MTGNHEMYKV FTPFKNAWLK RLREGMPECV AAPKVRSSGS IKPAPSITLN YPRQSFDTAH FPVEEKAAIA QLRQFCQNGA GEYEQQRDFP AVEGTSRLSA SLATGGLSPR QCLHRLLAEQ PQALDGGAGS VWLNELIWRE FYRHLMTYYP SLCKHCPFIA WTDRVQWQSN PAHLQAWQEG KTGYPIVDAA MRQLNSTGWM HNRLRMITAS FLVKDLLIDW REGERYFMSQ LIDGDLAANN GGWQWAASTG TDAAPYFRIF NPTTQGEKFD REGEFIRQWL PELRNVPGKS VHEPWKWAEK AGVKLDYPQP IVEHKEARVQ TLAAYEAARK GK
|
| |
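The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. The sketch below shows one way to do that and to compute a GC fraction that could be compared against the reported GC content of 54%. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence listed above, used as sample input.
sample = clean_sequence("ATGACTACCC ATCTGGTCTG")
print(len(sample), gc_fraction(sample))
```

To check the full gene, paste the complete Gene sequence value into `clean_sequence` and multiply the result of `gc_fraction` by 100 to get a percentage.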