Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0755 |
Symbol | phrB |
ID | 5592818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 765587 |
End bp | 767005 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640919931 |
Product | deoxyribodipyrimidine photolyase |
Protein accession | YP_001457505 |
Protein GI | 157160187 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 0.391934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACCC ATCTGGTCTG GTTTCGCCAG GATTTACGTC TGCACGATAA TCTCGCACTG GCTGCCGCCT GCCGCAATTC GTCTGCACGC GTGCTGGCGT TGTATATCGC TACACCACGC CAGTGGGCGA CGCATAACAT GTCGCCGCGT CAGGCTGAAC TTATCAATGC TCAACTGAAT GGGCTACAAA TAGCGCTTGC GGAAAAAGGT ATTCCTTTAT TGTTCCGTGA AGTGGATGAC TTTGTCGCCA GTGTCGAAAT AGTTAAACAG GTGTGCGCGG AAAACAGCGT TACCCACCTG TTTTATAACT ATCAGTATGA AGTGAATGAG CGGGCGCGGG ATGTGGAAGT TGAAAGAGCG CTGCGTAACG TGGTGTGTGA AGGATTTGAT GACAGCGTGA TCCTGCCGCC TGGCGCGGTG ATGACCGGTA ATCACGAGAT GTACAAAGTC TTTACGCCTT TTAAGAATGC CTGGCTGAAA CGGCTGCGGG AAGGGATGCC GGAGTGCGTC GCTGCGCCAA AAGTTCGTAG TAGCGGATCG ATAGAGCCCT CGCCATCCAT TACGCTGAAT TATCCTCGTC AGTCTTTCGA TACTGCGCAT TTTCCGGTGG AAGAAAAAGC GGCGATTGCG CAATTACGCC AGTTTTGCCA GAACGGTGCC GGAGAATATG AGCAACAACG AGATTTTCCG GCAGTGGAAG GCACCAGCCG TTTGTCGGCC AGCCTGGCAA CGGGCGGGTT ATCGCCTCGC CAGTGCTTGC ATCGCTTGTT GGCTGAACAG CCGCAGGCGC TGGACGGTGG GGCCGGTAGT GTCTGGCTTA ATGAGCTGAT CTGGCGCGAG TTTTACCGTC ACCTGATAAC GTATCACCCC TCGTTGTGTA AACATCGTCC ATTTATTGCC TGGACGGATC GTGTACAGTG GCAGAGCAAT CCCGCACATT TACAGGCCTG GCAGGAAGGC AAAACGGGAT ACCCGATTGT TGATGCCGCT ATGCGTCAGC TTAACAGCAC TGGCTGGATG CATAACAGGC TACGGATGAT TACAGCCAGT TTTCTGGTGA AAGATTTATT GATCGACTGG CGCGAAGGCG AGCGATATTT CATGTCGCAG CTGATTGATG GTGATTTGGC AGCCAATAAC GGTGGCTGGC AGTGGGCCGC TTCAACCGGA ACCGATGCAG CGCCGTATTT TCGTATTTTC AACCCGACAA CCCAGGGCGA GAAATTTGAT CATGAGGGCG AGTTTATCCG CCAGTGGCTA CCGGAACTGC GCGATGTGCC AGGGAAAGTG GTGCATGAGC CGTGGAAGTG GGCGCAGAAA GCAGGTGTGA CGCTGGATTA TCCGCAACCG ATAGTCGAGC ACAAAGAAGC GAGAGTACAA ACGTTGGCAG CGTATGAGGC GGCGCGGAAG GGGAAATAA
|
Protein sequence | MTTHLVWFRQ DLRLHDNLAL AAACRNSSAR VLALYIATPR QWATHNMSPR QAELINAQLN GLQIALAEKG IPLLFREVDD FVASVEIVKQ VCAENSVTHL FYNYQYEVNE RARDVEVERA LRNVVCEGFD DSVILPPGAV MTGNHEMYKV FTPFKNAWLK RLREGMPECV AAPKVRSSGS IEPSPSITLN YPRQSFDTAH FPVEEKAAIA QLRQFCQNGA GEYEQQRDFP AVEGTSRLSA SLATGGLSPR QCLHRLLAEQ PQALDGGAGS VWLNELIWRE FYRHLITYHP SLCKHRPFIA WTDRVQWQSN PAHLQAWQEG KTGYPIVDAA MRQLNSTGWM HNRLRMITAS FLVKDLLIDW REGERYFMSQ LIDGDLAANN GGWQWAASTG TDAAPYFRIF NPTTQGEKFD HEGEFIRQWL PELRDVPGKV VHEPWKWAQK AGVTLDYPQP IVEHKEARVQ TLAAYEAARK GK
|
| |