Gene Information |
Locus tag | B21_00657 |
Symbol | phr |
ID | 8114637 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 697588 |
End bp | 698955 |
Gene Length | 1368 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644846928 |
Product | hypothetical protein |
Protein accession | YP_002998501 |
Protein GI | 251784197 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
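The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported gene length); the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 697588
end_bp = 698955
protein_length_aa = 456

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Number of complete codons in the annotated span.
codons = gene_length_bp // 3

print(gene_length_bp)      # 1368
print(gene_length_bp % 3)  # 0, the span is a whole number of codons
print(codons)              # 456
```

Under that assumption the computed span matches the listed Gene Length (1368 bp), and the codon count matches the listed Protein Length (456 aa).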
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTACCC ATCTGGTCTG GTTTCGCCAG GATTTACGTC TGCACGATAA TCTCGCACTG GCTGCCGCCT GCCGCAATTC GTCTGCACGC GTGCTGGCGT TATATATCGC TACACCACGC CAGTGGGCGA CGCATAACAT GTCGCCGCGT CAGGCTGAAC TTATCAATGC TCAACTGAAT GGGCTACAAA TAGCGCTTGC GGAAAAAGGT ATTCCTTTAT TGTTCCGTGA AGTGGATGAC TTTGTCGCCA GTGTCGAAAT AGTTAAACAG GTGTGCGCGG AAAACAGCGT TACCCACCTG TTTTATAACT ATCAGTATGA AGTGAATGAG CGGGCGCGGG ATGTGGAAGT TGAAAGAGCG CTGCGTAACG TGGTGTGTGA AGGATTTGAT GACAGCGTGA TCCTGCCGCC TGGCGCGGTG ATGACCGGTA ATCACGAGAT GTACAAAGTC TTTACGCCTT TTAAGAATGC CTGGCTGAAA CGGCTGCGGG AAGGGATGCC GGAGTGTGTC GCTGCGCCAA AAGTTCGTAG TAGCGGATCG ATAGAGCCCG CGCCATCCAT TACGCTGAAT TATCCTCGTC AGTCTTTCGA TACTGCGCAT TTCCCGGTGG AAGAAAAAGC GGCGATTGCG CAATTACGCC AGTTTTGCCA GAACGGTGCC GGAGAATATG AGCAACAACG AGATTTTCCG GCAGTGGAAG GCACCAGTCG TTTGTCCGCC AGCCTGGCAA CGGGCGGGTT ATCGCCTCGC CAGTGTTTGC ATCGCTTGTT GGCGGAACAG CCGCAGGCGC TGGACGGTGG GGCCGGTAGT GTCTGGCTTA GTGAGCTGAT CTGGCGCGAG TTCTACCGTC ATCTGATGAC GTATTACCCC TCGTTGTGTA AACATTGTCC GTTTATTGCC TGGACGGATC GTGTGCAGTG GCAGAGCAAT CCCGCACATT TACAGGCCTG GCAGGAAGGC AAAACGGGAT ACCCGATTGT CGATGCTGCC ATGCGTCAGC TTAACAGCAC TGGCTGGATG CATAACCGGC TACGGATGAT TACAGCCAGT TTTCTGGTTA AAGATTTGTT GATCGACTGG CGCGAAGGCG AGCGATATTT CATGTCGCAG CTGATTGATG GTGATTTGGC AGCCAATAAC GGTGGCTGGC AGTGGGCCGC TTCAACCGGA ACCGATGCAG CGCCGTATTT TCGTATTTTC AACCCGACAA CCCAGGGCGA GAAATTTGAC CGTGAGGGCG AGTTTATTCG TCGATGGTTA CCGGAGCTGC GCGATGTACC AGGGAAAGCG GTGCATGAGC CGTGGAAGTG GGCGCAGAAA GCAGGTGTGA TGCTGGATTA TCCGCAACCG ATAGTTGATC ACAAAGAG
|
Protein sequence | MITHLVWFRQ DLRLHDNLAL AAACRNSSAR VLALYIATPR QWATHNMSPR QAELINAQLN GLQIALAEKG IPLLFREVDD FVASVEIVKQ VCAENSVTHL FYNYQYEVNE RARDVEVERA LRNVVCEGFD DSVILPPGAV MTGNHEMYKV FTPFKNAWLK RLREGMPECV AAPKVRSSGS IEPAPSITLN YPRQSFDTAH FPVEEKAAIA QLRQFCQNGA GEYEQQRDFP AVEGTSRLSA SLATGGLSPR QCLHRLLAEQ PQALDGGAGS VWLSELIWRE FYRHLMTYYP SLCKHCPFIA WTDRVQWQSN PAHLQAWQEG KTGYPIVDAA MRQLNSTGWM HNRLRMITAS FLVKDLLIDW REGERYFMSQ LIDGDLAANN GGWQWAASTG TDAAPYFRIF NPTTQGEKFD REGEFIRRWL PELRDVPGKA VHEPWKWAQK AGVMLDYPQP IVDHKE
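The relationship between the two sequences above can be illustrated by translating the start of the gene sequence. This is a minimal sketch, not the database's own pipeline: the helper `translate` and the names `CODON_TABLE` and `first_blocks` are hypothetical. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    Any trailing bases that do not form a complete codon are skipped.
    Stop codons are rendered as '*'.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGATTACCC ATCTGGTCTG GTTTCGCCAG"
print(translate(first_blocks))  # MITHLVWFRQ
```

The output matches the first block of the protein sequence. The full gene sequence can be passed in the same way, since the function strips the spaces between blocks.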