Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0723 |
Symbol | phrB |
ID | 6145206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 729183 |
End bp | 730601 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641615613 |
Product | deoxyribodipyrimidine photolyase |
Protein accession | YP_001742812 |
Protein GI | 170681516 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.121356 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.867142 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACCC ATCTGGTCTG GTTTCGCCAG GATTTACGTC AGCACGATAA TCTCGCGCTG GCTGCCGCCT GCCGCAATTC GTCTGTGCGC GTGCTGGCGT TGTATATCGC TACCCCACGC CAGTGGGCGG CGCATAATAT GTCGCCGCGT CAGGCTGAAT TTATCAATGC ACAACTGAAC GGGCTACAAA TCGCGCTGGC GGAAAAAGGC ATTCCTTTGC TGTTTCATGA AGTGGATGAC TTTGCCGCCA GCGTCGAAAT AGTTAAACAG GTGTGCGCGG AAAATCGTGT TACCCATCTG TTTTATAACT ATCAGTATGA GGTGAATGAG CGGGCGCGGG ATGTGCAGGC TGAGAGGGCG CTGCGTAACG TGGTGTGTGA AGGATTTGAC GACGGCGTGA TCCTGCCGCC GGGGGCGGTG ATGACTGGCA ATCATGAGAT GTACAAAGTC TTTACGCCAT TTAAAAACGC CTGGCTTAAA CGGCTGCGAG AAGGGATGCC GGAGTGCGTC GCTGCGCCAA AAGTACGTAG TAGCGGATCG ATAGATCCCG CGCCATCCAT TACGCTGAAT TATCCTCGTC AGCCTTTCGA TACTGCGCAT TTCCCGGTGG AAGAGAAAGC GGCGATTGCG CAATTACGCC AGTTTTGCCA GAACGGTGCC GGAGAGTATG AGCAACAGCG AGATTTCCCT GCGGTAGAAG GCACCAGCCG TTTGTCCGCC AGCCTGGCAA CGGGCGGGTT ATCGCCTCGC CAGTGTTTGC ATCGCCTGTT GGCGGAACAG CCGCAGGCGC TGGACGGTGG GGCCGGTAGT ATCTGGCTTA ATGAGCTGAT CTGGCGCGAG TTTTACCGTC ATCTGATAAC GTATTACCCC TCGCTGTGTA AACATCGTCC GTTTATTGTC TGGACCGACC GCGTGCAGTG GCAGAGCAAT TCCGCGCATT TAAAGGCCTG GCAGGAAGGC AAAACGGGAT ACCCGATTGT TGATGCTGCC ATGCGTCAGC TTAACAGCAC TGGCTGGATG CATAACCGGC TACGGATGAT TACAGCCAGT TTTTTGGTGA AAGATTTATT GATCGACTGG CGCGAAGGTG AGCGATATTT CATGTCGCAG CTGATTGATG GTGATTTGGC AGCCAATAAC GGTGGCTGGC AGTGGGCCGC TTCAACCGGA ACCGATGCAG CGCCGTATTT TCGTATTTTT AACCCGACAA CCCAGGGCGA GAAATTTGAC CGCGAAGGCG AGTTTATCCG CCAGTGGCTA CCGGAACTGC GCGATGTGCC AGGGAAAGTG GTGCATGAGC CGTGGAAGTG GGCGCAGAAA GCAGGTGTGA CGCTGGATTA TCCGCAACCG ATAGTTGATC ACAAAGAGGC AAGGCTGCGA ACGCTGGCAG CGTACGAAGA AGCGAGGAAA GGAGCCTGA
|
Protein sequence | MTTHLVWFRQ DLRQHDNLAL AAACRNSSVR VLALYIATPR QWAAHNMSPR QAEFINAQLN GLQIALAEKG IPLLFHEVDD FAASVEIVKQ VCAENRVTHL FYNYQYEVNE RARDVQAERA LRNVVCEGFD DGVILPPGAV MTGNHEMYKV FTPFKNAWLK RLREGMPECV AAPKVRSSGS IDPAPSITLN YPRQPFDTAH FPVEEKAAIA QLRQFCQNGA GEYEQQRDFP AVEGTSRLSA SLATGGLSPR QCLHRLLAEQ PQALDGGAGS IWLNELIWRE FYRHLITYYP SLCKHRPFIV WTDRVQWQSN SAHLKAWQEG KTGYPIVDAA MRQLNSTGWM HNRLRMITAS FLVKDLLIDW REGERYFMSQ LIDGDLAANN GGWQWAASTG TDAAPYFRIF NPTTQGEKFD REGEFIRQWL PELRDVPGKV VHEPWKWAQK AGVTLDYPQP IVDHKEARLR TLAAYEEARK GA
|
| |