Gene EcSMS35_0723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0723 
SymbolphrB 
ID6145206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp729183 
End bp730601 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content55% 
IMG OID641615613 
Productdeoxyribodipyrimidine photolyase 
Protein accessionYP_001742812 
Protein GI170681516 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.121356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.867142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCC ATCTGGTCTG GTTTCGCCAG GATTTACGTC AGCACGATAA TCTCGCGCTG 
GCTGCCGCCT GCCGCAATTC GTCTGTGCGC GTGCTGGCGT TGTATATCGC TACCCCACGC
CAGTGGGCGG CGCATAATAT GTCGCCGCGT CAGGCTGAAT TTATCAATGC ACAACTGAAC
GGGCTACAAA TCGCGCTGGC GGAAAAAGGC ATTCCTTTGC TGTTTCATGA AGTGGATGAC
TTTGCCGCCA GCGTCGAAAT AGTTAAACAG GTGTGCGCGG AAAATCGTGT TACCCATCTG
TTTTATAACT ATCAGTATGA GGTGAATGAG CGGGCGCGGG ATGTGCAGGC TGAGAGGGCG
CTGCGTAACG TGGTGTGTGA AGGATTTGAC GACGGCGTGA TCCTGCCGCC GGGGGCGGTG
ATGACTGGCA ATCATGAGAT GTACAAAGTC TTTACGCCAT TTAAAAACGC CTGGCTTAAA
CGGCTGCGAG AAGGGATGCC GGAGTGCGTC GCTGCGCCAA AAGTACGTAG TAGCGGATCG
ATAGATCCCG CGCCATCCAT TACGCTGAAT TATCCTCGTC AGCCTTTCGA TACTGCGCAT
TTCCCGGTGG AAGAGAAAGC GGCGATTGCG CAATTACGCC AGTTTTGCCA GAACGGTGCC
GGAGAGTATG AGCAACAGCG AGATTTCCCT GCGGTAGAAG GCACCAGCCG TTTGTCCGCC
AGCCTGGCAA CGGGCGGGTT ATCGCCTCGC CAGTGTTTGC ATCGCCTGTT GGCGGAACAG
CCGCAGGCGC TGGACGGTGG GGCCGGTAGT ATCTGGCTTA ATGAGCTGAT CTGGCGCGAG
TTTTACCGTC ATCTGATAAC GTATTACCCC TCGCTGTGTA AACATCGTCC GTTTATTGTC
TGGACCGACC GCGTGCAGTG GCAGAGCAAT TCCGCGCATT TAAAGGCCTG GCAGGAAGGC
AAAACGGGAT ACCCGATTGT TGATGCTGCC ATGCGTCAGC TTAACAGCAC TGGCTGGATG
CATAACCGGC TACGGATGAT TACAGCCAGT TTTTTGGTGA AAGATTTATT GATCGACTGG
CGCGAAGGTG AGCGATATTT CATGTCGCAG CTGATTGATG GTGATTTGGC AGCCAATAAC
GGTGGCTGGC AGTGGGCCGC TTCAACCGGA ACCGATGCAG CGCCGTATTT TCGTATTTTT
AACCCGACAA CCCAGGGCGA GAAATTTGAC CGCGAAGGCG AGTTTATCCG CCAGTGGCTA
CCGGAACTGC GCGATGTGCC AGGGAAAGTG GTGCATGAGC CGTGGAAGTG GGCGCAGAAA
GCAGGTGTGA CGCTGGATTA TCCGCAACCG ATAGTTGATC ACAAAGAGGC AAGGCTGCGA
ACGCTGGCAG CGTACGAAGA AGCGAGGAAA GGAGCCTGA
 
Protein sequence
MTTHLVWFRQ DLRQHDNLAL AAACRNSSVR VLALYIATPR QWAAHNMSPR QAEFINAQLN 
GLQIALAEKG IPLLFHEVDD FAASVEIVKQ VCAENRVTHL FYNYQYEVNE RARDVQAERA
LRNVVCEGFD DGVILPPGAV MTGNHEMYKV FTPFKNAWLK RLREGMPECV AAPKVRSSGS
IDPAPSITLN YPRQPFDTAH FPVEEKAAIA QLRQFCQNGA GEYEQQRDFP AVEGTSRLSA
SLATGGLSPR QCLHRLLAEQ PQALDGGAGS IWLNELIWRE FYRHLITYYP SLCKHRPFIV
WTDRVQWQSN SAHLKAWQEG KTGYPIVDAA MRQLNSTGWM HNRLRMITAS FLVKDLLIDW
REGERYFMSQ LIDGDLAANN GGWQWAASTG TDAAPYFRIF NPTTQGEKFD REGEFIRQWL
PELRDVPGKV VHEPWKWAQK AGVTLDYPQP IVDHKEARLR TLAAYEEARK GA