Gene SeHA_C0831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0831 
Symbol 
ID6491460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp823199 
End bp824620 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content57% 
IMG OID642741082 
Productdeoxyribodipyrimidine photolyase 
Protein accessionYP_002044740 
Protein GI194448497 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.31293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.0019608 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCACCC ATTTAGTCTG GTTCAGGCGC GATCTGCGTT TACAGGATAA CCTTGCGCTG 
GCCGCCGCCT GCCGCGATGC ATCCGCGCGG GTGCTGGCGC TTTATATTTC CACCCCCGCG
CAGTGGCAGG CCCATGATAT GGCGCCGCGG CAGGCGGCAT TCATCAGCGC GCAGCTCAAT
GCGCTACAGG CGGCCCTTGC AGAGAAAGGC ATTCCGCTGC TGTTTCATGA AGTGGCGGAT
TTTAACGCCA GTATTGAAAC GGTCAAAAAC GTTTGCCGAC AGCATGACGT CAGCCATCTG
TTTTATAACT ATCAGTATGA GTTTAACGAG CGTCAGCGTG ATGCGGCGGT GGAAAAAACG
CTGCCCTCTG TCATCTGCGA AGGCTTTGAC GATAGCGTGA TTCTGGCGCC CGGCGCGGTG
ATGACCGGCA ATCATGAAAT GTATAAAGTT TTTACGCCGT TTAAAAACGC CTGGTTGAAA
CGGCTAAAAG AGGATATTCC GCCATGCGTT CCGGCCCCGA AGATCCGGGT GAGCGGCGCG
CTTTCTACAC CCTTAACGCC AGTCTCGCTT AACTACCCGC AACAGGCGTT TGACGCCGCG
CTTTTCCCGG TGGAAGAAAA CGCGGTCATC GCGCAGCTAC GTCAGTTTTG CGCGCAGGGC
GCAGACGGGT ATGAGTCACG GCGGGATTTC CCTGCGGTCG ACGGCACCAG TCGGCTTTCC
GCCAGCCTGG CGACGGGTGG CCTGTCGCCG CGACAGTGCC TGCACCGATT ACTGGCGGAG
CAGCCGCAGG CGCTGGACGG CGGACCGGGA AGCGTCTGGT TGAATGAGCT CATCTGGCGG
GAATTTTACC GTCATTTAAT GACCTGGTAT CCGGCATTAT GCAAACACCA GCCGTTTATC
CGCTGGACAA AACGCGTCGC GTGGCAGGAA AACCCGCACT ATTTTCAGGC ATGGCAGAAA
GGCGAAACCG GTTATCCCAT TGTCGATGCG GCGATGCGCC AGCTTAACGC GACGGGCTGG
ATGCATAACC GTTTACGCAT GATTACAGCC AGCTTTCTGG TAAAAGATCT GCTGATTGAC
TGGCGGTTGG GGGAGCGCTA TTTCATGTCG CAGCTTATTG ACGGCGATCT TGCCGCCAAC
AATGGCGGCT GGCAGTGGGC CGCTTCAACC GGTACTGATG CCGCGCCTTA TTTTCGTATT
TTTAATCCCA CGACTCAGGG AGAGAGGTTC GATCGCGACG GCGAATTTAT CCGCCAGTGG
CTACCGGCAT TACGCGATAT CCCTGGAAAA GCGATTCACG AGCCGTGGCG GTGGGCGGAA
AAAGCGGGAG TCGTGCTTGA TTATCCTCGG CCTATTGTGG AACACAAACA GGCGAGAATC
GCGACGCTTT CCGCCTATGA GGCGGCGAGA AAAGGGGCGT GA
 
Protein sequence
MPTHLVWFRR DLRLQDNLAL AAACRDASAR VLALYISTPA QWQAHDMAPR QAAFISAQLN 
ALQAALAEKG IPLLFHEVAD FNASIETVKN VCRQHDVSHL FYNYQYEFNE RQRDAAVEKT
LPSVICEGFD DSVILAPGAV MTGNHEMYKV FTPFKNAWLK RLKEDIPPCV PAPKIRVSGA
LSTPLTPVSL NYPQQAFDAA LFPVEENAVI AQLRQFCAQG ADGYESRRDF PAVDGTSRLS
ASLATGGLSP RQCLHRLLAE QPQALDGGPG SVWLNELIWR EFYRHLMTWY PALCKHQPFI
RWTKRVAWQE NPHYFQAWQK GETGYPIVDA AMRQLNATGW MHNRLRMITA SFLVKDLLID
WRLGERYFMS QLIDGDLAAN NGGWQWAAST GTDAAPYFRI FNPTTQGERF DRDGEFIRQW
LPALRDIPGK AIHEPWRWAE KAGVVLDYPR PIVEHKQARI ATLSAYEAAR KGA