Gene Haur_1969 details


Gene Information

Locus tag: Haur_1969
Symbol:
ID: 5733858
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2413635
End bp: 2415095
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 52%
IMG OID: 641279113
Product: deoxyribodipyrimidine photo-lyase
Protein accession: YP_001544740
Protein GI: 159898493
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
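
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2413635, 2415095)
print(length, protein_length_aa(length))  # prints "1461 486"
```

The result does not depend on the strand, which is not given in this record.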


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGTGA TTTGTTGGTT CCGCCGCGAT TTACGCCTAA CCGATCATCG CGCCTTGTAT 
GCCGCCGCCG AGGCCAGCGC TGGGGCAGTT ATTCCAGTCT TTATTTTGGA TGATACGATT
CTGCACGATG GCTATGTTGG AGCAGCCCTG ATCGCCGTAA CGCTTGCCAT GCTCGAAGCA
CTCGATCACG ATTTGCAGCA ACGAGGCAGT CGTTTGATTG TGCGCCATGG CCAGCCATTA
GCCGAATTGC AACGCCTCGT GAGCGAAACT CAGGCCAGCG GCGTGTACTG GAATCGCGAT
TATTTGCCTT ATGCGATCAA GCGTGATAGC GCAGTTAAGC ACTGGTTACG TGAACAAGGC
TTGCAAGCCC ATTCGTTTCA CGATAGCGTT TTAGTTGAGC CAGAGGGGCT AAAAACCAAA
ACTGAGCAAA AACCCTATGT GGTCTATGGA TCATATGTCA AGCGCTGGAG TGAATTAGCC
TATCACCAAG CCGAGCAACT TGTGCCCGCC CCCAGCAAAT TCGTGGCCCC GCCAAGCGAT
TTGGCGAGTT TGCCAATTCC AAGCTTGGCT GATTTGGGCT TTGAGCTACA ACAAACGATT
CCACAGGTTG GCGAAACAAT TGCCCAACAA CGTTTGGCGC AGTTTTTTGA TCGGCGGCAG
AAACTTTCGG TACTCAAGTA TACCAAAGCC CGCGAAGTGC CTGCCGAGGC CGGAACCTCG
CAGCTTTCAG TTGATTTGCG CATGGGCACG ATTTCGATTC GCCAATGTTT GAAACAGGCT
GTCGATCTGC TGACCGAGCC ATTAAACGCT GAGCAACGTC AAGGAGTCGA TACTTGGCTC
AAAGAATTGG CTTGGCGCGA TTACTACACC CAATTGATCT ACCACAACCC ATATATGCTC
AACGGCTCGC TCGATCCACG CTACGATCAG ATCATTTGGC GCAACGATCC AAGTGAGTTT
TTGGCGTGGC AACAGGGCCA AACTGGGTAT CCAATTGTTG ATGCAGGCCA GCGCCAGCTC
AACCAAATGG CGTGGATGCA TAATCGAGTG CGCATGATCA GCGCCTCATT TTTGATCAAA
GATTTGCTGA TCGATTGGCG TTGGGGTGAG CGCTATTTTA TGCAGCAGTT ATGTGATGGC
GACCCGACCG CCAATAACGG CGGTTGGCAG TGGGCAGCAG GTTCAAGTGG GCCATCAGCC
CAACCCTATT TTCGCATCTT CAACCCAATT GCCCAGAGCA AAAAGCACGA CCCAGACGGC
CAGTATATTC GGCGATTTGT GCCCGAATTA GCTAACGTGC CCGATCACTA TATTCACGAG
CCATGGACCA TGCCGCCAGC CGTGCAAGCA CATGTTGGCT GCGTGATTGG GCGCGATTAT
CCTGCGCCGC TAGTTGAGCA TAGTTTTGCC CGTGAACGCG CCTTGGCAGC CTATCGCACA
GCCCTGCAAA CCAATGATTA G
 
Protein sequence
MPVICWFRRD LRLTDHRALY AAAEASAGAV IPVFILDDTI LHDGYVGAAL IAVTLAMLEA 
LDHDLQQRGS RLIVRHGQPL AELQRLVSET QASGVYWNRD YLPYAIKRDS AVKHWLREQG
LQAHSFHDSV LVEPEGLKTK TEQKPYVVYG SYVKRWSELA YHQAEQLVPA PSKFVAPPSD
LASLPIPSLA DLGFELQQTI PQVGETIAQQ RLAQFFDRRQ KLSVLKYTKA REVPAEAGTS
QLSVDLRMGT ISIRQCLKQA VDLLTEPLNA EQRQGVDTWL KELAWRDYYT QLIYHNPYML
NGSLDPRYDQ IIWRNDPSEF LAWQQGQTGY PIVDAGQRQL NQMAWMHNRV RMISASFLIK
DLLIDWRWGE RYFMQQLCDG DPTANNGGWQ WAAGSSGPSA QPYFRIFNPI AQSKKHDPDG
QYIRRFVPEL ANVPDHYIHE PWTMPPAVQA HVGCVIGRDY PAPLVEHSFA RERALAAYRT
ALQTND
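
Both sequences are printed in space-separated blocks of ten characters, so the whitespace has to be removed before they are used. The following is a minimal sketch, not part of the original record, of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code; alternative start codons are not handled.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino-acid
# assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
print(translate("ATGCCTGTGA TTTGTTGGTT CCGCCGCGAT"))  # prints "MPVICWFRRD"
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings agree; `gc_percent` on the full gene sequence can be compared with the GC content field in the same way.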