Gene Hore_04990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_04990 
Symbol 
ID7314478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp542198 
End bp543598 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content40% 
IMG OID643610922 
ProductDeoxyribodipyrimidine photo-lyase 
Protein accessionYP_002508252 
Protein GI220931344 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACATA ATAGTAGAAT AAAACCCCTT AATAAAAAGA ATATAAATCC ACGCGGGGAA 
TATATTCTAT ACTGGATGCA GGCCTCCCAG AGAACAGAGT ACAACCATGC CCTGGAGTAT
GCCATCATCG AGGCCAATAA ATCCAATAAA CCACTGCTTG TCTATTTCGG GATTGATACC
TCATTCCCGG AGGCTAATCG ACGCCATTAT CAATTTATGC TGGAAGGGTT ACAGGAAGTA
AAGAAATCCC TCTATAACCG GGGAATAAAA ATGATTATTG AATCCGTTCC CCCCGACAAG
GATATTTTAA AGTTTGCAGA GTATGCCTCT CTCCTGGTAG TAGACAGGGG TTATCTTAAA
ATCGAACGAA CCTGGCGAAA TAATGTGAGC CAACAGATTG ACTGTCCACT GATCCAGGTT
GAAAGCAATG TAATAGTTCC TGTTGAAGTG GCCTCTTCTA AAGAAGAATA TGCTGCCTAT
ACCATCAGAA AAAAACTATA CCGTAAGTTG CCTGAATTCC TCCATCCCTT ACATACCAGG
ACCATCAGGG TAAGCTCCCT TGACCTGAAG CTATCATTTA TAAACTATAA GGATATTCCC
CTTGATAATG TTACCCTGTG CCTTGATAGA TTAAAAGTTG ACAATACTGT ACCGGAAGTT
AACTTATACC GGGGTGGCAC TACCCGTGCT CTGGCTTTAT ATAACGATTT TTTACATAAT
AAAATTAAAG ACTACCATGA ATACCGGAAT GATCCTGTTA AAAACTGGAT TTCCAACATG
AGCCCCTACC TCCATTTTGG ACAGGTCTCA CCCCTGCACC TAATTATTAA GGGGAATAAC
TATTGTAAAA AACATGAAAT AGATAAAGGC TTTAAAGAAT TTTTTGAGGA GCTTGTAATC
AGGAGGGAGC TATCTTTTAA TTTTGTATAT TATAACCCTG ATTATGATTC TATTAAATCT
CTCCCGGACT GGGCTAAAAA AACTCTGAAA GAACATGAAA ATGACACCCG GGAATTTAGC
TATTCACTTC AGGAATTGGA AGATGCTAAA ACCCATGACC CTTACTGGAA TGCTGCCCAG
AAAGAACTTT TACTGACAGG TAAAATCCAT GGGTATATGA GAATGTACTG GGGCAAAAAA
ATACTGGAAT GGACTTCCTC ACCTGACCTT GCCTATAAAT ATGCCCTGTA CTTAAATAAC
AAATATGCCC TTGATGGTCG TGACCCCAAT GGGTTTGCCG GGGTAGCCTG GTGTTTTGGT
AAGCATGACC GTCCCTGGCC CGGGTGTAAT ATATTTGGAA AGGTAAGGTA TATGAGTTCC
GGTGGTCTTA AAAGAAAATT TAAAATAGAC TTATATTTAA AAAGAATACA TAACCTTGAG
GAGGCATCAC ATGTTGGATA A
 
Protein sequence
MIHNSRIKPL NKKNINPRGE YILYWMQASQ RTEYNHALEY AIIEANKSNK PLLVYFGIDT 
SFPEANRRHY QFMLEGLQEV KKSLYNRGIK MIIESVPPDK DILKFAEYAS LLVVDRGYLK
IERTWRNNVS QQIDCPLIQV ESNVIVPVEV ASSKEEYAAY TIRKKLYRKL PEFLHPLHTR
TIRVSSLDLK LSFINYKDIP LDNVTLCLDR LKVDNTVPEV NLYRGGTTRA LALYNDFLHN
KIKDYHEYRN DPVKNWISNM SPYLHFGQVS PLHLIIKGNN YCKKHEIDKG FKEFFEELVI
RRELSFNFVY YNPDYDSIKS LPDWAKKTLK EHENDTREFS YSLQELEDAK THDPYWNAAQ
KELLLTGKIH GYMRMYWGKK ILEWTSSPDL AYKYALYLNN KYALDGRDPN GFAGVAWCFG
KHDRPWPGCN IFGKVRYMSS GGLKRKFKID LYLKRIHNLE EASHVG