Gene Phep_2087 details


Gene Information

Locus tag: Phep_2087
Symbol:
ID: 8253191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2408473
End bp: 2409777
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 37%
IMG OID: 644935735
Product: Deoxyribodipyrimidine photo-lyase
Protein accession: YP_003092354
Protein GI: 255531982
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
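
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (the gene sequence listed on this page ends in TAA). The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Amino acid count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


# Values copied from the record above.
start_bp, end_bp = 2408473, 2409777

length = gene_length(start_bp, end_bp)
protein_len = expected_protein_length(length)
print(length, protein_len)
```

Under these assumptions the results can be compared directly with the Gene Length and Protein Length fields of the record.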


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0527605
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0533895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGTA AAGTTTCGAT TTTTTGGTTT CGCAGAGATT TGCGTTTAGA AGATAATGTG 
GGTTTATATC ATGCCTTATC TTCAGGGTTT CCTGTCCTTC CTATTTTTAT TTTCGATGAA
AATATCTTAG GAAAGCTTGG GGATAAAAAA GACAGAAGAG TGGATTATAT TGATCAGGCA
CTTTTGAAAA TAAATACCCA ACTAAAATTA TCCAAAACAA GGCTGAACAC ATTTCACGGA
AATCCGATTG AAATTTTCAA TATGCTTTCA GAGCAATATG CTGTCCAGGC TGTTTTTTGC
AACAGGGATT ATGAACCGCT AACTATTCAA AGAGATGTGG AAATTTATGA GTTTTTTAAA
CGGAACCAAA TTCCGTTTAA GGCATTTAAA GACCAGGTTA TTTTTGACAA AAGTGATGTT
TTAAAAAATG ATGGGACCCC CTATACGGTT TATACACCTT ATTCAAAAAA ATGGAAAGAG
CTATTGAAGG AAGAACATTA CAGGTCGTAC CATCCTGATT ATAATAATTT TTTCAGGCAA
GAGTTTACCG GAATTCATTC CTTGAACGAT ATTGGTTTTG AAAAAACAGA CATCGCCTTT
GAAACCCCGA AATTGACTAC TACAATCATT GATGAATACG ATAAATACAG AGATTATCCT
GCAATGCAAC GCACCACACA GTTGGGGATT GCCCTTCGGT TTGGCACCAT CAGCATTCGC
AAATGCGTAG CTTTTGGATT GAAACACAAT CAAACCTGGC TGAATGAATT AATTTGGCGG
GAATTTTTTA TGCAAATTTT GTATCATTTC CCTAAAGTGG TCGATCAATC TTTCAAATCG
AAATACGATA ATATCAAATG GCGAAACAAT GAGCATGAAT TTGATCGATG GTGCGAAGGG
AAAACAGGTT ACCCGATTGT AGATGCAGGA ATGAGACAGT TGAACCAAAC AGGTTTTATG
CACAATCGGG TACGGATGAT TGCAGCAAGC TTTTTGTGCA AGCATTTACT GATTGACTGG
CGTTGGGGTG AAGCTTATTT TGCACAAAAG TTGAACGATT ACGATTTGTC GGCCAATAAT
GGTAACTGGC AATGGGCATC AGGTTCGGGT TGCGATTCTG CACCTTATTT CAGGGTGTTT
AACCCAACGC TTCAAACCGA AAAATTCGAT AAAAACTTCG CTTACCTCAA AAAATGGATT
CCCGAGTTCG AAACAGAAAA CTATCCAGAA CCAATCGTGG AACATAGTTT TGCAAGAGAA
AGAGCTTTGA AAACATATGG CAATGCCATC AAAGAAAACG ATTAA
 
Protein sequence
MKSKVSIFWF RRDLRLEDNV GLYHALSSGF PVLPIFIFDE NILGKLGDKK DRRVDYIDQA 
LLKINTQLKL SKTRLNTFHG NPIEIFNMLS EQYAVQAVFC NRDYEPLTIQ RDVEIYEFFK
RNQIPFKAFK DQVIFDKSDV LKNDGTPYTV YTPYSKKWKE LLKEEHYRSY HPDYNNFFRQ
EFTGIHSLND IGFEKTDIAF ETPKLTTTII DEYDKYRDYP AMQRTTQLGI ALRFGTISIR
KCVAFGLKHN QTWLNELIWR EFFMQILYHF PKVVDQSFKS KYDNIKWRNN EHEFDRWCEG
KTGYPIVDAG MRQLNQTGFM HNRVRMIAAS FLCKHLLIDW RWGEAYFAQK LNDYDLSANN
GNWQWASGSG CDSAPYFRVF NPTLQTEKFD KNFAYLKKWI PEFETENYPE PIVEHSFARE
RALKTYGNAI KEND
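
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a simple GC-fraction helper; `clean_sequence` and `gc_fraction` are hypothetical names introduced here for illustration, and the two input strings are the final lines of the gene and protein sequences on this page.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# Final line of the gene sequence and final line of the protein sequence.
last_dna_line = "AGAGCTTTGA AAACATATGG CAATGCCATC AAAGAAAACG ATTAA"
last_protein_line = "RALKTYGNAI KEND"

dna_tail = clean_sequence(last_dna_line)
protein_tail = clean_sequence(last_protein_line)

print(len(dna_tail), dna_tail[-3:])  # excerpt length and its final codon
print(len(protein_tail))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` in the same way gives a whole-gene value that can be compared with the GC content field in the Gene Information section.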