Gene Krad_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKrad_4047 
Symbol 
ID5336853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKineococcus radiotolerans SRS30216 
KingdomBacteria 
Replicon accessionNC_009664 
Strand
Start bp2838654 
End bp2839769 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content79% 
IMG OID 
ProductDeoxyribodipyrimidine photolyase-like 
Protein accessionYP_001363774 
Protein GI152967990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00445605 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.106433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCCT CGCCCGTGCG CGGCGGGCAG GCCGCCGCCG ACGCCGCCCT CGCCGCGCTG 
GACCTCACCG GCTACGCCGC CCGGCGCAGC GAGGTCTGGC CGCCCGAGCG CCGCGGCGCG
ACCCGGCTCT CGCCCTACGT GCGCCACGGG CTGCTCCCGC TGCCGACGGT GTGGGCCGCG
GCGGGGGACG CCCCCGCCCG CGACCGGGCG AAGTTCCGCG ACGAGCTGCT GTGGCAGGAG
TACGCCCGCC ACCTCTACGC CCGCCTCGGC CCCGCGACGG CCCGCCCGCT GCGCTTCGCC
GCCCCCGTGC CCGCGCAGCC GTGGGAGCGC GAACCGTGGC CGCAGGACAT GGCCTGCGTC
GCCACCACCA CCGCGGAGCT GCACGAGGAG GGCTGGCTGG TCAACCAGAC CCGGATGTGG
CTGGCCTCGC AGTACTCCGT GCGGGCCGGG GCGGACTGGC GCGAGGGCGC GCGGGAGATG
TACCGCCACC TCCTCGACGG CTCCCCCGCC GCGAACCGCC TCGGCTGGCA GTGGGCCGTG
GGCACCGGGA CGGGGAAGGT CTACGGCTTC AGCCGGTGGC AGGTCGAGAA GCGCGCCCCG
GGCCTGTGCG GCACGTGCGC GCTGCGGCGG GCCTGCCCGG TCCAGGACTG GCCGGAGACG
GAGGCCGGTC CCCGCGTGGA GCCCCCCGAG GGCCTCGCCG GCGGACCCAC CGACGCCGGC
CCCCGCACCC CCGAGGTCAC CGGCGAGCCC GACGCGGTGT GGCTCACCGC GGAGTCCCTC
GCCGACACCG ACCCCGCGCT CGCCGCCCAC CCCGGCGTCC CCGCGGTGTT CGTGTTCGAC
GAGCCGCTGC TGGCCCGGCT GCGGCTGTCC GGCAAGCGCC TGGTGTTCCT CGCCGAGACC
CTCGCCGAGC TCGGCTGCGA GGTCCGCCTC GGCGATCCCG TCGCGGAGCT CGCCGGGCGC
CGCCTCGCCG TCACCCACGC CCCCGTCCCC GGTTTCGCCC GCCGCGCGGC CCGCCTCGAC
GTCGTCGCCC GCCACCCCTG GCCCTGGCTG CGCCGCCCGG GCTCGGGGTC GCTGCGCTCC
TTCAGCGCGT GGGAGCGCTC GGCGCCCAGG CGCTGA
 
Protein sequence
MPASPVRGGQ AAADAALAAL DLTGYAARRS EVWPPERRGA TRLSPYVRHG LLPLPTVWAA 
AGDAPARDRA KFRDELLWQE YARHLYARLG PATARPLRFA APVPAQPWER EPWPQDMACV
ATTTAELHEE GWLVNQTRMW LASQYSVRAG ADWREGAREM YRHLLDGSPA ANRLGWQWAV
GTGTGKVYGF SRWQVEKRAP GLCGTCALRR ACPVQDWPET EAGPRVEPPE GLAGGPTDAG
PRTPEVTGEP DAVWLTAESL ADTDPALAAH PGVPAVFVFD EPLLARLRLS GKRLVFLAET
LAELGCEVRL GDPVAELAGR RLAVTHAPVP GFARRAARLD VVARHPWPWL RRPGSGSLRS
FSAWERSAPR R