Gene Namu_4704 details


Gene Information

Locus tag: Namu_4704
Symbol:
ID: 8450334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5233089
End bp: 5234468
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 73%
IMG OID: 645043744
Product: Deoxyribodipyrimidine photo-lyase
Protein accession: YP_003203969
Protein GI: 258654813
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
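
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 5233089
end_bp = 5234468
gene_length_bp = 1380
protein_length_aa = 459

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True for the values listed here.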


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.940834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCGA CCAGCCTGCT CTGGTTCCGC CGCGACCTGC GGTTGGGCGA CCACCCGGCG 
TTGGCCGCCG CCGCCGACCA CAACCACCGT GTGCTGGGGG TTTTCGTCGC CGACGACGTG
CCCCTGGACG CCTCCGGCTC GCCCCGGCGG GCCGTGCTGG CCCGGACCCT GGCCGCCCTG
GCGCAGGCCA TGGACGGTCG GCTGCTCATC GCCCACGGGC GCCCGCGGTC GGTGCTGCCC
CGGCTGGCCC GGGCGGTGGA CGCGGACGTC GTGCACGTCT CGGCCGATTA CGGGCCGTAC
GGGCGGCGGC GGGACGAGCA GGTGCAGCGG GCCCTGCAGG ATGCCGGCGT CGAGTGGGTG
GCCACCGGCT CGCCCTACGC GGTGGCCCCC GGGCGCGTGC GCAAGAGCAA CGGCGAGCGG
TACGCGGTGT TCACGCCCTT CTACCGGGGC TGGACCGACC ACGGGTGGCG CAAGCCCGCC
CGGTCCGGGT CCGGGGTCGA CTGGGTCGAC CCGGGCGAGG TCGACGGGAT CACTGCCCAC
GACCCGCAGG AGTACGCCCG CACGGTGCCG GCCGGCATGT CGTTGCCGGA GGTGGGCGAG
CAGGCCGCGC TCGATGGCTG GCGCACCTTC CGGGACGAGG CCCTGGACGA CTACGACGGC
GACCGGGACC GCCCGGATCG CCCTGGCACC AGCCACATGT CGGTGTACCT GAAGTGGGGC
TCGATCCATC CGCGCACCCT GCTGGCCGAT CTGGCCGGCC GCCGCTCGAC TGGCGCGGCC
AGCTACCGGC GCGAGCTGGC CTGGCGGGAG TTCTACGCGG ACAGCGTCTT TCATCTGCCC
GAGTCGGTCT GGACCTCGGT GGACCCGGTG ATCGACCGGA TGGCCTGGGA TTCCGGCCAG
CCGGCCGAGG AACGCTTCGA GGCGTGGCGG GCCGGGCGGA CCGGCTACCC GTTCATCGAT
GCCGGGATGC GTCAGCTGCT GGCCGAGGGC TGGATGCACA ACCGCCTGCG GATGGCCACC
GCCTCGTTCC TGATCAAGGA CCTGCACCTG CCCTGGCAGC GCGGCGCCGA GCACTTCCTG
GAGCACCTGG TGGACGGCGA CTACGCGTCG AACAATCACG GCTGGCAGTG GGTGGCCGGA
TCGGGCGCCC AGGCGGCGCC GTTCTTCCGC ATCTTCAACC CGCTCACCCA GGGCGAGAAG
TTCGACCCGT CAGGGGATTT CGTCCGCCGG TACATTCCCG AACTACGGGA CGTGCCGGGT
CGCAAGGTGC ACCGGCCGTG GGAGCTGGAC GGCGGGGTCC CCGCCGGCTA CCCGGAGCCG
ATCGTCGATC ACGCCGACGA GCGGGCCGAG GCCCTGCGCC GCTGGCAGCA GCGCGGCTGA
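
The GC content field can be recomputed from a sequence in this layout. The following is a minimal sketch, assuming the sequence is pasted as plain text with blocks separated by spaces or line breaks; `gc_percent` is a hypothetical helper name, not something the source database provides.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used in the block layout)
    is ignored; case is normalised to upper case.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as shown in the record.
first_row = "ATGACCTCGA CCAGCCTGCT CTGGTTCCGC CGCGACCTGC GGTTGGGCGA CCACCCGGCG"
print(round(gc_percent(first_row), 1))
```

Passing the full 1380 bp sequence instead of the first row gives the value to compare with the GC content field.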
 
Protein sequence
MTSTSLLWFR RDLRLGDHPA LAAAADHNHR VLGVFVADDV PLDASGSPRR AVLARTLAAL 
AQAMDGRLLI AHGRPRSVLP RLARAVDADV VHVSADYGPY GRRRDEQVQR ALQDAGVEWV
ATGSPYAVAP GRVRKSNGER YAVFTPFYRG WTDHGWRKPA RSGSGVDWVD PGEVDGITAH
DPQEYARTVP AGMSLPEVGE QAALDGWRTF RDEALDDYDG DRDRPDRPGT SHMSVYLKWG
SIHPRTLLAD LAGRRSTGAA SYRRELAWRE FYADSVFHLP ESVWTSVDPV IDRMAWDSGQ
PAEERFEAWR AGRTGYPFID AGMRQLLAEG WMHNRLRMAT ASFLIKDLHL PWQRGAEHFL
EHLVDGDYAS NNHGWQWVAG SGAQAAPFFR IFNPLTQGEK FDPSGDFVRR YIPELRDVPG
RKVHRPWELD GGVPAGYPEP IVDHADERAE ALRRWQQRG
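
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is given in coding orientation, and it uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in which codons may act as start codons, which this sketch does not model). `translate` and `CODON_TABLE` are hypothetical names chosen for illustration.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, written in the
# conventional T, C, A, G base order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base block layout can be pasted as is.
    Any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence from the record.
first_row = "ATGACCTCGA CCAGCCTGCT CTGGTTCCGC CGCGACCTGC GGTTGGGCGA CCACCCGGCG"
print(translate(first_row))  # MTSTSLLWFRRDLRLGDHPA
```

The printed residues match the first two blocks of the protein sequence listed in the record (MTSTSLLWFR RDLRLGDHPA).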