Gene Ndas_4333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4333 
Symbol 
ID9248208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5167848 
End bp5169155 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content76% 
IMG OID 
ProductDeoxyribodipyrimidine photo-lyase 
Protein accessionYP_003682228 
Protein GI297563254 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.652268 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGTCCG CCACCATCGT CCTGTTCACA CAGGACCTCC GCGTCCACGA CCACCCCGCC 
CTGGCCGCCG CGGTCGAGCG CCCCGGCCCG ACCCTGCCGG TCTTCGTGCT GGACCCCGCG
GTGGCCGGCC GGTCCGTCCG CAACCGAGCC GCCCTGCTCG CCGACGCCCT CGCCGACCTG
AGGGCCGCAC TGCGCGAGCG CGGCGCCGAC CTGGTCCTCC TCCGGGGCGA CACCGTCCGT
GAGGTCGCCG CCCTGGCCCG GGCCAGCGGG GCCGACACCG TCCACGTCAC CGACGGCGTC
AGCTCCCTCG CCGCGCGGCG CGCGCGACGC CTGCGCGCGA CCGGCCTGAC CGTGCGGGCC
TTCCCGGGGC TCACCGTGGT CCCGCCGGGG GAGGTCACCC CGGCGGGCGG CGACCACTAC
AAGGTGTTCA CGCCCTACTG GAAGGCCTGG GAGGCCGCCG CGTGGCGCGA TCCGGTCGCG
CCGCCGGACC GTGTGGTCCT GCCCGAGGGG GTCCGGCCCG GCGAGCTGCC CGGACGCGGG
ACGGACGGTC TGGGCGCGCT CGCTGAGGAC CTGCCGCGGG GAGGGGAGCG CGCGGCCCGC
GAGCGCCTGG ACGCCTGGCT GTCGTCCGGG CTGGCGGCCT ACCCCGACCG GCACGACGAC
CTGCCGGGCG CGGCCACCTC TCACCTGTCC GCCGACCTGC GCCTGGGCTG CCTGTCGCCG
TTGGAGGCCG CCCTGGCCGC CTCACCCCTG GCCGGGGGAG AGGCGTTCGT CCGCCAGCTC
GCCTGGCGCG ACTTCCACCA CCAGGTCACC GCGGCCTTCC CGGCGGTGAA CGTCCGCGAC
TACCGGCCCC GGGACAGGAA ATGGCGCGAG GACCCCGACG CCTTGGACGC CTGGAAGCGG
GGGCGGACCG GCGTTCCGGT CGTGGACGCA GGAATGCGCC AGCTGCTGCG CGAGGGGTTC
GTGCACAACC GGGCGCGCAT GATCACCGCG GCGTTCCTCA CCCGGACCCT GCGCGTGCAC
TGGCGCGAGG GCGCCGACCA CTTCCACGCC CACCTCGTGG ACGGGGACGT GGCCAACAAC
TACGGCAACT GGCAGTGGGT CGCGGGCACG GGCAACGACA CCCGGCCCAA CAGGGCGTTC
AACCCGCTGC GCCAGGCCCG CAGGTTCGAC CCCGAGGGCG TGTACGTCCG CCGCTACCTG
CCCGAACTGG CCGATCTCCC GGGCGGGCGC GCGCACACGC CCTGGCGGGA GGAGCACCCG
CCCCCCGGAT ATCCGGCGCC GATCGCCGAC CCCGGCCACT GGGGCTGA
 
Protein sequence
MASATIVLFT QDLRVHDHPA LAAAVERPGP TLPVFVLDPA VAGRSVRNRA ALLADALADL 
RAALRERGAD LVLLRGDTVR EVAALARASG ADTVHVTDGV SSLAARRARR LRATGLTVRA
FPGLTVVPPG EVTPAGGDHY KVFTPYWKAW EAAAWRDPVA PPDRVVLPEG VRPGELPGRG
TDGLGALAED LPRGGERAAR ERLDAWLSSG LAAYPDRHDD LPGAATSHLS ADLRLGCLSP
LEAALAASPL AGGEAFVRQL AWRDFHHQVT AAFPAVNVRD YRPRDRKWRE DPDALDAWKR
GRTGVPVVDA GMRQLLREGF VHNRARMITA AFLTRTLRVH WREGADHFHA HLVDGDVANN
YGNWQWVAGT GNDTRPNRAF NPLRQARRFD PEGVYVRRYL PELADLPGGR AHTPWREEHP
PPGYPAPIAD PGHWG