Gene Ndas_4508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4508 
Symbol 
ID9248388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5344007 
End bp5345503 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content77% 
IMG OID 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003682402 
Protein GI297563428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGA CCGGCCTGAC CGTCAAGGCC CTGCTGAAGG CACTGGACAA GGCCGTCCAC 
GGACTCGCCG CCGCGCCCCG GGGGCTCGGG GGCGTGGTCC GCTCGGTGGT CGTGGCCGGG
GACGGCGCAC CCGTACCCGC CGGTGCGCTG GTCGTCGTCC CGCCCGGGGA GGCCGCCGCC
CTGGCCGCGC GTCCCGGGCA GCTGGCCGCC AGCGCCGTGG CGGCCGTGGA CGGGGACGCC
GACGGGCTGC GCGGTCTGGG CCTGCCCGTG CTCTGCGTGG ACCCCGCCGT GGGCACCGGC
GAGTTCGCCG CCCTGGCCCG GTCGGTCCTG GAGGCGGCCG CCACCGACAC CGGCCACGGC
GACCTGTTCT CCCTGGCCCA GACCGTCGCG ACCCTGACCG GCGGCATCGT CAGCATCGAG
GACGCCGCCG GTCAGGTCCT GGCCTACTCC GCCTCCGGCG AGGGGGCCGA CGAGCTGCGC
AGGCAGACCA TCCTCGGCCG CGGCTGCCCG GAGTCGTTCC TGGCGCACCT GCGCGAGTGG
GGCGTCAACG AGCGGATACG GGACGGCGAG GTGGTCGAGG TCGCCGAACG CCCCGACCTG
GGCGCCGCGC GCCGCCTGGT GGTGGGGATA CAGGCGGGCG AGCGCGCGCT CGGCACCATC
TGGGTGCAGG TGGGCGGCAG GCCCCTCGCG CGGACCAGTC CCCAGGTTCT GCGCGGCGCG
GCGCGCCTGG CCGCGCTGCA CGTCATGCGC GCCCACAGCG AGGCGCGCAG CACCGGGCGC
GACACCGAGG GGCTGGCCGT GGGCCTGCTC ACCGGCGCCT TCGACACCGA CGCCCTCGCC
CGCCACCTGG GCGCCGACCC CACGACGCCG GTGGCGGTGG TCGCGCTCGC CCTTCGGGAG
GGGGAGGTCG ACCCGCCCTG GCGGCTGGAC CAGGCCGCGG AGATCACGTC GGTGTACGCG
GCGGCCTACC GCCGCGAGGC CCTGGTCATC CCGGCCTGCG GCCTGCTGTA CGTGCTGGTC
CCCGCGCCCT CCGGGCAGGC GCCCACCGCG TGGACGCGGG AGCTGGTGTC GGTGCTGCGC
GAGAACCTGG GCACGCCCGT CCAGGCGGCG GTGGCGGGAG TGGCGCCGAG GATGCGCGCG
GTGCCGTCGC TGAAGAAGGT CGCCAGCCGC GCCCTGGAGG TGGTCGCGGA CCGCCCCGAG
CGGCTGGTGA CCGCGTTCGA GGAGGTCCGT TCCTCGGTGG TGCTGCGCGA GCTGTTCGAC
GCCCTGGCCG ACCGCCCCGG GCTGCGCGAC GACCGGCTCG ACGCCCTCGA CGACGAGCAG
CGCCGCTCCC TGTCCGCCTA CCTGGACGCC TTCGGCGACG TGTCCGGGGC TGCTGAGCGG
CTGCACGTGC ACCCCAACAC GCTGCGGCAC CGGATCAGGC GGATCCGCGA GCTCACCGGA
CTCGACCTGG ACGACGCCGA CCAGAGGCTC CTGGCCACGC TCACGCTGCG GGCGTGA
 
Protein sequence
MSETGLTVKA LLKALDKAVH GLAAAPRGLG GVVRSVVVAG DGAPVPAGAL VVVPPGEAAA 
LAARPGQLAA SAVAAVDGDA DGLRGLGLPV LCVDPAVGTG EFAALARSVL EAAATDTGHG
DLFSLAQTVA TLTGGIVSIE DAAGQVLAYS ASGEGADELR RQTILGRGCP ESFLAHLREW
GVNERIRDGE VVEVAERPDL GAARRLVVGI QAGERALGTI WVQVGGRPLA RTSPQVLRGA
ARLAALHVMR AHSEARSTGR DTEGLAVGLL TGAFDTDALA RHLGADPTTP VAVVALALRE
GEVDPPWRLD QAAEITSVYA AAYRREALVI PACGLLYVLV PAPSGQAPTA WTRELVSVLR
ENLGTPVQAA VAGVAPRMRA VPSLKKVASR ALEVVADRPE RLVTAFEEVR SSVVLRELFD
ALADRPGLRD DRLDALDDEQ RRSLSAYLDA FGDVSGAAER LHVHPNTLRH RIRRIRELTG
LDLDDADQRL LATLTLRA