Gene Ndas_3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3985 
Symbol 
ID9247856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4766758 
End bp4767774 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003681888 
Protein GI297562914 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.610149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACA CCCCGTCCCG AGACGTCACG ATCTCCGCCA TCGCCGAGGA GGCGGGCGTC 
TCCGCGCCGA CGGTCTCGCG CGTGCTCAAC GGCCGGGGCG ACGTCGCCCC CGCCACGCGC
GAGCGCGTGG AGAGCCTGAT CCGCGCCCAC GGCTACCGGC GCCGCGGAGG TCGGCCGCGG
GAGCGCGTCG GCCTGCTCGA CCTCGTCTTC AACGACCTGG ACAGCCCGTG GGCCGTCGAG
ATCATCCGCG GGGTGGAGGA CGCCGCGCAC GAGAGCGGTA CCGGCATCGT GGTGTCGGCC
ATCCACCGCC GGGTCAGCTC CACCCGCCAG TGGCTGGAGA ACGTGCGCTC GCGCGCCACC
GACGGGGCGA TCCTCGTGAC CACCGACCTG GACCCCGAGC TGCGCGAGGA ACTGCGGGAA
CTCCACGTGC CCGCCGTGGT CGTCGACCCG GTCGGCGTCC CGGACCTGGA CACCCCCACG
GTCGGCGTCA CCAACTGGGC CGGGGGCCTC AGCGCGACCG AGCACCTGAT CCACCTCGGC
CACCGGCGCA TCGCCTTCGT CGCCGGGCGC CCCGAGCTGT GGTGCAGCCG GGCCCGGCTC
GACGGCTACC GGGCGGGCCT GGAGACGGCG GGGCTCGCGG TCGACGACGA GCTGGTCGTG
CCGGGGGAGT TCGGCTACGA GTCCGGCTTC CGGGCGGGGG AGCGGTTGTT CGACCTCGCC
GATCCGCCCA CGGCCGTGTT CGCGGCCAGC GACCAGATGG CGCTGGGCGT CTACGAGGCG
CTGCGCCGCC GCGGCCTGCG GGTGCCCGCC GACGTCAGCG TGGTCGGCTT CGACGACCTG
CCCGAGGCGC GCTGGTCCTC GCCGTCCCTG ACCACCGTGC GCCAGCCGCT GTCGGACATG
GGCAGGCTCG CGGTGCGCAC CGTGCACCGC CTGGTGCAGC GCGAGACCAT CGAGAGCCCG
CGGGTCGAGC TGGCCACCGA GCTCGTCGTG CGCGACAGCA CCGCCCCGCC GCCGTGA
 
Protein sequence
MPDTPSRDVT ISAIAEEAGV SAPTVSRVLN GRGDVAPATR ERVESLIRAH GYRRRGGRPR 
ERVGLLDLVF NDLDSPWAVE IIRGVEDAAH ESGTGIVVSA IHRRVSSTRQ WLENVRSRAT
DGAILVTTDL DPELREELRE LHVPAVVVDP VGVPDLDTPT VGVTNWAGGL SATEHLIHLG
HRRIAFVAGR PELWCSRARL DGYRAGLETA GLAVDDELVV PGEFGYESGF RAGERLFDLA
DPPTAVFAAS DQMALGVYEA LRRRGLRVPA DVSVVGFDDL PEARWSSPSL TTVRQPLSDM
GRLAVRTVHR LVQRETIESP RVELATELVV RDSTAPPP