Gene Ndas_3567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3567 
Symbol 
ID9247436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4277991 
End bp4279160 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content79% 
IMG OID 
Producttranscriptional regulator, MerR family 
Protein accessionYP_003681474 
Protein GI297562500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.728715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCTCCC AGCTAGAGTT CTACGCGTGT GGTGCGCGAC ACAAGACGCA CCGCTTTCGC 
AACGCTTGGG GCGCCATGGC CGACGCACTG ACCCCGGGGG CGACCGCGCG TCTGCTCGGG
GTCTCGCCCT CCACGCTGCG CAGCTGGGAC CGGCGCTACG GCGTGGGGCC GCGCGAGCGC
AGTCCGGGCG GGCACCGCCG CTACTCGCCC GCCGACGTGG CGCGTCTGCG CGAGCTGTGC
CGCCTGGTCG GTGAGGGCCT GTCGCCCGCC TCGGCGGCGG AGTGCGTGCT GGTCCCGGCT
CGCGGGGGTC CGGCGCCCGA TCCAGCCCCG CCGCGCGTTC CCCGGCCGCG GGGCGCGGGG
GTGCCACGGG TGGGCGGAGG CGCCGAGGAG GAGGAACCGG GGACCGGGCA GGAGGGTCCG
CGGGACTTCC GGAAGGGCTC GGGAGCCAGC GGGGAGGACG CGGGCAGCGC CCAACCGGGC
GCGAAAGCTC CCGGGAGGGG TGCGCGGCCC CGCTGGCGCC CCGGCGGGGA CACCCTGCCG
CTGGTTCCGG CCGGGCCCAC CCTCCAGGGG ATCGCCCGCG CCGCCATGCG CATGGACGCC
GAACTCGTGG AGCGCCTGCT GGAGGAGGCC CTGGACGAGT ACGGCGTGGT GGCGGCCTGG
GAGGACCTGG CGATGCCGCT GCTGTACGGG ATGGGCCGCA AGTGGGAGGA CACCCGGCGC
TACGTGGAGG TGGAGCACCT TCTGTCGTGG TGCGTGTCCT CGGCGCTGCG CCGCGTGGCC
GCCCCCGGGG ACGCGGACCC GGCGGGCCGC CCCACGGTCC TGGCCTGCGG CCCCGGCCAG
ATGCACAGTC TCCCGATGGA GGCGCTGGCC GCCGCGCTGC GCGAACGGGG CGTGCCGCGC
CGGGTGCTGG GGCCGTGCAC GCCCGTGGTG GCGACGGTGC GGGCGGTGCG CCGCACGGGT
CCGCGCGCGG TGGTCCTGTG GTCCCACGCC GGAGACGCCG ACGACGTCGC GGCGCTGCGG
GCGGCGGTGC GCGCGGCGGC GGGGTCGGCG CAGGCCACGG CCGTGTACAC GGCCGGGCCG
GGCTGGCGGT CGCTGGGCGC GGCGCCGGGG CTGGCCGCCG GGCACCTGGG CTCGCTCACC
GACGCCGTGC GGGCCCTGGC CCCCGGCTGA
 
Protein sequence
MFSQLEFYAC GARHKTHRFR NAWGAMADAL TPGATARLLG VSPSTLRSWD RRYGVGPRER 
SPGGHRRYSP ADVARLRELC RLVGEGLSPA SAAECVLVPA RGGPAPDPAP PRVPRPRGAG
VPRVGGGAEE EEPGTGQEGP RDFRKGSGAS GEDAGSAQPG AKAPGRGARP RWRPGGDTLP
LVPAGPTLQG IARAAMRMDA ELVERLLEEA LDEYGVVAAW EDLAMPLLYG MGRKWEDTRR
YVEVEHLLSW CVSSALRRVA APGDADPAGR PTVLACGPGQ MHSLPMEALA AALRERGVPR
RVLGPCTPVV ATVRAVRRTG PRAVVLWSHA GDADDVAALR AAVRAAAGSA QATAVYTAGP
GWRSLGAAPG LAAGHLGSLT DAVRALAPG