Gene Ndas_3549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3549 
Symbol 
ID9247418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4258857 
End bp4259870 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content74% 
IMG OID 
Producttranscriptional regulator, AraC family 
Protein accessionYP_003681456 
Protein GI297562482 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.757715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.622262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA ATCAGGCCAT GAGGATTTTC TCGCGGCCCG GCCGCCACCA CGTGGCCGTC 
CTCGTCCGGC ACGGCCTGCT GCCCATCGAG GCGGGCATCG TCCACCGCCT GTTCGGCCAG
GCCCGGAGCG CCGACGGGGA GCTGCTCTAC GAAGTCGTCA CCTGCGCGCT GGAACCGGGG
GAGATCAGCA CCGACACCGA CTTCACGATC AACGTGGCCC ACGGACCGGA GGCCCTCGAC
GAGGCGGACA CGGTGATCCT GCCCGCCGCC GACGAGGACT ACGGCGAACG CCCGCACGCC
CCCCTCGCGC CGGCTCTGGC CGCGGCCGTC GCGCGCATCC CGCCGAACGC GCGCGTGGCC
TCGATCTGCA CCGGCGCGTT CGTGCTCGCC GCGGCCGGGC TCCTGGACGG GTGCCGCGTG
ACCACCCACT GGAAGTCCGC GGGCTACTTC CGCGCCATGT ACCCCGGCAT CGACCTGGAC
CCGGACGTGC TGTACACCGA CAACGGGCGT GTGCTGACAG CTGCCGGGGT CGCCTCGGGC
ATCGACCTGG GCCTGCACAT GATCCGGCTC GACCACGGCG CCGCCGTGGC CAACGAGGTG
GCGCGCAGCA CCGTCGTCCC GCCCCACCGC GACGGCGGCC AGGCCCAGTA CATCCGCCGT
CCCGTGCCCG CGCCGGAGCG TGCCGCGACG GGTCGGGCGC GCGCCTGGGC CCTGGAACAC
CTGCACCTCC AGCCGGGCCT GCGGGAGATG GCCCTCCAGG AGGCCGTGAG CGTGCGTACC
CTCACCCGCC GTTTCCGCGA CGAGGTGGGG CTCTCGCCCG GCCAGTGGGT CGCCCGGCAG
CGCCTGGACC GGGCCCGGCA GCTGCTGGAG GAGTCGGACC TGCCGGTGGA CAGGGTCGCG
CACGAGGCGG GTTTCGGCAC GGCGGCGTCG CTGCGCCAGC ACATGCACGC CGAACTGGGC
GTGTCCCCCA GCGCCTACCG GCGCACCTTC CGGGGCGCAC CGGACCCGGC CTGA
 
Protein sequence
MRKNQAMRIF SRPGRHHVAV LVRHGLLPIE AGIVHRLFGQ ARSADGELLY EVVTCALEPG 
EISTDTDFTI NVAHGPEALD EADTVILPAA DEDYGERPHA PLAPALAAAV ARIPPNARVA
SICTGAFVLA AAGLLDGCRV TTHWKSAGYF RAMYPGIDLD PDVLYTDNGR VLTAAGVASG
IDLGLHMIRL DHGAAVANEV ARSTVVPPHR DGGQAQYIRR PVPAPERAAT GRARAWALEH
LHLQPGLREM ALQEAVSVRT LTRRFRDEVG LSPGQWVARQ RLDRARQLLE ESDLPVDRVA
HEAGFGTAAS LRQHMHAELG VSPSAYRRTF RGAPDPA