Gene Ndas_0512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0512 
Symbol 
ID9244353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp629995 
End bp631200 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content69% 
IMG OID 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003678465 
Protein GI297559491 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCCA AACCGTTCCG TCTGGTGCTC CGTGAACTCC GGTCCGCGCG TGGTTGGTCG 
CAAGCGCGGT TGGCGGAAGC GCTGTGTGAC GTGTCGGGAC GTCCAACGGT GACCCGACAC
GATGTGTCGC GGTGGGAGCG GGGGAAGCGT GTCCCGCGTG CGTGGCTTCC CTACCTCGCG
GAGGTTCTGG ACGTCTCCCG GGAGACGTTG GAACGGGCTA CGGTCACGGA GCCGGAGCCC
TTTCCGGTTG CGGAAACGCT GGCGTCTCTC TTGCCTCCGG GGGAGGCTGT CGCGCCGCTA
CAGGCCCGAG CAGGCCGGAG GGTGGGGCAG ACGACGGCGG ATGATCTGGC GACCCGTGCG
CACGGTCTGC GGCTTGCCGA CGACGTTCTA GCCGGAGGCG ATTTGATCGG TCCGGCCTTC
CGGGAACTGG ACGCGGCCGT TCGCGTCCTC CGGGAATCGA CGCACACGGA CGAGGTCCGG
CGGGAACTAC TCCGGGCGGT TGGTGAACTC GCGCAGATAG CCGGATGGAT TGCCAGCGAC
GCGGCGGACT CCCGGGCGGA GGGGGCCTAT CGGCTTGGGC TGGACGCGGC ACGGGAAGCC
GGGGACGGCC CGTTGGCCGC GCAGCTTGCC GGGTCTCTCG GCTACCACTT GGTGAACAAC
GGACGTGTTG CCGATGGGGC CGCGCTGTCG GTCGCGGCCG TGGCGGAGGC GGGACCGGAC
GCTCCCGGGA AGACGCGGGC ACTGTTCCAT GACCGGGCCG CGTGGGCCCA TACACAAGCC
GGGGACGCGC AAGCCGCTAT GCGGTCGTTG GGGGCCGCCC ACGAGGCACT AGCGGAGGAC
AGCGGGGACA CTCCGGAGTG GGCTTACTGG GTGAACGAGG CGGAACTAGA GGTCATGGAT
TCCCGCGTCT ACACGGAACT CCGCCGTCCC CTGCGCGCGG TTCCCCTGCT CTCTCGTGTC
CTCCGTGAAT ACCCGGCCAC GTCCACAAGA GAGCGGGCTT TGTATGAATC GTGGCTTGCC
GTGGCCTACG CGGACGCGAA CGAACCAGAA GAGGCCGCAC GCGTCGCGGC CCGCGTGATC
GAACTATCCG GAGACGTGGC ATCCGCGCGG ACATCGGACC GGGTCCGTGT CGTGCTGTCG
CGACTTGCCG ACTTCCCAGA CGTCCCGGAG GTCCGGGAAG TGCTGGACAG CGTCGGTCCG
GCTTGA
 
Protein sequence
MAPKPFRLVL RELRSARGWS QARLAEALCD VSGRPTVTRH DVSRWERGKR VPRAWLPYLA 
EVLDVSRETL ERATVTEPEP FPVAETLASL LPPGEAVAPL QARAGRRVGQ TTADDLATRA
HGLRLADDVL AGGDLIGPAF RELDAAVRVL RESTHTDEVR RELLRAVGEL AQIAGWIASD
AADSRAEGAY RLGLDAAREA GDGPLAAQLA GSLGYHLVNN GRVADGAALS VAAVAEAGPD
APGKTRALFH DRAAWAHTQA GDAQAAMRSL GAAHEALAED SGDTPEWAYW VNEAELEVMD
SRVYTELRRP LRAVPLLSRV LREYPATSTR ERALYESWLA VAYADANEPE EAARVAARVI
ELSGDVASAR TSDRVRVVLS RLADFPDVPE VREVLDSVGP A