Gene Ndas_1430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1430 
Symbol 
ID9245280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1751136 
End bp1752608 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content69% 
IMG OID 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003679368 
Protein GI297560394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.236575 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTACT TTGGTGCTGA AGAAATACCG TCTTTGACAG GTGACCTGGA TCTCGTCACC 
TTTGGTCAGC GGCTCCGCCA CCTACGGCGG GCGCGCGGCC TCACCCTCTC CGACCTGGGC
GAACGCGTCG GCCGGGCCCC CTCCCAGCTC TCCCTGCTGG AGAACGGCAA GCGCGAGCCC
AAGCTCTCCC TCCTCACCTC CCTGGCCTCG GCACTGGGCG TCTCGGTGGA GGAGCTGCTG
TCCAAGCAGC CCCCGAGCCG CCGCGCGCAG CTGGAGATCG CGGTCGAGGA GGCCCAGCGC
GACACGCTCT ACCAGGACCT GAACCTGCCG CACCTGAGGA TCGGCAAGCG GGTCCCCAAC
GACGTGCTCG AACACATCGT GGGCCTGTAC GGGGAGCTCC GGCGGCGCAG CGCCAAGCCG
ACGGCCACCC CCGAGGAGGC CCGCCGGGCC AACGCCGACC TGCGCCGCCA GATGCGCGAG
CGCGGCAACT ACTTCGAGCA CATCGAGAAG GCGGCGGCCC AGACCCTCGA CGCCGTCAAC
TACACCGCGG GGGCCCTGTC ACAGGGGCAG ATCCTCGCCA TCGCCACCCA CCACGGGTTC
TCCCTCAAGT ACGTGCAGGA CCTGCCCCGC TCGGTGCGCT CCCTCACCGA CCACGTCAAC
CGGCGCATCT ACCTCAAGCG CGAGACCACG CTGGGCATGC ACAGCCCGCG CACGATCCTG
CTCCAGACCC TGGGCCACGT CGTGCTGGGC CACGGCAGGC CGCGCGACTT CGGCGACTTC
CTGCGCCAGC GGGTGGAGGC CAACTACTTC GCCGCCGCGG TCCTCATCCC CGAGTCCACC
GCGGTGCGCT ACCTGCAGGA GGCCAAGCAG GCCCGCGACC TGTCGGTGGA GGACCTGCGC
GACGTGTACT CGGTCTCCTA CGAGATGGCC GCGCACCGGT TCACCAACCT GGCCCACCGC
CACCTGGACC TGGTCTGCCA CTTCATCCGC AACGACGAGA CCGGCATCAT CTACAAGGCC
TACGAGAACG ACGGCCTCGT CTTCCCCACC GACGACACCG GGGCCATCGA GGGCCAGCGC
ATGTGCCGCC AGTGGTCGGG GCGCCAGGTC TTCGCCTCGC CGGACCGCTA CTCGATCTAC
TACCAGTACA CGGACAAGCC CAACGGCACC CACTGGTGCG TGGCGCACGT GGACCCCAGC
CGCGAGCGCA ACTTCGCCAT CACCCTGGGC GTTCCCTACA AGGAGTCGCG CTGGTTCCGC
GGGCGCGAGA CCACCAACCG CACCAAGTCC AACTGTCCCA ACGGCGAGTG CTGCGTGCAC
CCGCCCGCGG ACCTGGCCGC GCGCTGGGAG GGCAACGTGT GGCCGTCGGC CCGCGCACAC
TCCCACGTGC TCTCGGCGCT GCCCTCGGGG ACCTTCCCGG GCGTGGACGA GCACGACGTC
TACACCTTCC TGGAGAAGCA CAGCGTCGAC TGA
 
Protein sequence
MAYFGAEEIP SLTGDLDLVT FGQRLRHLRR ARGLTLSDLG ERVGRAPSQL SLLENGKREP 
KLSLLTSLAS ALGVSVEELL SKQPPSRRAQ LEIAVEEAQR DTLYQDLNLP HLRIGKRVPN
DVLEHIVGLY GELRRRSAKP TATPEEARRA NADLRRQMRE RGNYFEHIEK AAAQTLDAVN
YTAGALSQGQ ILAIATHHGF SLKYVQDLPR SVRSLTDHVN RRIYLKRETT LGMHSPRTIL
LQTLGHVVLG HGRPRDFGDF LRQRVEANYF AAAVLIPEST AVRYLQEAKQ ARDLSVEDLR
DVYSVSYEMA AHRFTNLAHR HLDLVCHFIR NDETGIIYKA YENDGLVFPT DDTGAIEGQR
MCRQWSGRQV FASPDRYSIY YQYTDKPNGT HWCVAHVDPS RERNFAITLG VPYKESRWFR
GRETTNRTKS NCPNGECCVH PPADLAARWE GNVWPSARAH SHVLSALPSG TFPGVDEHDV
YTFLEKHSVD