Gene Ndas_0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0476 
Symbol 
ID9244315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp573439 
End bp575358 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content78% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003678429 
Protein GI297559455 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCTGA GCGACCCGGC GAGGCGGTGG CACGACACGG CCGGACAGGC CCGGCAGGGC 
GCGTCCGGGC ACACCGCCCG CGGGCTGCGC GTCGGACTGC TCGGGCAGTT CGGCGTCGAG
ATGGACGGGG CCCCGCTGCG CCTGCGCGGG GACAAGCGCC GTGCCCTGCT GGCCACGCTC
CTGCTCAACA CCGGGCGCAC CGTCCCCACC GCGCACCTCA TCGAACGCGT CTGGGGGAGC
CCGGCCTCCC CGTCCGCGCG CAGCGCCCTC CAGGTCCACG TGACCCGCGT GCGCGCGGTC
CTGGACCAGC ACTGCGGCAC CCCGCTCATC ACGGGCGGCG ACGGCGGCTA CCGGGTCGAC
CTCGCCGAGG ACCAGTGCGA CCTGCTGCGC TTTCGCTCCC TGGTCCGCCG CGCCGACGGG
GGCGCCGACC CCTCCGACCG CGCCGACCTG CTCATCAGCG CCCTGCGCCT GTGGCGTGGC
CCGGTCCTGG CCGACATCGT CAGCCCCGTC CTGCACGAGC GCGACATCCC GCCCCTGAAC
GAGGAGCTCC TGCGGGCCGC CGAGGAGGGG TTCGGCGCCG CCCTGGCCCG GGGCGACCAC
GAGCGGGTGG CCGACCAGAT CGGCCCCATC GCCACCGACC ACCCCGAGCG CGAACCCCTC
ATCCGCGTCC AGATGACCGC CCTCTACCGC TGCGGGCGCC CCAGCGAGGC CCTGCGCGTG
TACGCCCGCA CCCGCGACGC GCTGGCCGAG CACCTGGGCG CGGACCCGGG CCGCGAACTC
CAGGAGACCT TCCACGGCAT CCTGCGCGGC GACCTCGACC GCGCCCCGGG CACCAGCGTC
CCGCGCCAGC GCGCCTCCGT CGAGGAGGGC GCCGGGGTCC GGCCCCTCTC CGCGGACACC
GGACCGAACA CCGGGTCAGC CGACGCGCCG GGCACCGAAC CGGAGGCGGG ATCCCACGCC
GGACCCGGGC CGGGCGCCGA TCCCGACGTC CCGGCGGGGG CGGAGGAACG CGACGACCGC
GGCGACGGTC CGGTCCACGC GCCCGCCCCG GTGTCGGCCG CCCTCGCCCC GGCGGAGCTG
CCCGCCGCGC CCTCCGCGCT GCTGGGTCGC GAGGAGGCCC TGGCCGAACT GGACCGGCTG
GTGGACCCCT CCAGCACCGC CCCCGGCAGC GCGCTGGTGC GCGGCCCCGC GGGCGCCGGG
GCCAGCGCCC TCGCCCTGAG CTGGGCCCGC GCGGCCGCGC CGCACTTCCC CGACGGACAG
CTCTACGTCG ACCTCCGCGG CGGCGACGGC AGCCCCCGAG ACCCGGTCGA GGTGCTGCGC
CGTCTGGTGC GCTCCCTGTC CACCGGCACC CGCGGCACCG AGGCCATGGA CGCCGACGAG
GCCGCCGCGC GGGTCCGGAC GCTGTTGGCG CACCGGCGCG TCCTGCTGGT CCTGGACAAC
GCGGCCTCCG TGCGCCAGGT GCGCCCCCTG CTCCCCGGCG GCACCGGGTG CGCGGCGCTC
GTCACCAGCC GCTACTGGCT GACCGACCTG CTGGTGCGCG ACGGTCTGCG CGCCCTGCCC
GTGGGGCCGC TCCCCCCGGA CGCCGCGGTG GACCTGTTGC GCCCGCGGGA GGGGCGGGAC
CGGCGCGCCG AGTCGGTGCT GCGCCGCCTC GCCCAGGTCC TGGGACACCT GCCCCTGGCG
CTGCGCATGG CGGCGGTCTG GCTCGACGAC ACCCGCCCGG ACCGGTCCGC CGCGGAGCTG
GTGCGCCGAC TGGAGGGGGC GGACCCGGCA CGCGGGAGCA CGCCCACGGC CCGGATGGCC
GCCGTGCTGC GTGCCGGGCC CCGGGAGGGA CGGCACGGCC GCGGGGAAGT ACCGGCCGAG
CCCCGTCCTC CCGACACCAG TGTCGAACGG GGATCTTACG TTCGGCTTTC ACCCAGGTGA
 
Protein sequence
MSLSDPARRW HDTAGQARQG ASGHTARGLR VGLLGQFGVE MDGAPLRLRG DKRRALLATL 
LLNTGRTVPT AHLIERVWGS PASPSARSAL QVHVTRVRAV LDQHCGTPLI TGGDGGYRVD
LAEDQCDLLR FRSLVRRADG GADPSDRADL LISALRLWRG PVLADIVSPV LHERDIPPLN
EELLRAAEEG FGAALARGDH ERVADQIGPI ATDHPEREPL IRVQMTALYR CGRPSEALRV
YARTRDALAE HLGADPGREL QETFHGILRG DLDRAPGTSV PRQRASVEEG AGVRPLSADT
GPNTGSADAP GTEPEAGSHA GPGPGADPDV PAGAEERDDR GDGPVHAPAP VSAALAPAEL
PAAPSALLGR EEALAELDRL VDPSSTAPGS ALVRGPAGAG ASALALSWAR AAAPHFPDGQ
LYVDLRGGDG SPRDPVEVLR RLVRSLSTGT RGTEAMDADE AAARVRTLLA HRRVLLVLDN
AASVRQVRPL LPGGTGCAAL VTSRYWLTDL LVRDGLRALP VGPLPPDAAV DLLRPREGRD
RRAESVLRRL AQVLGHLPLA LRMAAVWLDD TRPDRSAAEL VRRLEGADPA RGSTPTARMA
AVLRAGPREG RHGRGEVPAE PRPPDTSVER GSYVRLSPR