Gene Ndas_1742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1742 
Symbol 
ID9245592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2119994 
End bp2121286 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content67% 
IMG OID 
Productputative transcriptional regulator, GntR family 
Protein accessionYP_003679676 
Protein GI297560702 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.554811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCTGG CGCTCTCCGA CATGCACGCA TCACTTGACT CACCGACCAC GGAGTCGATG 
AATTTCCTGA ACGAGATCGG GAACGTCTAC CCCGACGCCA TCTCGTTCGC CGCGGGGCAG
CCGTTCGAGG GCTTCTTCGA CCTCGACGCC GTCCACCACT ACCTGGACGC GTTCCGCGCC
CACCTCGCCG AGGAGCGGGG GCAGAGCGGG GAGCAGGTCC GGCGCACGCT GCTCCAGTAC
GGCAGCGCCA GGGGCATCGT CAACGACCTC ATCTGCCGCA ACCTGGAGAC GGACGAGGGT
ATCCGGGTGG ACCCCCGTGC CGTCGTCGTC ACCTCCGGCT GCCAGGAGGC CCTCTTCCTG
GTGCTGCGCG CCCTGCGCGG GGGCCCGTCG GACGTGGTGC TCGCGGTGCG GCCGAACTAC
TCGGGGCTCG ACGCGGCGGC GCGCCTGGTG GAGATGGGGG TCCACCCGGT CCGGGAGCCG
GCTTCGGGAA TCGACGGTGA GAGCCTCACC GAAGCCGCGG AGCAGGCACG CCGGGAAGGG
CTCAACCCCC GCGCCTGCTA CGTGATCCCG GACTTCGCCA ACCCCACCGG CCGCAGCCTG
TCGGTGGCCG CCCGGCGGAG CCTGCTGGAG AGCGCCGAGG AACAGGGGAT CCTTCTCATC
GAGGACAACC CCTACGGCAT CTTCGGCCCC GAGGAGAGCG GCACCCCCAC CCTGAAGTCC
CTGGACGCGT CGCGGTCCGT GGTCTACCTC GGTTCCTTCG CCAAGTCCGG TATCCCGGGA
GCGAGGGTCG GCTACGTCGT CGCCGACCAG CGTGTGTCGG CGCACGAATC ATCGGACACC
CTTTTCGCCG ACCACCTGGC CAAGGCCAAG GGAATGCTCA ACATCAACAC CTCCCCGATC
ACTCAGGCGG TGATGGGGGG AAAGCTGATC ATGAACGGGT TCAGCCTCCG TTCGGCGAAC
ACCCGGGAGA GGAACGTCTA CCAGGGCAAC CTGTCCCGCC TCCTCCAGGA GATGTCCCGG
AGGTTTCCCG AAGGCGAGGG CCACGGCGTC AGTTGGAACA CCCCGTCGGG CGGATTCTTC
CTGACCCTGA AGGTCCCCTT CCCAGCGAGT GACGAGGCAC TTGGCGTCTG TGCGCGGAAG
CACGGTGTGC TGTGGACCCC CATGCACCAC TTCCACGGCG ACGGAATTCC ACGGAACGAG
ATCAGGCTCT CCTTCAGCCA TCTCACCCAG GACAGGATCG CGCTCGGTGT CGAACGTTTC
GCCTCGTTCG TCACCGACCA CGCCGGCAGC TGA
 
Protein sequence
MDLALSDMHA SLDSPTTESM NFLNEIGNVY PDAISFAAGQ PFEGFFDLDA VHHYLDAFRA 
HLAEERGQSG EQVRRTLLQY GSARGIVNDL ICRNLETDEG IRVDPRAVVV TSGCQEALFL
VLRALRGGPS DVVLAVRPNY SGLDAAARLV EMGVHPVREP ASGIDGESLT EAAEQARREG
LNPRACYVIP DFANPTGRSL SVAARRSLLE SAEEQGILLI EDNPYGIFGP EESGTPTLKS
LDASRSVVYL GSFAKSGIPG ARVGYVVADQ RVSAHESSDT LFADHLAKAK GMLNINTSPI
TQAVMGGKLI MNGFSLRSAN TRERNVYQGN LSRLLQEMSR RFPEGEGHGV SWNTPSGGFF
LTLKVPFPAS DEALGVCARK HGVLWTPMHH FHGDGIPRNE IRLSFSHLTQ DRIALGVERF
ASFVTDHAGS