Gene Ndas_1195 details


Gene Information

Locus tag: Ndas_1195
Symbol:
ID: 9245046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1451653
End bp: 1452672
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: transcriptional regulator, LysR family
Protein accession: YP_003679142
Protein GI: 297560168
COG category:
COG ID:
TIGRFAM ID:
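
The start and end coordinates, gene length, and protein length listed in this record agree with each other. The short sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 1451653
end_bp = 1452672

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp. Assumption: the last codon is a stop codon,
# which does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1020 339
```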


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.279757
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.378839
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACATCG CCCAGTTGCG GGACTTCATC GCGGTCATCG ACAGCGGGAG CTTCACCCGC 
GCCGCTTCGG TGCTGTTCGT CTCGCAGCCG GCGGTCAGCC AGCGGATGAA GCAGCTGGAG
AGCGAACTGG GCGTGCGGCT CGTGCAGCGC GGCCCCCGCG GGGTGGTGCC CACACCGGCG
GGGCGGACCC TGTACCGGGA CGCCCAGCAG CTCATCCGCC GGTTCGACCA GATCGCCGAG
GACGTGGCCA AGGAGCCCCG GGCCATCCGC GGACCGGTGG CCGTCGGCCT GCCCACCGCG
GCCGCCGTCC ACCTCGCCCC GGCGCTGTTC TCCTGGACGA AACGGCACTA CCCGGGGGTC
CGCCTGCGGC TGTTCGAGTC GGTGAGCGGA TACATCCAGG AGCTGCTCAC GGTCGGGCGG
ATGGACCTGG CCGTCCTCTA CCGCGACGAC GCGGCGCCCC GGCCGGCCGA GACGCCGCTG
TACTCCGAGG AGCTGTACCT GGTCGGGCGC TCGGACGCCG AGGAGCCACC CCGGGGCGGC
CGGGCCGCCG GGTCCGCCGG GACGGGCGCG GCCGCCGCCG ACACCGCGGC GTACGGGGAC
ATCAGCCTGG CCGACATGCT CCGGGTGCCG CTGGTAGCCC CCGGGGCGCG CAGCAACCTG
CGCGTGCTCA TCGACCGCGT CTTCACCGAA CACGGCGCGG CGCCCGTGAT CGCCGCCGAC
GTGGAGTCCC TGGGCACGAT GGTGCGCATC GCCGAGAGCG GCGAGGCCTG CGCCCTGCTC
CCGCTGTCCA GCGTCGAGGC GCTGCGCAGT ACCCCCGACC TCATGGTGCG GCGGGTCGTG
GACCCCGTGA TCGAACGCCA CATCGCGGTG TGCGCCGGTT CGGACTACTA CGAGCCGCGG
GACGCGGTGT CCGTCGTCCG GCACGGCATC GTGCAGGTGA CGACCCGGCT CGCCGAGCAG
GGGGCCTGGC CGGGCATCCG CCCGGCGGCC CGGACCGAAC CGCGTGCCGG CCGACCCTGA
 
Protein sequence
MDIAQLRDFI AVIDSGSFTR AASVLFVSQP AVSQRMKQLE SELGVRLVQR GPRGVVPTPA 
GRTLYRDAQQ LIRRFDQIAE DVAKEPRAIR GPVAVGLPTA AAVHLAPALF SWTKRHYPGV
RLRLFESVSG YIQELLTVGR MDLAVLYRDD AAPRPAETPL YSEELYLVGR SDAEEPPRGG
RAAGSAGTGA AAADTAAYGD ISLADMLRVP LVAPGARSNL RVLIDRVFTE HGAAPVIAAD
VESLGTMVRI AESGEACALL PLSSVEALRS TPDLMVRRVV DPVIERHIAV CAGSDYYEPR
DAVSVVRHGI VQVTTRLAEQ GAWPGIRPAA RTEPRAGRP
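
The sequences in this record are printed in blocks of 10 characters separated by spaces, so they need to be joined before any analysis. The sketch below is one possible way to do that and to compute a GC percentage like the one reported under GC content. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool, and the sample input is only the first line of the gene sequence, so its result is not expected to match the value for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from this record, used as sample input.
sample = "GTGGACATCG CCCAGTTGCG GGACTTCATC GCGGTCATCG ACAGCGGGAG CTTCACCCGC"
seq = clean_sequence(sample)

# Report the cleaned length and the GC percentage of the sample.
print(len(seq), round(gc_percent(seq), 1))
```

To check the full record, paste all of the gene sequence lines into `sample`. The cleaned length should then equal the listed gene length.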