Gene Ndas_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1604 
Symbol 
ID9245454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1960599 
End bp1962083 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content76% 
IMG OID 
Producttranscriptional regulator, GntR family with aminotransferase domain 
Protein accessionYP_003679539 
Protein GI297560565 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.172343 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCCGCC GCCTGGCCGA ACCGGCCTTC CACCTCGACC GTTCCGCCGC CCGCCCCCTC 
ACGACGCAGC TGTCCGACGC ACTGCGGGAG GCCATGTCCG GCGGGACGCT CGCACCCGGG
GAACGCCTGC CCTCCAGCCG CGCCCTGGCC GCGCAGCTGG TCGTGAGCAG GACCGTGGTC
ACCGAGGCCT ACCAGCAGCT CTACGCCGAG GGCTGGTTGG AGGGACGCCA CGGCTCGGGC
ACGTTCGTCG CCCAGGACAG CTCACCGCCC TCCCCCTCCC GGCCCTCCGC CGACGGACGC
GGCGCTCGTG AGCACACCAT CAGCGGCGAC GACGGCGGAC ACGGCGCCCC GTCCGCCACC
GCGAGCCCCG CCGTGCGCAC CACCGCCCCG CGCCCCTCCC GGGAACAGGC CCGCAGCCAG
GGCATGATCG ACCTGCGCCC CGGCGCCCCC TGGGTCCGCG ACCACGACCG CGCCGCCTGG
CGCCGGGCAT GGCGGCACGC GGCCGAACAA CCCCTGGACG AGGACCCCGA CCCCCGCGGC
CTGCCCCGGC TGCGCGCACT CCTGGCCGAC CACCTGCGCC GCACCCGCGC AGTACGCATC
GGCCCCGAGA ACGTCATGGT CACCCGCGGC ACCGGCAACG GCCTGGACCT GGTCGGGGCC
GCCCTGCTCG GCGCGGGCAC CCGCGCGGGC GTGGAGGAGC CCGGCTACCA GAAGGCCCAC
ACCATCCTGG CCGCCCGCGG AGCCCGCGTG CTCCCCTGCC CGGTCGACCA CGACGGCATC
CTCACCGAGC ACCTGCCCGA CGACCTGACC CTCGTCCACA CCACACCCGC CCACCAGTAC
CCGCTCGGCG GACGGCTGCC CGTCCCGCGC CGCGAACGCC TCCTGGCCTG GGCCCGCCGG
AACGCGGCGA TGATCGTGGA GGACGACTAC GACGCCGAGT TCCGCTACGA CGTGGCGCCC
CTGCCCGCCC TCTACGGCCT GGGCCCCGAC CGCGTCATCC TGCTCGGCAC CCTGTCCAAG
ACCCTCTCCC CCGACCTGGG CATCGGCTGG ATCGTCGCCG AACCCCCGCT GCTGGAACGC
CTGGCCGCCG TCCGCCACGA CCTGTCGGAC CGCACCAGCG TTCCGGTCCA GGCCGCCACC
GCCCTGCTGC TCGAACGCGG CGACCTCGAC CGGCACCTGC GCCGCATGCG CCTGGAGTAC
GCCCGCCGCC GCGGGCTGCT GATCGACCTC CTCACCACCC GCCCCGTCGG GGACACCGCG
GGCCTGCACG TCCTGCTCCC GTTGCCCGCC GACGCCGTCG CCCCCGTAGT GGCCGAGGCC
GCCGAACGGG GCGTGCTGAT GGACGACACC TCGCGGGCCA GCCACGGCGC CCCGACCGTG
CACGGACTGG TCCTGGGCTA CGGGTCGGCG CACCGCGCCG ACCTGCGCCG CGCCTGCGCG
GTCCTCAACG AGGCCGTCGC CCGCCACACC GGCCACACCG CCTGA
 
Protein sequence
MPRRLAEPAF HLDRSAARPL TTQLSDALRE AMSGGTLAPG ERLPSSRALA AQLVVSRTVV 
TEAYQQLYAE GWLEGRHGSG TFVAQDSSPP SPSRPSADGR GAREHTISGD DGGHGAPSAT
ASPAVRTTAP RPSREQARSQ GMIDLRPGAP WVRDHDRAAW RRAWRHAAEQ PLDEDPDPRG
LPRLRALLAD HLRRTRAVRI GPENVMVTRG TGNGLDLVGA ALLGAGTRAG VEEPGYQKAH
TILAARGARV LPCPVDHDGI LTEHLPDDLT LVHTTPAHQY PLGGRLPVPR RERLLAWARR
NAAMIVEDDY DAEFRYDVAP LPALYGLGPD RVILLGTLSK TLSPDLGIGW IVAEPPLLER
LAAVRHDLSD RTSVPVQAAT ALLLERGDLD RHLRRMRLEY ARRRGLLIDL LTTRPVGDTA
GLHVLLPLPA DAVAPVVAEA AERGVLMDDT SRASHGAPTV HGLVLGYGSA HRADLRRACA
VLNEAVARHT GHTA