Gene Ndas_2662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2662 
Symbol 
ID9246513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3170537 
End bp3171667 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content74% 
IMG OID 
Productputative transcriptional regulator, PucR family 
Protein accessionYP_003680585 
Protein GI297561611 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.384645 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.292926 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAAAA CAGCGCCGTC GGACGTCATC CCCCGAGCCG CACAGGCGTG CCTGGAGGAG 
CTGGAGACCG TCGCCTGGAC CTACGTGCGC AGGGTCCGCG AGCTGACCGG GTACGCCGAG
ACCGTCATCG ACGACGCCGA GCTGTACGGC ACGGCCCGGG CCACGCTCGA ACTCCTCCTG
GAACTCCTGC GGGGACGGGA CCGCCACGGC GAGCTGCGCG CCCACTCGAT CGAGGTCGGA
CGCTCCCGCG CCCGCCGCGG CATCCCCCTG GAGTCGCTGC TGCGGGCCGT GCGCATGGAC
TTCCGCTTCC TCTGGGAGGC GATGCGCGCC CACGTCGCCG AGGCCGACTT CCGCGACTTC
TCCGAGGAGG TCATCTCCAT CTGGGAGGCG GTGGAGGTGC ACACCACCCA CGTGCAGACC
GGCTACACCG ACGAGATCGC CCGGATGCGC GCGGAACTGG AGCTGGAGCA CGCCTTCCTC
CTGCGGCACC TGCTGAGCGG TTCCGGCGGG GACCCGCGCC TGAACGAGCA GGCGGCGCGG
GCGCTGGGCC TGCGCCCCGA CGGCGTCCAC CTCGTCCTGG TGGCCAACGG CAACCACGCC
CGCGACTTCC GGGGCTGGGT CGGCCAGACG TTTCCCCGGG CCGTCCTGCT CCGCCTGGAC
GGGGTGGAGT TCGCCATCGT CCCCGCGGGG GACGCCGCGG GGGCCGCCCG GGAGACGCTG
CTGGGCAAAC CCGTCGGGAT CTCCCCGAGC GCCCACGGCA TCGGGGAGAT AGCCCCCATG
TGGCGGCTCG CCCGCGAGCT GGCGGAGTGG GCGCGGCCCG GCGCGGCGGC CACGGTCGAG
GGCCACTGGA CCCGGCTGGC GGGCGCGCGC CTGGGCCCGG CCGCCGGGGC CTTCGCCCGC
GACGTGCGGG AGGCGCTCGG CGGGTTCACC GAACGCGAGG TCGATCTGCA GGTCGAGACG
GTCGAGGCCT ACTACCGGAC CGGTTCGGTG ACCGAGGTCG CGCAGGCGAT GTTCTGCCAC
CGCAACACGG TGATCAACCG CCTGCGCCGC TTCGCCGAGG CCACCGGGCT CGACGTCACC
AGGCCGGTCG ACGCGTCCGC GGCCCATCTC GCCCTGACCG TCCTGCGCTA G
 
Protein sequence
MHKTAPSDVI PRAAQACLEE LETVAWTYVR RVRELTGYAE TVIDDAELYG TARATLELLL 
ELLRGRDRHG ELRAHSIEVG RSRARRGIPL ESLLRAVRMD FRFLWEAMRA HVAEADFRDF
SEEVISIWEA VEVHTTHVQT GYTDEIARMR AELELEHAFL LRHLLSGSGG DPRLNEQAAR
ALGLRPDGVH LVLVANGNHA RDFRGWVGQT FPRAVLLRLD GVEFAIVPAG DAAGAARETL
LGKPVGISPS AHGIGEIAPM WRLARELAEW ARPGAAATVE GHWTRLAGAR LGPAAGAFAR
DVREALGGFT EREVDLQVET VEAYYRTGSV TEVAQAMFCH RNTVINRLRR FAEATGLDVT
RPVDASAAHL ALTVLR