Gene Ndas_3662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3662 
Symbol 
ID9247531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4395727 
End bp4397172 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content78% 
IMG OID 
Producttranscriptional regulator, PucR family 
Protein accessionYP_003681566 
Protein GI297562592 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0446387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACCGA CCCTGGGGGC CCTGATGCGC ACGCCCCGCC TGCGCCTGAA CCTGCTCACC 
GGCCAGGAGA ACCTGGACCG GAGGGTGGAG TGGGTCGCGG TCAGCGAGCT GGAGGACCCC
ACGCCCTACC TGGCGGGCGG CGAACTCCTG CTCACCACCG GCGTGCGCTG GGCCACCGGC
GTCCCCGACC TGCGCGACTA CACGCGCCGC CTGGCCGAGC GCCGCGTCAC CGCGCTGGGC
TTCGCGGTCG GCGTGGTCCT GGAGCGCACC CCCGAGCCGC TCCGCGAGGC CGCCGCCGAG
TTCGGCCTCA CCCTGCTGGA GGTGGCCAGG GAGACCCCCT TCATCGCGAT CGGCAAGGAG
GTGTCCCGGC TCCTGGCCAA GGAGGAGTAC GAGGGGCTGA GCCGGGCCTT CGCCGCCCAG
CGCGACCTCA CCCGCGCCGC GCTCACCGGG GAGGCGGCGA TCGTGGACCG GCTGGCCCGC
GAACTCGGCG CCTGGGTGCT GCTGCTGTCC GCCGACGGCG CCCCCCGGCA CGCCGCCCCC
GCCGGGGCCT CCCCCCGCGC CGCCGGACTC GCCGAGGAGC TGGACCGGCT GCGCGAGGCG
GGCGTGCGCG CCAGCGTCTC CCTCACCTCG GGCGGCGAAC ACGTCTCCGT GCAGCCCCTG
GCCACCGGCC GCCGGGTCCG CGGCTTCCTT GCCGTGGGCA CCGGCGGGCG CCTCGGCTCG
GACGAGCGCA CGCTGGTCAA CGCGGCGGTG TCGCTGCTCT CCCTCGAACT GGAGCGCACC
GCGCATGACG CCACGGCCCG GGTGCGCGAG GGGGTGCTCG CGGCGCTGCT CACCGGGGCG
CTGGACCCCC TGCACCCGGG GGCGGAGCGG CTGCGCGGGG TCCTTCCCGC CGGACCCGTC
CTCGTGGCGG CCGCCGACGG GGTCGGGCCC GCACAGCCGC CCGAGGGCGT CCTGGTCACC
GAGCACGACG GCCGCGTCCT GCTGCTGGCC CCCGCCGACA CCGGAACGGG AGTGCTCGCC
GAGGTCCTGG AGGGGCCGGT CGGGGTGAGC GACCCCTCCC CCTACGCGGA ACTGTCCGCG
GCGCTGTCCC AGGCCGAACG CGCGCTGGCC GCCGCGCGCG ACGCGGGCGG GGGCCTGCTC
CGCGTCGGCG ACCTGCCCGG CGGACTGCTC GGGCTGGCCG ACACCCCCGC CGGCGCCCGC
ATGGCCGGGG ACCTGCTCGC CCCGCTGCTG CGCCAGCGCA CCTCGGCCGA ACTGCTGGCC
TCGCTGCGCG CCTACCTCGC GGCCTCGGGC CGGTGGGACG CGGCGTCGGA GGCACTGGGG
ATCCACCGGC ACACGCTGCG CTACCGCATG CGGCGCATCC GCGACCTGCT GCCCGGCGAC
CTGGACGATC CCGACTACCG CACCGAGCTG TGGATCGCCC TGCGCGTCCA CGGCAGCGCG
GGCTGA
 
Protein sequence
MPPTLGALMR TPRLRLNLLT GQENLDRRVE WVAVSELEDP TPYLAGGELL LTTGVRWATG 
VPDLRDYTRR LAERRVTALG FAVGVVLERT PEPLREAAAE FGLTLLEVAR ETPFIAIGKE
VSRLLAKEEY EGLSRAFAAQ RDLTRAALTG EAAIVDRLAR ELGAWVLLLS ADGAPRHAAP
AGASPRAAGL AEELDRLREA GVRASVSLTS GGEHVSVQPL ATGRRVRGFL AVGTGGRLGS
DERTLVNAAV SLLSLELERT AHDATARVRE GVLAALLTGA LDPLHPGAER LRGVLPAGPV
LVAAADGVGP AQPPEGVLVT EHDGRVLLLA PADTGTGVLA EVLEGPVGVS DPSPYAELSA
ALSQAERALA AARDAGGGLL RVGDLPGGLL GLADTPAGAR MAGDLLAPLL RQRTSAELLA
SLRAYLAASG RWDAASEALG IHRHTLRYRM RRIRDLLPGD LDDPDYRTEL WIALRVHGSA
G