Gene Ndas_1394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1394 
Symbol 
ID9245244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1709628 
End bp1710809 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content78% 
IMG OID 
Producttranscriptional regulator, CdaR 
Protein accessionYP_003679332 
Protein GI297560358 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.684667 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0014878 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGACACG AACAGCGTGC CCGGGTCGGC GCCGAGCTGG CGCGGGAGCG GGCCGGGCTG 
GTCGAGCGCA TCGTCGCCGA GGTGCACGCC GAGGTCCCGG CCTACCGCGC GCTGCACGGG
TCCCAGCTCA CCGAGGTCCG GGCGATCACC GGGTGGCTGA TGGGCCGCTC ACTGGAGCTG
TGGGCGGCCG GGGCGACGCG GCTGCCGCCG GAGGACGTGG AGCGGCTGCG CGGCATCGGC
CGGTCCCGGG CGGCCGACGG GCGTTCGATC GGCGCGGTGG TGCGCGCGCA CCGGGTGGGC
TCGGCGGCGG CGGTGCGCCT GGTCGCCGAA CTCGCCTCCG ACCGGCTCGA CGCCGCCGAC
GTGTTCGCCC TGGGCGAACT GTGGCTGACC TCGATCGACC AGATCTCCGA GAGCCTGTCC
GCGGGCCACG CCGAGGCCGC GCGCCGCCTG GACGCGGACC TGGAGCGGGC CCGCCGGGCC
TTCCTGGACG ACCTGCTGAT CGGACGGCAG GCCTCGCGCG GGGCCATCCG CGACCGGGCG
CGGACCCTGG GCATCGCCCC GCCCGACCCG GCGGTGCTGG TGGTGGCCGA GGCCGACGGC
GGCCCCTGCG ACGGGGCGCC GCGGTCGGCC GCGCTCGCCG CCGGGATGGA ACTGCTGGGC
CTGGTGGAAC CGGCGGGCGC CGACCCGCTG GTGACCACGC GCTCGGGACG CGTGGTGCTG
CTGGTCCGCC CCGACGACGC CGACCGGGTG GCCGCCGTGC TCGGCGGACG CCCCTGGCGC
GGGTGCGTGC TGGAGCCGCG CGCGCTGACG GACATGTCGG CCGCCTACCG GCTGGCCGAC
GGCGCCCTGG AGACCGCACC CGCGCACGCC TTCGACTCCC GGGGGCTGCT CGGGACCTCC
GACGCGTGCG TGCTGGCACT GCTCAACGGC GGCCCGGTCG CCCCGGCCGC GGTCCGCCGC
ACGGTGCTGG GGCCGCTGCT GGCCGAGGGC AACGCCCACC TGCTGGAGAC GCTGCGGGCC
TACTTGCGCG AGGGCGCGGC GACCACGGCC GCGCAGGCGC TGCACGTGCA CGCCCAGACG
CTGCGCTACC GGCTGCGCCG GGTGCGGGAG CTGACCGGGC ACGACCCGCA CCGGCCCTGG
CAGCGGTTCG TGCTGGAGAC CGCCTGCGCG ATCGCGCCCT GA
 
Protein sequence
MGHEQRARVG AELARERAGL VERIVAEVHA EVPAYRALHG SQLTEVRAIT GWLMGRSLEL 
WAAGATRLPP EDVERLRGIG RSRAADGRSI GAVVRAHRVG SAAAVRLVAE LASDRLDAAD
VFALGELWLT SIDQISESLS AGHAEAARRL DADLERARRA FLDDLLIGRQ ASRGAIRDRA
RTLGIAPPDP AVLVVAEADG GPCDGAPRSA ALAAGMELLG LVEPAGADPL VTTRSGRVVL
LVRPDDADRV AAVLGGRPWR GCVLEPRALT DMSAAYRLAD GALETAPAHA FDSRGLLGTS
DACVLALLNG GPVAPAAVRR TVLGPLLAEG NAHLLETLRA YLREGAATTA AQALHVHAQT
LRYRLRRVRE LTGHDPHRPW QRFVLETACA IAP